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NUCLEIC ACID BASED MODULATORS OF GENE EXPRESSION 
Background of the Invention 

This invention relates to reagents useful as inhibitors of gene expression relating to 
diseases such as cancers, diabetes, obesity, Alzheimer's disease, cardiac diseases, age- 
related diseases, and/or hepatitis B infections and related conditions. 

Summary of the Invention 

The invention features novel nucleic acid-based techniques [e.g., enzymatic nucleic 
acid molecules (ribozymes), antisense nucleic acids, 2-5A antisense chimeras, triplex 
DNA, antisense nucleic acids containing RNA cleaving chemical groups (for example, 
Cook et al., U.S. Patent 5,359,05 1)] and methods for their use to modulate the expression 
of molecular targets impacting the development and progression of cancers, diabetes, 
obesity, Alzheimer's disease, cardiac diseases, age-related diseases, and/or hepatitis B 
infections and related conditions 

In a preferred embodiment, the invention features novel nucleic acid-based 
techniques [e.g., enzymatic nucleic acid molecules (ribozymes), antisense nucleic acids, 2- 
5A antisense chimeras, triplex DNA, antisense nucleic acids containing RNA cleaving 
chemical groups (for exaple, Cook et al., U.S. Patent 5,359,051)] and methods for their use 
for inhibiting the expression of disease related genes, e.g., Protein-Tyrosine-Phosphatase- 
lb (PTP-1B, Genbank accession No. NM_002827), Methionine Aminopeptidase (MetAP- 
2, Genbank accession No. U29607), beta-Secretase (BACE, Genbank accession No. 
AF190725), Presenilin-1 (ps-1, Genbank accession No. L76517), Presenilin-2 (ps-2, 
Genbank accession No. L43964), Human Epidermal Growth Factor Receptor-2 (HER2/c- 
erb2/neu, Genbank accession No. X03363), Phospholamban (PLN, Genbank accession No. 
NM_002667), Telomerase (TERT, Genbank accession No. NM_003219) and Hepatitis B 
virus genes (HBV, Genbank accession No. AF100308. 1). Such ribozymes can be used in 
a method for treatment of diseases caused by the expression of these genes in man and 
other animals, including other primates. 

Thus, in an additional preferred embodiment, the invention features novel nucleic 
acid-based techniques such as enzymatic nucleic acid molecules and antisense molecules 
and methods for their use to down regulate or inhibit the expression of genes encoding 
Protein-Tyrosine-Phosphatase-lb (PTP-1B), Methionine Aminopeptidase (MetAP-2), 



WO 01/16312 PCT/US00/23998 
2 

beta-Secretase (BACE), Presenilin-1 (ps-1), Presenilin-2 (ps-2), Human Epidermal Growth 
Factor Receptor-2 (HER2/c-erb2/neu), Phospholamban (PLN), Telomerase (hTERT) PKC 
alpha, and Hepatitis B (HBV) proteins. In particular, applicant describes the selection and 
function of nucleic acid molecules capable of cleaving RNAs encoded by these genes and 
5 their use to reduce levels of PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, 
and/or HBV proteins in various tissues to treat the diseases discussed herein. Such nucleic 
acid molecules are also useful for diagnostic uses. 

In a preferred embodiment, the invention features the use of one or more of the 
nucleic acid-based techniques independently or in combination to inhibit the expression of 

10 the genes encoding PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or 
HBV. Specifically, the invention features the use of nucleic acid-based techniques to - 
specifically inhibit the expression of PTP- IB, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, 
TERT, PKC alpha, and/or HBV genes. 

In yet another preferred embodiment, the invention features the use of an enzymatic 

1 5 nucleic acid molecule, preferably in the hammerhead, NCH (Inozyme), G-cleaver, 
amberzyme, zinzyme, and/or DNAzyme motif, to inhibit the expression of PTP- IB, 
MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, PKC alpha and/or HBV RNA. 

Applicant indicates that these nucleic acid molecules are able to inhibit expression of 
PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, PKC alpha, and/or HBV 

20 genes. Those of ordinary skill in the art, will find that it is clear from the examples 

described that other nucleic acid molecules that inhibit target PTP-1B, MetAP-2, BACE, 
ps-1, ps-2, HER2, PLN, TERT, and/or HBV encoding mRNAs may be readily designed 
and are within the scope of the invention. 

By "inhibit" it is meant that the activity of target genes or level of mRNAs or 

25 equivalent RNAs encoding target genes is reduced below that observed in the absence of 
the nucleic acid molecules of the instant invention (e.g., enzymatic nucleic acid 
molecules), antisense nucleic acids, 2-5A antisense chimeras, triplex DNA, antisense 
nucleic acids containing RNA cleaving chemical groups). In one embodiment, inhibition 
with an enzymatic nucleic acid molecule preferably is below that level observed in the 

30 presence of an enzymatically attenuated nucleic acid molecule that is able to bind to the 
same site on the mRNA, but is unable to cleave that RNA. In another embodiment, 
inhibition with nucleic acid molecules, including enzymatic nucleic acid and antisense 
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molecules, is preferably greater than that observed in the presence of, for example, an 
oligonucleotide with scrambled sequence or with mismatches. In another embodiment, 
inhibition of target genes with the nucleic acid molecule of the instant invention is greater 
than in the presence of the nucleic acid molecule than in its absence. According to the 
5 invention, the activity of telomerase enzyme or the level of RNA encoding one or more 
portein subunits of the telomerase enzyme is inhibited if it is at least 10% less, 20% less, 
50% less, 75% less or even not active or present at all, in the presence of a nucleic acid of 
the invention relative to the level in the absence of such a nucleic acid. 

By "enzymatic nucleic acid molecule" it is meant a nucleic acid molecule which has 

1 0 complementarity in a substrate binding region to a specified gene target, and also has an 
enzymatic activity which is active to specifically cleave target RNA. That is, the 
enzymatic nucleic acid molecule is able to intermolecularly cleave RNA and thereby 
inactivate a target RNA molecule. These complementary regions allow sufficient 
hybridization of the enzymatic nucleic acid molecule to the target RNA and thus permit 

1 5 cleavage. One hundred percent complementarity is preferred, but complementarity as low 
as 50-75% may also be useful in this invention. The nucleic acids maybe modified at the 
base, sugar, and/or phosphate groups. The term enzymatic nucleic acid is used 
interchangeably with phrases such as ribozymes, catalytic RNA, enzymatic RNA, catalytic 
DNA, aptazyme or aptamer-binding ribozyme, regulatable ribozyme, catalytic 

20 oligonucleotides, nucleozyme, DNAzyme, RNA enzyme, endoribonuclease, endonuclease, 
minizyme, leadzyme, oligozyme or DNA enzyme. All of these terminologies describe 
nucleic acid molecules with enzymatic activity. The specific enzymatic nucleic acid 
molecules described in the instant application are not meant to be limiting and those 
skilled in the art will recognize that all that is important in an enzymatic nucleic acid 

25 molecule of this invention is that it have a specific substrate binding site which is 
complementary to one or more of the target nucleic acid regions, and that it have 
nucleotide sequences within or surrounding that substrate binding site which impart a 
nucleic acid cleaving activity to the molecule (Cech et al., U.S. Patent No. 4,987,071 ; 
Cech et al., 1988, JAMA 260:20 3030-4). 

30 By "nucleic acid molecule" as used herein is meant a molecule having nucleotides. 

The nucleic acid can be single, double, or multiple stranded and may comprise modified or 
unmodified nucleotides or non-nucleotides or various mixtures and combinations thereof. 
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An example of a nucleic acid molecule according to the invention is a gene which encodes 
for a macromolecule such as a protein. 

By "enzymatic portion" or "catalytic domain" is meant that portion/region of the 
enzymatic nucleic acid molecule essential for cleavage of a nucleic acid substrate (for 
5 example see Figures 1-5). 

By "substrate binding arm" or "substrate binding domain" is meant that 
portion/region of a ribozyme which is complementary to (i.e., able to base-pair with) a 
portion of its substrate. Generally, such complementarity is 100%, but can be less if 
desired. For example, as few as 10 bases out of 14 may be base-paired. Such arms are 

1 0 shown generally in Figures 1-5. That is, these arms contain sequences within a ribozyme 
which are intended to bring ribozyme and target RNA together through complementary 
base-pairing interactions. The ribozyme of the invention may have binding arms that are 
contiguous or non-contiguous and may be of varying lengths. The length of the binding 
arm(s) are preferably greater than or equal to four nucleotides and of sufficient length to 

1 5 stably interact with the target RNA; specifically 1 2- 1 00 nucleotides; more specifically 1 4- 
24 nucleotides long. If two binding arms are chosen, the design is such that the length of 
the binding arms are symmetrical (i.e., each of the binding arms is of the same length; e.g., 
five and five nucleotides, six and six nucleotides or seven and seven nucleotides long) or 
asymmetrical (i.e., the binding arms are of different length; e.g., six and three nucleotides; 

20 three and six nucleotides long; four and five nucleotides long; four and six nucleotides 

long; four and seven nucleotides long; and the like). Binding arms can be complementary 
to the specified substrate, to a portion of the indicated substrate, to the indicated substrate 
sequence and additional adjacent sequence, or a portion of the indicated sequence and 
additional adjacent sequence. 

25 By "NCH" or "Inozyme" motif is meant, an enzymatic nucleic acid molecule 

comprising a motif as described in Ludwig et al, USSN No. 09/406,643, filed September 
27, 1999, entitled "COMPOSITIONS HAVING RNA CLEAVING ACTIVITY", and 
International PCT publication Nos. WO 98/58058 and WO 98/58057, all incorporated by 
reference herein in their entirety, including the drawings. 

30 By "G-cleaver" motif is meant, an enzymatic nucleic acid molecule comprising a 

motif as described in Eckstein et al, International PCT publication No. WO 99/16871, 
incorporated by reference herein in its entirety, including the drawings. 
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By "zinzyme" motif is meant, a class II enzymatic nucleic acid molecule comprising 
a motif as described herein and in Beigelman et al, International PCT publication No. WO 
99/55857, incorporated by reference herein in its entirety, including the drawings. 

By "amberzyme" motif is meant, a class I enzymatic nucleic acid molecule 
5 comprising a motif as described herein and in Beigelman et al, International PCT 

publication No. WO 99/55857, incorporated by reference herein in its entirety, including 
the drawings. 

By 'DNAzyme' is meant, an enzymatic nucleic acid molecule lacking a 
ribonucleotide (2'-OH) group. In particular embodiments, the enzymatic nucleic acid 
1 0 molecule may have an attached linker(s) or other attached or associated groups, moieties, 
or chains containing one or more nucleotides with 2'-OH groups. A DNAzyme can be 
synthesized chemically or can be expressed by means of a single stranded DNA vector or 
equivalent thereof. 

By "sufficient length" is meant an oligonucleotide of greater than or equal to 3 

1 5 nucleotides that is of a length great enough to provide the intended function under the 
expected condition. For example, for binding arms of enzymatic nucleic acid "sufficient 
length" means that the binding arm sequence is long enough to provide stable binding to a 
target site under the expected binding conditions. Preferably, the binding arms are not so 
long as to prevent useful turnover. 

20 By "stably interact" is meant, interaction of the oligonucleotides with target nucleic 

acid (e.g., by forming hydrogen bonds with complementary nucleotides in the target under 
physiological conditions). 

By "equivalent" RNA to PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, 
and/or HBV is meant to include those naturally occurring RNA molecules having 

25 homology (partial or complete) to PTP-1B, MetAP-2, BACE, ps-1 , ps-2, HER2, PLN, 
TERT, and/or HBV proteins or encoding for proteins with similar function as PTP-1B, 
MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV in various organisms, 
including human, rodent, primate, rabbit, pig, protozoans, fungi, plants, and other 
microorganisms and parasites. The equivalent RNA sequence also includes in addition to 

30 the coding region, regions such as 5 '-untranslated region, 3 '-untranslated region, introns, 
intron-exon junction and the like in HBV. 
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By "homology" is meant the nucleotide sequence of two or more nucleic acid 
molecules is partially or completely identical. 

By "antisense nucleic acid", it is meant a non-enzymatic nucleic acid molecule that 
binds to target RNA by means of RNA-RNA or RNA-DNA or RNA-PNA (protein nucleic 
5 acid; Egholm et al, 1993 Nature 365, 566) interactions and alters the activity of the target 
RNA (for a review, see Stein and Cheng, 1993 Science 261, 1004 and Woolf et al, US 
patent No. 5,849,902). Typically, antisense molecules will be complementary to a target 
sequence along a single contiguous sequence of the antisense molecule. However, in 
certain embodiments, an antisense molecule may bind to substrate such that the substrate 

1 0 molecule forms a loop, and/or an antisense molecule may bind such that the antisense 
molecule forms a loop. Thus, the antisense molecule may be complementary to two (or 
even more) non-contiguous substrate sequences or two (or even more) non-contiguous 
sequence portions of an antisense molecule maybe complementary to a target sequence or 
both. For a review of current antisense strategies, see Schmajuk et al, 1999, J. Biol. 

1 5 Chem., 274, 21783-21789, Delihas et al, 1997, Nature, 15, 751-753, Stein et al, 1997, 
Antisense N. A. Drug Dev., 7, 151, Crooke, 1998, Biotech. Genet. Eng. Rev., 15, 121-157, 
Crooke, 1997, Ad. Pharmacol, 40, 1-49. In addition, antisense DNA can be used to target 
RNA by means of DNA-RNA interactions, thereby activating RNase H, which digests the 
target RNA in the duplex. Antisense DNA can be synthesized chemically or can be 

20 expressed via the use of a single stranded DNA expression vector or the equivalent 
thereof. 

By "2-5A antisense chimera" it is meant, an antisense oligonucleotide containing a 
5'-phosphorylated 2'-5'-linked adenylate residue. These chimeras bind to target RNA in a 
sequence-specific manner and activate a cellular 2-5A-dependent ribonuclease which, in 
25 turn, cleaves the target RNA (Torrence et al, 1993 Proc. Natl. Acad. Sci. USA 90, 1300). 

By "triplex DNA" it is meant an oligonucleotide that can bind to a double-stranded 
DNA in a sequence-specific manner to form a triple-strand helix. Formation of such triple 
helix structure has been shown to inhibit transcription of the targeted gene (Duval- 
Valentin et al, 1992, Proc. Natl Acad. Sci.USA, 89, 504). 
30 By "gene" it is meant a nucleic acid that encodes a RNA. 
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By "complementarity" is meant that a nucleic acid can form hydrogen bond(s) with 
another RNA sequence by either traditional Watson-Crick or other non-traditional types. 
In reference to the nucleic molecules of the present invention, the binding free energy for a 
nucleic acid molecule with its target or complementary sequence is sufficient to allow the 
5 relevant function of the nucleic acid to proceed, e.g., ribozyme cleavage, antisense or triple 
helix inhibition. Determination of binding free energies for nucleic acid molecules is well 
known in the art (see, e.g., Turner et al., 1987, CSHSymp. Quant. Biol. UL pp.123-133; 
Frier et al., 1986, Proc. Nat. Acad. Sci. USA 83:9373-9377; Turner et al., 1987, J. Am. 
Chem. Soc. 109:3783-3785). A percent complementarity indicates the percentage of 

1 0 contiguous residues in a nucleic acid molecule which can form hydrogen bonds (e.g., 

Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out 
of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). "Perfectly 
complementary" means that all the contiguous residues of a nucleic acid sequence will 
hydrogen bond with the same number of contiguous residues in a second nucleic acid 

1 5 sequence. 

At least seven basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. Table I 
summarizes some of the characteristics of these ribozymes. In general, enzymatic nucleic 

20 acids act by first binding to a target RNA. Such binding occurs through the target binding 
portion of a enzymatic nucleic acid which is held in close proximity to an enzymatic 
portion of the molecule that acts to cleave the target RNA. Thus, the enzymatic nucleic 
acid first recognizes and then binds a target RNA through complementary base-pairing, 
and once bound to the correct site, acts enzymatically to cut the target RNA. Strategic 

25 cleavage of such a target RNA will destroy its ability to direct synthesis of an encoded 
protein. After an enzymatic nucleic acid has bound and cleaved its RNA target, it is 
released from that RNA to search for another target and can repeatedly bind and cleave 
new targets. Thus, a single ribozyme molecule is able to cleave many molecules of target 
RNA. In addition, the ribozyme is a highly specific inhibitor of gene expression, with the 

30 specificity of inhibition depending not only on the base-pairing mechanism of binding to 
the target RNA, but also on the mechanism of target RNA cleavage. Single mismatches, 
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or base-substitutions, near the site of cleavage can completely eliminate catalytic activity 
of aribozyme. 

The enzymatic nucleic acid molecule that cleave the specified sites in PTP-1B, 
MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV-specific RNAs represent a 
5 novel therapeutic approach to treat a variety of pathologic indications, including, HBV 
infection, hepatitis, hepatocellular carcinoma, tumorigenesis, cirrhosis, liver failure, 
cancers including breast, ovarian, prostate, and esophogeal cancer, tumorigenesis, 
retinopathy, arthritis, psoriasis, female reproduction, restinosis, certain infectious diseases, 
transplant rejection and autoimmune disease such as multiple sclerosis, lupus, and AIDS, 

1 0 age related diseases such as macular degeneration and skin ulceration, Alzheimer's 

disease, dementia, diabetes, obesity and any other condition related to the level of PTP-1B, 
MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV in a cell or tissue. 

In one of the preferred embodiments of the inventions described herein, the 
enzymatic nucleic acid molecule is formed in a hammerhead or hairpin motif, but may also 

1 5 be formed in the motif of a hepatitis delta virus, group I intron, group II intron or RNase P 
RNA (in association with an RNA guide sequence), Neurospora VS RNA, DNAzymes, 
NCH cleaving motifs, or G-cleavers. Examples of such hammerhead motifs are described 
by Dreyfus, supra, Rossi et al, 1992, AIDS Research and Human Retroviruses 8, 183. 
Examples of hairpin motifs are described by Hampel et al, EP0360257, Hampel and Tritz, 

20 1989 Biochemistry 28, 4929, Feldstein et al, 1989, Gene 82, 53, Haseloff and Gerlach, 
1989, Gene, 82, 43, Hampel et al, 1990 Nucleic Acids Res. 18, 299; and Chowrira & 
McSwiggen, US. Patent No. 5,631,359. The hepatitis delta virus motif is described by 
Perrotta and Been, 1992 Biochemistry 31,16. The RNase P motif is described by 
Guerrier-Takada et al, 1983 Cell 35, 849; Forster and Airman, 1990, Science 249, 783; 

25 and Li and Altaian, 1 996, Nucleic Acids Res. 24, 835. The Neurospora VS RNA 
ribozyme motif is described by Collins (Saville and Collins, 1990 Cell 61, 685-696; 
Saville and Collins, 1991 Proc. Natl. Acad. Sci. USA 88, 8826-8830; Collins and Olive, 
1993 Biochemistry 32, 2795-2799; and Guo and Collins, 1995, EMBO. J. 14, 363). Group 
H introns are described by Griffin et al, 1995, Chem. Biol. 2, 761; Michels and Pyle, 1995, 

30 Biochemistry 34, 2965 ; and Pyle et al, International PCT Publication No. WO 96/22689. 
The Group I intron is described by Cech et al, U.S. Patent 4,987,071. DNAzymes are 
described by Usman et al, International PCT Publication No. WO 95/1 1304; Chartrand et 
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al, 1995, NAR 23, 4092; Breaker et al., 1995, Chem. Bio. 2, 655; and Santoro et al, 1997, 
PNAS 94, 4262. NCH cleaving motifs are described in Ludwig & Sproat, International 
PCT Publication No. WO 98/58058; and G-cleavers are described in Kore et al, 1998, 
Nucleic Acids Research 26, 41 16-4120 and Eckstein et al, International PCT Publication 
5 No. WO 99/1 6871 . Additional motifs include the Aptazyme (Breaker et al, WO 
98/43993), Amberzyme (Class I motif; Figure 3; Beigelman et al, International PCT 
publication No. WO 99/55857) and Zinzyme (Beigelman et al, International PCT 
publication No. WO 99/55857), all these references are incorporated by reference herein in 
their totalities, including drawings and can also be used in the present invention. These 

1 0 specific motifs are not limiting in the invention and those skilled in the art will recognize 
that all that is important in an enzymatic nucleic acid molecule of this invention is that it 
has a specific substrate binding site which is complementary to one or more of the target 
gene RNA regions, and that it have nucleotide sequences within or surrounding that 
substrate binding site which impart an RNA cleaving activity to the molecule (Cech et al, 

1 5 U.S. Patent No. 4,987,071). 

In preferred embodiments of the present invention, a nucleic acid molecule, e.g., an 
anti sense molecule, a triplex DNA, or a ribozyme, is 13 to 100 nucleotides in length, e.g., 
in specific embodiments 35, 36, 37, or 38 nucleotides in length (e.g., for particular 
ribozymes or antisense). In particular embodiments, the nucleic acid molecule is 15-100, 

20 17-100, 20-100, 21-100, 23-100, 25-100, 27-100, 30-100, 32-100, 35-100, 40-100, 50-100, 
60-100, 70-100, or 80-100 nucleotides in length. Instead of 100 nucleotides being the 
upper limit on the length ranges specified above, the upper limit of the length range can be, 
for example, 30, 40, 50, 60, 70, or 80 nucleotides. Thus, for any of the length ranges, the 
length range for particular embodiments has lower limit as specified, with an upper limit 

25 as specified which is greater than the lower limit. For example, in a particular 

embodiment, the length range can be 35-50 nucleotides in length. All such ranges are 
expressly included. Also in particular embodiments, a nucleic acid molecule can have a 
length which is any of the lengths specified above, for example, 21 nucleotides in length. 
In a preferred embodiment, the invention provides a method for producing a class of 

30 nucleic acid-based gene inhibiting agents which exhibit a high degree of specificity for the 
RNA of a desired target. For example, the enzymatic nucleic acid molecule is preferably 
targeted to a highly conserved sequence region of target RNAs encoding PTP-1B, MetAP- 
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2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV proteins (specifically PTP-1B, 
MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV RNA) such that specific 
treatment of a disease or condition can be provided with either one or several nucleic acid 
molecules of the invention. Such nucleic acid molecules can be delivered exogenously to 
5 specific tissue or cellular targets as required. Alternatively, the nucleic acid molecules 
(e.g., ribozymes and antisense) can be expressed from DNA and/or RNA vectors that are 
delivered to specific cells. 

As used in herein "cell" is used in its usual biological sense, and does not refer to an 
entire multicellular organism, e.g., specifically does not refer to a human. The cell may be 

1 0 present in an organism which may be a human but is preferably a non-human multicellular 
organism, e.g., birds, plants and mammals such as cows, sheep, apes, monkeys, swine, 
dogs, and cats. The cell may be prokaryotic (e.g., bacterial cell) or eukaryotic (e.g., 
mammalian or plant cell). 

By "PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV 

1 5 proteins" is meant, a protein or a mutant protein derivative thereof, comprising sequence 
expressed and/or encoded by PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, 
genes and/or the HBV genome respectively. 

By "highly conserved sequence region" is meant a nucleotide sequence of one or 
more regions in a target gene does not vary significantly from one generation to the other 

20 or from one biological system to the other. 

The enzymatic nucleic acid-based inhibitors of PTP-1B, MetAP-2, BACE, ps-1, ps- 
2, HER2, PLN, TERT, and/or HBV expression are useful for the prevention of the diseases 
and conditions including HBV infection, hepatitis, hepatocellular carcinoma, 
tumorigenesis, cirrhosis, liver failure, cancers including breast, ovarian, prostate, and 

25 esophogeal cancer, tumorigenesis, retinopathy, arthritis, psoriasis, female reproduction, 
restinosis, certain infectious diseases, transplant rejection and autoimmune disease such as 
multiple sclerosis, lupus, and AIDS, age related diseases such as macular degeneration and 
skin ulceration, Alzheimer's disease, dementia, diabetes, obesity and any other condition 
related to the level of PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or 

30 HBV in a cell or tissue, and any other diseases or conditions that are related to the levels 
of PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV in a cell or 
tissue. 
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By "related" is meant that the reduction of PTP-1B, MetAP-2, BACE, ps-1, ps-2, 
HER2, PLN, TERT, and/or HBV expression (specifically PTP-1B, MetAP-2, BACE, ps-1, 
ps-2, HER2, PLN, TERT, and/or HBV genes) RNA levels and thus reduction in the level 
of the respective protein will relieve, to some extent, the symptoms of the disease or 
5 condition. 

The nucleic acid-based inhibitors of the invention are added directly, or can be 
complexed with cationic lipids, packaged within liposomes, or otherwise delivered to 
target cells or tissues. The nucleic acid or nucleic acid complexes can be locally 
administered to relevant tissues ex vivo, or in vivo through injection, infusion pump or 

1 0 stent, with or without their incorporation in biopolymers. In preferred embodiments, the 
enzymatic nucleic acid inhibitors comprise sequences, which are complementary to the 
substrate sequences in Tables 3-31, 33, 34, 36-43, 56, 58, 59, 62, 63. Examples of such 
enzymatic nucleic acid molecules also are shown in Tables 3-29, 31, 33, 34, 37-43, 56, 58, 
59, 62, 63. Examples of such enzymatic nucleic acid molecules consist essentially of 

1 5 sequences defined in these tables. 

In yet another embodiment, the invention features antisense nucleic acid molecules 
including sequences complementary to the substrate sequences shown in Tables 3-31, 33, 
34, 36, 37-43, 56, 58, 59, 62, 63. Such nucleic acid molecules can include sequences as 
shown for the binding arms of the enzymatic nucleic acid molecules in Tables 3-29, 31, 

20 33, 34, 37-43, 56, 58, 59, 62, 63. Similarly, triplex molecules can be provided targeted to 
the corresponding DNA target regions, and containing the DNA equivalent of a target 
sequence or a sequence complementary to the specified target (substrate) sequence. 
Typically, antisense molecules will be complementary to a target sequence along a single 
contiguous sequence of the antisense molecule. However, in certain embodiments, an 

25 antisense molecule may bind to substrate such that the substrate molecule forms a loop, 
and/or an antisense molecule may bind such that the antisense molecule forms a loop. 
Thus, the antisense molecule may be complementary to two (or even more) non- 
contiguous substrate sequences or two (or even more) non-contiguous sequence portions 
of an antisense molecule may be complementary to a target sequence or both. 

30 In another aspect, the invention provides mammalian cells containing one or more 

nucleic acid molecules and/or expression vectors of this invention. The one or more 
nucleic acid molecules may independently be targeted to the same or different sites. 
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By "consists essentially of is meant that the active nucleic acid molecule of the 
invention, for example, an enzymatic nucleic acid molecule, contains an enzymatic center 
or core equivalent to those in the examples, and binding arms able to bind mRNA such 
that cleavage at the target site occurs. Other sequences may be present which do not 
5 interfere with such cleavage. Thus, a core region may, for example, include one or more 
loop or stem-loop structures, which do not prevent enzymatic activity. "X" in the 
sequences in Tables 3, 4, 9, 10, 13, 14, 18, 19, 24, 25, 33, 34, 37, 38, 63 can be such a 
loop. A core sequence for a hammerhead ribozyme can be CUGAUGAG X CGAA where 
X=GCCGUUAGGC or other stem II region as specifically or generally known in the art. 

10 In another aspect of the invention, ribozymes or antisense molecules that interact 

with target RNA molecules and inhibit PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, 
TERT, and/or HBV (specifically PTP- IB, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, 
TERT, and/or HBV RNA) activity are expressed from transcription units inserted into 
DNA or RNA vectors. The recombinant vectors are preferably DNA plasmids or viral 

1 5 vectors. Ribozyme or antisense expressing viral vectors could be constructed based on, 
but not limited to, adeno-associated virus, retrovirus, adenovirus, or alphavirus. 
Preferably, the recombinant vectors capable of expressing the ribozymes or antisense are 
delivered as described above, and persist in target cells. Alternatively, viral vectors may 
be used that provide for transient expression of ribozymes or antisense. Such vectors 

20 might be repeatedly administered as necessary. Once expressed, the ribozymes or 
antisense bind to the target RNA and inhibit its function or expression. Delivery of 
ribozyme or antisense expressing vectors could be systemic, such as by intravenous or 
intramuscular administration, by administration to target cells ex -planted from the patient 
followed by reintroduction into the patient, or by any other means that would allow for 

25 introduction into the desired target cell. Antisense DNA can be expressed via the use of a 
single stranded DNA intracellular expression vector. 

By RNA is meant a molecule comprising at least one ribonucleotide residue. By 
"ribonucleotide" is meant a nucleotide with a hydroxyl group at the 2' position of a 0-D- 
ribo-furanose moiety. 

30 By "vectors" is meant any nucleic acid- and/or viral-based technique used to deliver 

a desired nucleic acid. 
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By "patient" is meant an organism, which is a donor or recipient of explanted cells or 
the cells themselves. "Patient" also refers to an organism to which the nucleic acid 
molecules of the invention can be administered. Preferably, a patient is a mammal or 
mammalian cells. More preferably, a patient is a human or human cells. 
5 The nucleic acid molecules of the instant invention, individually, or in combination 

or in conjunction with other drugs, can be used to treat diseases or conditions discussed 
above. For example, to treat a disease or condition associated with PTP-1B, MetAP-2, 
BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV, the patient may be treated, or other 
appropriate cells maybe treated, as is evident to those skilled in the art, individually or in 

1 0 combination with one or more drugs under conditions suitable for the treatment. 

In a further embodiment, the described molecules, such as anti sense or ribozymes, 
can be used in combination with other known treatments to treat conditions or diseases 
discussed above. For example, the described molecules could be used in combination with 
one or more known therapeutic agents to treat HBV infection, hepatitis, hepatocellular 

1 5 carcinoma, tumorigenesis, cirrhosis, liver failure, cancers including breast, ovarian, 
prostate, and esophogeal cancer, tumorigenesis, retinopathy, arthritis, psoriasis, female 
reproduction, restinosis, certain infectious diseases, transplant rejection and autoimmune 
disease such as multiple sclerosis, lupus, and AIDS, age related diseases such as macular 
degeneration and skin ulceration, Alzheimer's disease, dementia, diabetes, and/or obesity. 

20 In another preferred embodiment, the invention features nucleic acid-based 

inhibitors (e.g., enzymatic nucleic acid molecules (ribozymes), antisense nucleic acids, 
triplex DNA, antisense nucleic acids containing RNA cleaving chemical groups) and 
methods for their use to down regulate or inhibit the expression of RNA (e.g., PTP-1B, 
MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV) capable of progression 

25 and/or maintenance of HBV infection, hepatitis, hepatocellular carcinoma, tumorigenesis, 
cirrhosis, liver failure, cancers including breast, ovarian, prostate, and esophogeal cancer, 
tumorigenesis, retinopathy, arthritis, psoriasis, female reproduction, restinosis, certain 
infectious diseases, transplant rejection and autoimmune disease such as multiple sclerosis, 
lupus, and AIDS, age related diseases such as macular degeneration and skin ulceration, 

30 Alzheimer's disease, dementia, diabetes, and/or obesity. 
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In another preferred embodiment, the invention features nucleic acid-based 
techniques (e.g., enzymatic nucleic acid molecules (ribozymes), antisense nucleic acids, 
triplex DNA, antisense nucleic acids containing RNA cleaving chemical groups) and 
methods for their use to down regulate or inhibit the expression of PTP-1B, MetAP-2, 
BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV RNA expression. 

By "comprising" is meant including, but not limited to, whatever follows the word 
"comprising". Thus, use of the term "comprising" indicates that the listed elements are 
required or mandatory, but that other elements are optional and may or may not be present. 
By "consisting of is meant including, and limited to, whatever follows the phrase 
"consisting of. Thus, the phrase "consisting of indicates that the listed elements are 
required or mandatory, and that no other elements may be present. By "consisting 
essentially of is meant including any elements listed after the phrase, and limited to other 
elements that do not interfere with or contribute to the activity or action specified in the 
disclosure for the listed elements. Thus, the phrase "consisting essentially of indicates 
that the listed elements are required or mandatory, but that other elements are optional and 
may or may not be present depending upon whether or not they affect the activity or action 
of the listed elements. 

Other features and advantages of the invention will be apparent from the following 
description of the preferred embodiments thereof, and from the claims. 

Description Of The Preferred Embodiments 

The drawings will first briefly be described. 



Drawings: 

25 Figure 1 shows the secondary structure model for seven different classes of 

enzymatic nucleic acid molecules. Arrow indicates the site of cleavage. indicate 

the target sequence. Lines interspersed with dots are meant to indicate tertiary interactions. 
- is meant to indicate base-paired interaction. Group I Intron: P1-P9.0 represent various 
stem-loop structures (Cech et al, 1994, Nature Struc. Bio., 1, 273). RNase P (Ml RNA): 

30 EGS represents external guide sequence (Forster et al, 1 990, Science, 249, 783 ; Pace et 
al, 1990, J. Biol Chem., 265, 3587). Group II Intron: 5'SS means 5' splice site; 3'SS 
means 3'-splice site; IBS means intron binding site; EBS means exon binding site (Pyle et 



WO 01/16312 PCT/US00/23998 
15 

al, 1994, Biochemistry, 33, 2716). VS RNA: I-VI are meant to indicate six stem-loop 
structures; shaded regions are meant to indicate tertiary interaction (Collins, International 
PCT Publication No. WO 96/19577). HDV Ribozyme: I-IV are meant to indicate four 
stem-loop structures (Been et al, US Patent No. 5,625,047). Hammerhead Ribozyme: I- 
5 HI are meant to indicate three stem-loop structures; stems I-m can be of any length and 
maybe symmetrical or asymmetrical (Usman et al, 1996, Curr. Op. Struct, Bio., 1, 527). 
Hairpin Ribozyme: Helix 1, 4 and 5 can be of any length; Helix 2 is between 3 and 8 
base-pairs long; Y is a pyrimidine; Helix 2 (H2) is provided with a least 4 base pairs (i.e., 
n is 1, 2, 3 or 4) and helix 5 can be optionally provided of length 2 or more bases 

1 0 (preferably 3 - 20 bases, i.e., m is from 1 - 20 or more). Helix 2 and helix 5 may be 
covalently linked by one or more bases (i.e., r is > 1 base). Helix 1, 4 or 5 may also be 
extended by 2 or more base pairs (e.g., 4-20 base pairs) to stabilize the ribozyme 
structure, and preferably is a protein binding site. In each instance, each N and N' 
independently is any normal or modified base and each dash represents a potential base- 

1 5 pairing interaction. These nucleotides may be modified at the sugar, base or phosphate. 

Complete base-pairing is not required in the helices, but is preferred. Helix 1 and 4 can be 
of any size (i.e., o and p is each independently from 0 to any number, e.g., 20) as long as 
some base-pairing is maintained. Essential bases are shown as specific bases in the 
structure, but those in the art will recognize that one or more may be modified chemically 

20 (abasic, base, sugar and/or phosphate modifications) or replaced with another base without 
significant effect. Helix 4 can be formed from two separate molecules, i.e., without a 
connecting loop. The connecting loop when present may be a ribonucleotide with or 
without modifications to its base, sugar or phosphate, "q" > is 2 bases. The connecting 
loop can also be replaced with a non-nucleotide linker molecule. H refers to bases A, U, 

25 or C. Y refers to pyrimidine bases. " " refers to a covalent bond. (Burke et al, 1996, 

Nucleic Acids & Mol. Biol, 10, 129; Chowrira et al, US Patent No. 5,631,359). 

Figure 2 shows examples of chemically stabilized ribozyme motifs. HH Rz, 
represents hammerhead ribozyme motif (Usman et al, 1996, Curr. Op. Struct. Bio., 1, 
527); NCH Rz represents the NCH ribozyme motif (described herein and in Ludwig & 

30 Sproat, International PCT Publication No. WO 98/58058); G-Cleaver, represents G- 

cleaver ribozyme motif (Kore et al., 1998, Nucleic Acids Research, 26, 41 16-4120). N or 
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n, represent independently a nucleotide which may be same or different and have 
complementarity to each other; rl, represents ribo-Inosine nucleotide; arrow indicates the 
site of cleavage within the target. Position 4 of the HH Rz and the NCH Rz is shown as 
having 2'-C-allyl modification, but those skilled in the art will recognize that this position 
5 can be modified with other modifications well known in the art, so long as such 
modifications do not significantly inhibit the activity of the ribozyme. 

Figure 3 shows an example of the Amberzyme ribozyme motif that is chemically 
stabilized (see, for example, Beigelman et al, International PCT publication No. WO 
99/55857; also referred to as Class I Motif). The Amberzyme motif is a class of enzymatic 
1 0 nucleic acid molecules that do not require the presence of a ribonucleotide (2'-OH) group 
for activity. 

Figure 4 shows an example of the Zinzyme A ribozyme motif that is chemically 
stabilized (see, for example, International PCT publication No. WO 99/55857; also 
referred to as Class A Motif). The Zinzyme motif is a class of enzymatic nucleic acid 
1 5 molecules that do not require the presence of a ribonucleotide (2'-OH) group for activity. 

Figure 5 shows an example of a DNAzyme motif described by Santoro et al., 1997, 
PNAS, 94, 4262. 

Figure 6 is a diagrammatic representation of the hammerhead ribozyme motif 
known in the art and the NCH motif. Stem n can be 2 base-pair long, preferably, 2, 3, 

20 4, 5, 6, 7, 8, and 10 base-pairs long. Each N and N' is independently any base or non- 
nucleotide as used herein; X is adenosine, cytidine or uridine; Stem I-DI are meant to 
indicate three stem- loop structures; stems I-HI can be of any length and may be 
symmetrical or asymmetrical (Usman et al, 1996, Curr. Op. Struct. Bio., 1, 527); arrow 
indicates the site of cleavage in the target RNA; Rz refers to ribozyme; Loop II may be 

25 present or absent. If Loop II is present it is greater than or equal to three nucleotides, 
preferably four nucleotides. The Loop II sequence is preferably 5' -GAAA-3' or 5'- 
GUUA-3'. 

Figure 7 shows examples of chemically stabilized ribozyme motifs. HH Rz, 
represents hammerhead ribozyme motif (Usman et al, 1996, Curr. Op. Struct. Bio., 1, 
30 527); NCH-Inosine Rz represents the NCH ribozyme motif with riboinosine at 15.1 

position; NCH-Xylo Rz represents the NCH ribozyme with xylo inosine at 15.1 position. 
N or n, represent independently a nucleotide which may be same or different and may have 
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complementarity to each other; rl, represents ribo-Inosine nucleotide; xl represent xylo- 
inosine; arrow indicates the site of cleavage within the target. Position 4 of the HH Rz and 
the NCH Rzs is shown as having 2' -C-allyl modification, but those skilled in the art will 
recognize that this position can be modified with other modifications well known in the 
5 art, so long as such modifications do not significantly inhibit the activity of the ribozyme. 

Figure 8 is a graphical representation of data showing inhibition of cell proliferation 
mediated by NCH and HH ribozymes targeted against HER2/new/ErbB2 gene. Untreated, 
refers to cells not treated with ribozymes; HH RZ refers to hammerhead ribozyme; NCX 
RZ refers to the NCH ribozymes of the invention; IA refers to catalytically inactive or 
1 0 attenuated ribozyme used as a control . 

Figure 9 is a schematic diagram of the process for the synthesis of beta-D- 
xylofuranosyl hypoxantine 3'-phosphoramidite. 

Figure 10 displays a schematic representation of NTP synthesis using nucleoside 
substrates. 

1 5 Figure 1 1 shows a scheme for an in vitro selection method. A pool of nucleic acid 

molecules is generated with a random core region and one or more region(s) with a defined 
sequence. These nucleic acid molecules are bound to a column containing immobilized 
oligonucleotide with a defined sequence, where the defined sequence is complementary to 
region(s) of defined sequence of nucleic acid molecules in the pool. Those nucleic acid 

20 molecules capable of cleaving the immobilized oligonucleotide (target) in the column are 
isolated and converted to complementary DNA (cDNA), followed by transcription using 
NTPs to form a new nucleic acid pool. 

Figure 12 shows a scheme for a two column in vitro selection method. A pool of 
nucleic acid molecules is generated with a random core and two flanking regions (region A 

25 and region B) with defined sequences. The pool is passed through a column which has 

immobilized oligonucleotides with regions A' and B' that are complementary to regions A 
and B of the nucleic acid molecules in the pool, respectively. The column is subjected to 
conditions sufficient to facilitate cleavage of the immobilized oligonucleotide target. The 
molecules in the pool that cleave the target (active molecules) have A' region of the target 

30 bound to their A region, whereas the B region is free. The column is washed to isolate the 
active molecules with the bound A' region of the target. This pool of active molecules 
may also contain some molecules that are not active to cleave the target (inactive 
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molecules) but have dissociated from the column. To separate the contaminating inactive 
molecules from the active molecules, the pool is passed through a second column (column 
2) which contains immobilized oligonucleotides with the A' sequence but not the B' 
sequence. The inactive molecules will bind to column 2 but the active molecules will not 
5 bind to column 2 because their A region is occupied by the A' region of the target 

oligonucleotide from column 1. Column 2 is washed to isolate the active molecules for 
further processing as described in the scheme shown in Figure 11. 

Figure 13 is a diagram of a novel 48 nucleotide enzymatic nucleic acid motif which 
was identified using in vitro methods described in the instant invention. The molecule 

1 0 shown is only exemplary. The 5' and 3 ' terminal nucleotides (referring to the nucleotides 
of the substrate binding arms rather than merely the single terminal nucleotide on the 5' 
and 3' ends) can be varied so long as those portions can base-pair with target substrate 
sequence. In addition, the guanosine (G) shown at the cleavage site of the substrate can be 
changed to other nucleotides so long as the change does not eliminate the ability of 

1 5 enzymatic nucleic acid molecules to cleave the target sequence. Substitutions in the 

nucleic acid molecule and/or in the substrate sequence can be readily tested, for example, 
as described herein. 

Figure 14 is a schematic diagram of HCV luciferase assay used to demonstrate 
efficacy of class I enzymatic nucleic acid molecule motif. 
20 Figure 15 is a graph indicating the dose curve of an enzymatic nucleic acid molecule 

targeting site 146 on HCV RNA. 

Figure 16 is a bar graph showing enzymatic nucleic acid molecules targeting 4 sites 
within the HCV RNA are able to reduce RNA levels in cells. 

Figure 17 shows secondary structures and cleavage rates for characterized Class H 
25 enzymatic nucleic acid motifs. 

Figure 18 is a diagram of a novel 35 nucleotide enzymatic nucleic acid motif which 
was identified using in vitro methods described in the instant invention. The molecule 
shown is only exemplary. The 5' and 3' terminal nucleotides (referring to the nucleotides 
of the substrate binding arms rather than merely the single terminal nucleotide on the 5' 
30 and 3' ends) can be varied so long as those portions can base-pair with target substrate 

sequence. In addition, the guanosine (G) shown at the cleavage site of the substrate can be 
changed to other nucleotides so long as the change does not eliminate the ability of 
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enzymatic nucleic acid molecules to cleave the target sequence. Substitutions in the 
nucleic acid molecule and/or in the substrate sequence can be readily tested, for example, 
as described herein. 

Figure 19 is a bar graph showing substrate specificities for Class n (zinzyme) 
5 ribozymes. 

Figure 20 is a bar graph showing Class II enzymatic nucleic acid molecules 
targeting 10 representative sites within the HER2 RNA in a cellular proliferation screen. 

Figure 21 is a synthetic scheme outlining the synthesis of 5-[3- 
aminopropynyl(propyl)]uridine 5 '-triphosphates and 4-imidazoleaceticacid conjugates. 
1 0 Figure 22 is a synthetic scheme outlining the synthesis of 5-[3-(N-4- 

imidazoleacetyl)aminopropynyl(propyl)]uridine 5 '-triphosphates. 

Figure 23 is a synthetic scheme outlining the synthesis of carboxylate tethered 
uridine 5'-triphosphoates. 

Figure 24 is a synthetic scheme outlining the synthesis of 5-(3-aminoalkyl) and 5- 
1 5 [3(N-succinyl)aminopropyl] functionalized cytidines. 

Figure 25 is a diagram of a class I ribozyme stem truncation and loop replacement 
analysis. 

Figure 26 is a diagram of class I ribozymes with truncated stem(s) and/or non- 
nucleotide linkers used in loop structures. 
20 Figure 2 7 is a diagram of "no-ribo" class II ribozymes . 

Figure 28 is a graph showing cleavage reactions with class n ribozymes under 
differing divalent metal concentrations. 

Figure 29 is a diagram of differing class II ribozymes with varying ribo content and 
their relative rates of catalysis. 
25 Figure 30 is a graph showing class II ribozyme (zinzyme) mediated reduction of 

HER2 RNA in SKBR3 breast carcinoma cells. Cells were treated with 100 nm, and 200 
nm of zinzyme (RPI 18656) targeting site 972 of HER2 RNA and a corresponding 
scrambled attenuated control complexed with 2.5 ug/ml of lipid. Active zinzymes and 
scrambled attenuated controls were compared to untreated cells after 24 hours post 
30 treatment. 
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Figure 31 is a graph showing class II ribozyme (zinzyme) mediated dose response 
anti-prolferation assay in SKBR3 breast carcinoma cells. Cells were treated with 100 nm, 
and 200 nm of zinzyme (RPI 18656) targeting site 972 of HER2 RNA and a corresponding 
scrambled attenuated control complexed with 2.0 pg/ml of lipid. Active zinzymes and 
5 scrambled attenuated controls were compared to untreated cells after 24 hours post 
treatment. 

Figure 32 is a graph which shows the dose dependent reduction of HER2 RNA in 
SKOV-3 cells treated with RPI 19293 from 0 to 100 nM with 5.0 |!g/ml of cationic lipid. 

Figure 33 is a graph which shows the dose dependent reduction of HER2 RNA and 
1 0 inhibition of cellular proliferation in SKBR-3 cells treated with RPI 1 9293 from 0 to 400 
nM with 5.0 ug/ml of cationic lipid. 

Figure 34 shows a non-limiting example of the replacement of a 2'-0-methyl 5'- 
CA-3' with a ribo G in the class II (zinzyme) motif. The representative motif shown for 
the purpose of the figure is a "seven-ribo" zinzyme motif, however, the interchangeability 
15 of a G and a CA in the position shown in Figure 25 of the class II (zinzyme) motif extends 
to any combination of 2-O-methyl and ribo residues. For instance, a 2'-0-methyl G can 
replace the 2'-0-methyl 5'-CA-3' and vise versa. 

Figure 35 is a graph which shows a screen of class II ribozymes (zinzymes) 
targeting site 972 of HER2 RNA which contain ribo-G reductions (RPI 19727 = no ribo, 
20 RPI 19728 = one ribo, RPI 19293 = two ribo, RPI 19729 = three ribo, RPI 19730 = four 
ribo, 19731 = five ribo, and RPI 19292 = seven ribo) for anti-proliferative activity in 
SKBR3 cells. 

Figure 36 summarizes the results of functional group modification studies in which 
various nucleoside analogs were tested for activity in the NCH ribozyme motif. K rel 
25 values describe the cleavage values of a given substituent at position 15.1 relative the 
Inosine at position 1 5 . 1 (I- 1 5 . 1 ). 

Figure 37 summarizes reported functional group modification studies performed at 
the A 15.1 residue in the A-15.1 •U-16.1 context of NUH cleaving ribozymes. K rel values 
describe the cleavage values of a given substituent at position 15.1 relative the adenosine 
30 at position 15.1 (A-15.1). 
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Mechanism of action of Nucleic Acid Molecules of the Invention 

Antisense : Antisense molecules may be modified or unmodified RNA, DNA, or 
mixed polymer oligonucleotides and primarily function by specifically binding to 
matching sequences resulting in inhibition of peptide synthesis (Wu-Pong, Nov 1994, 
5 BioPharm, 20-33). The antisense oligonucleotide binds to target RNA by Watson Crick 
base-pairing and blocks gene expression by preventing ribosomal translation of the bound 
sequences either by steric blocking or by activating RNase H enzyme. Antisense 
molecules may also alter protein synthesis by interfering with RNA processing or transport 
from the nucleus into the cytoplasm (Mukhopadhyay & Roth, 1996, Crit. Rev. in 

1 0 Oncogenesis 7, 1 5 1 - 1 90). 

In addition, binding of single stranded DNA to RNA may result in nuclease 
degradation of the heteroduplex (Wu-Pong, supra; Crooke, supra). To date, the only 
backbone modified DNA chemistry which will act as substrates for RNase H are 
phosphorothioates, phosphorodithioates, and borontrifluoridates. Recently, it has been 

1 5 reported that 2'-arabino and 2'-fluoro arabino- containing oligos can also activate RNase 
H activity. 

A number of antisense molecules have been described that utilize novel 
configurations of chemically modified nucleotides, secondary structure, and/or RNase H 
substrate domains (Woolf et al., International PCT Publication No. WO 98/13526; 

20 Thompson et al, International PCT Publication No. WO 99/54459 ; Hartmann et al, 
International PCT Publication No. WO 00/17346) all of these are incorporated by 
reference herein in their entirety. 

Antisense DNA can be used to target RNA by means of DNA-RNA interactions, 
thereby activating RNase H, which digests the target RNA in the duplex. Antisense DNA 

25 can be chemically synthesized or can be expressed via the use of a single stranded DNA 
intracellular expression vector or the equivalent thereof. 

Triplex Forming Oligonucleotides (TFO ): Single stranded DNA may be designed to 
bind to genomic DNA in a sequence specific manner. TFOs are comprised of pyrimidine- 
rich oligonucleotides which bind DNA helices through Hoogsteen Base-pairing (Wu-Pong, 

30 supra). The resulting triple helix composed of the DNA sense, DNA antisense, and TFO 
disrupts RNA synthesis by RNA polymerase. The TFO mechanism may result in gene 
expression or cell death since binding may be irreversible (Mukhopadhyay & Roth, supra) 
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2 '-5' Qligoadenylates : The 2-5 A system is an inter feron-mediated mechanism for 
RNA degradation found in higher vertebrates (Mitra et al, 1996, Proc Nat Acad Sci USA 
93, 6780-6785). Two types of enzymes, 2-5A synthetase and RNase L, are required for 
RNA cleavage. The 2-5A synthetases require double stranded RNA to form 2'-5' 
5 oligoadenylates (2-5A). 2-5A then acts as an allosteric effector for utilizing RNase L 
which has the ability to cleave single stranded RNA. The ability to form 2-5 A structures 
with double stranded RNA makes this system particularly useful for inhibition of viral 
replication. 

(2'-5') oligoadenylate structures may be covalently linked to antisense molecules to 

1 0 form chimeric oligonucleotides capable of RNA cleavage (Torrence, supra). These 
molecules putatively bind and activate a 2-5A dependent RNase, the 
oligonucleotide/enzyme complex then binds to a target RNA molecule which can then be 
cleaved by the RNase enzyme. The covalent attachment of 2'-5' oligoadenylate structures 
is not limited to antisense applications, and can be further elaborated to include attachment 

1 5 to nucleic acid molecules of the instant invention. 

Enzymatic Nucleic Acid : Seven basic varieties of naturally-occurring enzymatic 
RNAs are presently known. In addition, several in vitro selection (evolution) strategies 
(Orgel, 1979, Proc. R. Soc. London, B 205, 435) have been used to evolve new nucleic 
acid catalysts capable of catalyzing cleavage and ligation of phosphodiester linkages 

20 (Joyce, 1989, Gene, 82, 83-87; Beaudry et al, 1992, Science 257, 635-641 ; Joyce, 1992, 
Scientific American 267, 90-97; Breaker et al, 1994, TIBTECH 12, 268; Bartel et al, 1993, 
Science 261:1411-1418; Szostak, 1993, TIBSM, 89-93; Kumar et al, 1995, FASEB J., 9, 
1 183; Breaker, 1996, Curr. Op. Biotech., 7, 442; Santoro et al, 1997, Proc. Natl. Acad. 
Sci., 94, 4262; Tang et al, 1997, RNA 3, 914; Nakamaye & Eckstein, 1994, supra; Long & 

25 Uhlenbeck, 1994, supra; Ishizaka et al., 1995, supra; Vaish et al, 1997, Biochemistry 36, 
6495; all of these are incorporated by reference herein). Each can catalyze a series of 
reactions including the hydrolysis of phosphodiester bonds in trans (and thus can cleave 
other RNA molecules) under physiological conditions. 

In general, enzymatic nucleic acids act by first binding to a target RNA. Such 

30 binding occurs through the target binding portion of an enzymatic nucleic acid which is 
held in close proximity to an enzymatic portion of the molecule that acts to cleave the 
target RNA. Thus, the enzymatic nucleic acid first recognizes and then binds a target 
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RNA through complementary base-pairing, and once bound to the correct site, acts 
enzymatically to cut the target RNA. Strategic cleavage of such a target RNA will destroy 
its ability to direct synthesis of an encoded protein. After an enzymatic nucleic acid has 
bound and cleaved its RNA target, it is released from that RNA to search for another target 
5 and can repeatedly bind and cleave new targets. 

Nucleic acid molecules of this invention will block to some extent PTP-1B, MetAP- 
2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV protein expression and can be used 
to treat disease or diagnose disease associated with the levels of PTP-1B, MetAP-2, 
BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV. 

1 0 The enzymatic nature of a ribozyme has significant advantages, such as the 

concentration of ribozyme necessary to affect a therapeutic treatment is low. This 
advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single 
ribozyme molecule is able to cleave many molecules of target RNA. In addition, the 
ribozyme is a highly specific inhibitor, with the specificity of inhibition depending not 

1 5 only on the base-pairing mechanism of binding to the target RNA, but also on the 

mechanism of target RNA cleavage. Single mismatches, or base-substitutions, near the 
site of cleavage can be chosen to completely eliminate catalytic activity of a ribozyme. 

Nucleic acid molecules having an endonuclease enzymatic activity are able to 
repeatedly cleave other separate RNA molecules in a nucleotide base sequence-specific 

20 manner. Such enzymatic nucleic acid molecules can be targeted to virtually any RNA 
transcript, and achieve efficient cleavage in vitro (Zaug et al, 324, Nature, 429 1986 ; 
Uhlenbeck, 1987 Nature, 328, 596; Kim et al., 84 Proc. Natl. Acad. Sci. USA, 8788, 1987; 
Dreyfus, 1988, Einstein Quart. J. Bio. Med., 6, 92; Haseloff and Gerlach, 334 Nature, 585, 
1988; Cech, 260 JAMA, 3030, 1988; Jefferies et al., 17 Nucleic Acids Research, 1371, 

25 1989; and Santoro et al., 1997 supra). 

Because of their sequence specificity, frans-cleaving ribozymes show promise as 
therapeutic agents for human disease (Usman & McSwiggen, 1995 Ann. Rep. Med. Chem. 
30, 285-294; Christoffersen and Marr, 1995 J. Med. Chem. 38, 2023-2037). Ribozymes 
can be designed to cleave specific RNA targets within the background of cellular RNA. 

30 Such a cleavage event renders the RNA non-functional and abrogates protein expression 
from that RNA. In this manner, synthesis of a protein associated with a disease state can 
be selectively inhibited (Warashina et al., 1999, Chemistry and Biology, 6, 237-250. 
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The nucleic acid molecules of the instant invention are also referred to as 
GeneBloc™ reagents, which are essentially nucleic acid molecules (e.g.; ribozymes, 
antisense) capable of down-regulating gene expression. 



5 Target sites 

Targets for useful ribozymes and antisense nucleic acids can be determined as 
disclosed in Draper et al, WO 93/23569; Sullivan et al, WO 93/23057; Thompson et al, 
WO 94/02595; Draper et al, WO 95/04818; McSwiggen et al, US Patent No. 5,525,468, 
and all hereby incorporated in their entireties by reference herein. Other examples include 

1 0 the following PCT applications, which concern inactivation of expression of disease- 
related genes: WO 95/23225, WO 95/13380, WO 94/02595, all incorporated by reference 
herein. Rather than repeat the guidance provided in those documents here, below are 
provided specific examples of such methods, not limiting to those in the art. Ribozymes 
and antisense to such targets are designed as described in those applications and 

1 5 synthesized to be tested in vitro and in vivo, as also described. The sequence of human 
PTP-1B, MetAP-2, BACE, ps-1, ps-2, HER2, PLN, TERT, and/or HBV RNAs (for 
example, GenBank accession Nos. (PTP-1B,. NM_002827), (MetAP-2, U29607), (BACE, 
AF190725), (ps-1, L76517), (ps-2, L43964), (HER2/c-erb2/neu, X03363), (PLN, 
NM_002667), (TERT, NM_003219) and (HBV, AF100308.1, HBV strain 2-18; 

20 additionally, other HBV strains can be screened by one skilled in the art, see Table 35 for 
other possible strains) were screened for optimal enzymatic nucleic acid and antisense 
target sites using a computer-folding algorithm. Antisense, hammerhead, DNAzyme, 
NCH (Inozyme), amberzyme, zinzyme or G-Cleaver ribozyme binding/cleavage sites were 
identified. These sites are shown in Tables 3-29, 31, 33, 34, 37-43, 56, 58, 59, 62, 63 (all 

25 sequences are 5' to 3' in the tables; X can be any base-paired sequence, the actual 

sequence is not relevant here). The nucleotide base position is noted in the Tables as that 
site to be cleaved by the designated type of enzymatic nucleic acid molecule. Table 36 
shows substrate positions selected from Renbo et al, 1987, Sci. Sin., 30, 507, used in 
Draper, US patent No. 6,017,756 entitled "METHOD AND REAGENT FOR 

30 INHIBITING HEPATITIS B VIRUS REPLICATION" and Draper et al., International 
PCT publication No. WO 93/23569, filed April 29, 1993, entitled "METHOD AND 
REAGENT FOR INHIBITING VIRAL REPLICATION". While human sequences can be 
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screened and enzymatic nucleic acid molecule and/or antisense thereafter designed, as 
discussed in Stinchcomb et al, WO 95/23225, mouse targeted ribozymes may be useful to 
test efficacy of action of the enzymatic nucleic acid molecule and/or antisense prior to 
testing in humans. 

5 Antisense, hammerhead, DNAzyme, NCH (Inozyme), amberzyme, zinzyme or G- 

Cleaver ribozyme binding/cleavage sites were identified, as discussed above. The nucleic 
acid molecules were individually analyzed by computer folding (Jaeger et al, 1989 Proc. 
Natl. Acad. Sci. USA, 86, 7706) to assess whether the sequences fold into the appropriate 
secondary structure. Those nucleic acid molecules with unfavorable intramolecular 

1 0 interactions such as between the binding arms and the catalytic core were eliminated from 
consideration. Varying binding arm lengths can be chosen to optimize activity. 

Antisense, hammerhead, DNAzyme, NCH, amberzyme, zinzyme or G-Cleaver 
ribozyme binding/cleavage sites were identified and were designed to anneal to various 
sites in the RNA target. The binding arms are complementary to the target site sequences 

1 5 described above. The nucleic acid molecules were chemically synthesized. The method of 
synthesis used follows the procedure for normal DNA/RNA synthesis as described below 
and in Usman et al, 1987 J. Am. Chem. Soc, 109, 7845; Scaringe et al., 1990 Nucleic 
Acids Res., 18, 5433; Wincott et at., 1 995 Nucleic Acids Res. 23, 2677-2684; and 
Caruthers et al, 1992, Methods in Enzymology 211,3-19. 

20 

Synthesis of Nucleic acid Molecules 

Synthesis of nucleic acids greater than 100 nucleotides in length is difficult using 
automated methods, and the therapeutic cost of such molecules is prohibitive. In this 
invention, small nucleic acid motifs ("small refers to nucleic acid motifs no more than 100 

25 nucleotides in length, preferably no more than 80 nucleotides in length, and most 
preferably no more than 50 nucleotides in length; e.g., antisense oligonucleotides, 
hammerhead or the NCH ribozymes) are preferably used for exogenous delivery. The 
simple structure of these molecules increases the ability of the nucleic acid to invade 
targeted regions of RNA structure. Exemplary molecules of the instant invention are 

30 chemically synthesized, and others can similarly be synthesized. 
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Oligonucleotides (e.g.; antisense GeneBlocs) are synthesized using protocols known 
in the art as described in Caruthers et al, 1992, Methods in Enzymology 21 1, 3-19, 
Thompson et al., International PCT Publication No. WO 99/54459, Wincott et al., 1995, 
Nucleic Acids Res. 23, 2677-2684, Wincott et al., 1997, Methods Mol. Bio., 74, 59, 
5 Brennan et al., 1998, Biotechnol Bioeng., 61, 33-45, and Brennan, US patent No. 

6,00 1 ,3 1 1 . All of these references are incorporated herein by reference. The synthesis of 
oligonucleotides makes use of common nucleic acid protecting and coupling groups, such 
as dimethoxytrityl at the 5 '-end, and phosphoramidites at the 3'-end. In a non-limiting 
example, small scale syntheses are conducted on a 394 Applied Biosystems, Inc. 

1 0 synthesizer using a 0.2 umol scale protocol with a 2.5 min coupling step for 2'-0- 

methylated nucleotides and a 45 sec coupling step for 2'-deoxy nucleotides. Table II 
outlines the amounts and the contact times of the reagents used in the synthesis cycle. 
Alternatively, syntheses at the 0.2 umol scale can be performed on a 96-well plate 
synthesizer, such as the instrument produced by Protogene (Palo Alto, CA) with minimal 

1 5 modification to the cycle. A 33-fold excess (60 uL of 0.1 1 M = 6.6 umol) of 2'-0-methyl 
phosphoramidite and a 105-fold excess of S-ethyl tetrazole (60 uL of 0.25 M = 15 umol) 
can be used in each coupling cycle of 2'-0-methyl residues relative to polymer-bound 5'- 
hydroxyl. A 22-fold excess (40 uL of 0.1 1 M = 4.4 umol) of deoxy phosphoramidite and a 
70-fold excess of S-ethyl tetrazole (40 uL of 0.25 M = 10 umol) can be used in each 

20 coupling cycle of deoxy residues relative to polymer-bound 5 ' -hydroxyl. Average 
coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by 
colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other 
oligonucleotide synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer 
include the following: detritylation solution is 3% TCA in methylene chloride (ABI); 

25 capping is performed with 16% N-methyl imidazole in THF (ABI) and 10% acetic 

anhydride/10% 2,6-lutidine in THF (ABI); and oxidation solution is 16.9 mM I2, 49 mM 
pyridine, 9% water in THF (PERSEPTIVE™). Burdick & Jackson Synthesis Grade 
acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole solution (0.25 M in 
acetonitrile) is made up from the solid obtained from American International Chemical, 

30 Inc. Alternately, for the introduction of phosphorothioate linkages, Beaucage reagent (3H- 
l,2-Benzodithiol-3-one 1,1 -dioxide, 0.05 M in acetonitrile) is used. 
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Deprotection of the antisense oligonucleotides is performed as follows: the polymer- 
bound trityl-on oligoribonucleotide is transferred to a 4 mL glass screw top vial and 
suspended in a solution of 40% aq. methylamine (1 mL) at 65 °C for 10 min. After 
cooling to -20 °C, the supernatant is removed from the polymer support. The support is 
5 washed three times with 1 .0 mL of EtOH:MeCN:H20/3 : 1 : 1 , vortexed and the supernatant 
is then added to the first supernatant. The combined supernatants, containing the 
oligoribonucleotide, are dried to a white powder. 

The method of synthesis used for normal RNA including certain enzymatic nucleic 
acid molecules follows the procedure as described in Usman et al, 1987, J. Am. Chem. 

1 0 Soc, 109, 7845; Scaringe et al, 1990, Nucleic Acids Res., 18, 5433; and Wincott et al, 
1995, Nucleic Acids Res. 23, 2677-2684 Wincott et al, 1997, Methods Mol. Bio., 1A, 59, 
and makes use of common nucleic acid protecting and coupling groups, such as 
dimethoxytrityl at the 5 '-end, and phosphoramidites at the 3'-end. In a non-limiting 
example, small scale syntheses are conducted on a 394 Applied Biosystems, Inc. 

1 5 synthesizer using a 0.2 umol scale protocol with a 7.5 min coupling step for alkylsilyl 

protected nucleotides and a 2.5 min coupling step for 2'-0-methylated nucleotides. Table 
II outlines the amounts and the contact times of the reagents used in the synthesis cycle. 
Alternatively, syntheses at the 0.2 umol scale can be done on a 96-well plate synthesizer, 
such as the instrument produced by Protogene (Palo Alto, CA) with minimal modification 

20 to the cycle. A 33-fold excess (60 uL of 0.11 M = 6.6 umol) of 2'-0-methyl 

phosphoramidite and a 75-fold excess of S-ethyl tetrazole (60 uL of 0.25 M = 15 umol) 
can be used in each coupling cycle of 2'-0-methyl residues relative to polymer-bound 5'- 
hydroxyl. A 66-fold excess (120 uL of 0.1 1 M = 13.2 umol) of alkylsilyl (ribo) protected 
phosphoramidite and a 150-fold excess of S-ethyl tetrazole (120 uL of 0.25 M = 30 umol) 

25 can be used in each coupling cycle of ribo residues relative to polymer-bound 5 '-hydroxyl. 
Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by 
colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other 
oligonucleotide synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer 
include the following: detritylation solution is 3% TCA in methylene chloride (ABI); 

30 capping is performed with 16% N-methyl imidazole in THF (ABI) and 10% acetic 
anhydride/10% 2,6-lutidine in THF (ABI); oxidation solution is 16.9 mM I2, 49 mM 
pyridine, 9% water in THF (PERSEPTIVE™). Burdick & Jackson Synthesis Grade 



WO 01/16312 PCT/US00/23998 
28 

acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole solution (0.25 M in 
acetonitrile) is made up from the solid obtained from American International Chemical, 
Inc. Alternately, for the introduction of phosphorothioate linkages, Beaucage reagent (3H- 
l,2-Benzodithiol-3-one l,l-dioxide0.05 M in acetonitrile) is used. 
5 Deprotection of the RNA is performed using either a two-pot or one-pot protocol. 

For the two-pot protocol, the polymer-bound trityl-on oligoribonucleotide is transferred to 
a 4 mL glass screw top vial and suspended in a solution of 40% aq. methylamine (1 mL) at 
65 °C for 10 min. After cooling to -20 °C, the supernatant is removed from the polymer 
support. The support is washed three times with 1 .0 mL of EtOH:MeCN:H20/3 : 1 : 1 , 

1 0 vortexed and the supernatant is then added to the first supernatant. The combined 

supematants, containing the oligoribonucleotide, are dried to a white powder. The base 
deprotected oligoribonucleotide is resuspended in anhydrous TEA/HF/NMP solution (300 
uL of a solution of 1.5 mL N-methylpyrrolidinone, 750 uL TEA and 1 mL TEA»3HF to 
provide a 1.4 M HF concentration) and heated to 65 °C. After 1.5 h, the oligomer is 

1 5 quenched with 1 . 5 M NH4HCO3 . 

Alternatively, for the one-pot protocol, the polymer-bound trityl-on 
oligoribonucleotide is transferred to a 4 mL glass screw top vial and suspended in a 
solution of 33% ethanolic methylamine/DMSO: 1/1 (0.8 mL) at 65 °C for 15 min. The 
vial is brought to r.t. TEA-3HF (0.1 mL) is added and the vial is heated at 65 °C for 15 

20 min. The sample is cooled at -20 °C and then quenched with 1.5 M NH4HCO3. 

For purification of the trityl-on oligomers, the quenched NH4HCO3 solution is 
loaded onto a C-18 containing cartridge that had been prewashed with acetonitrile 
followed by 50 mM TEAA. After washing the loaded cartridge with water, the RNA is 
detritylated with 0.5% TFA for 13 min. The cartridge is then washed again with water, 

25 salt exchanged with 1 M NaCl and washed with water again. The oligonucleotide is then 
eluted with 30% acetonitrile. 

Inactive hammerhead ribozymes or binding attenuated control (BAC) 
oligonucleotides) are synthesized by substituting a U for G5 and a U for A1 4 (numbering 
from Hertel, K. J., et al, 1992, Nucleic Acids Res. . 20, 3252). Similarly, one or more 

30 nucleotide substitutions can be introduced in other enzymatic nucleic acid molecules to 
inactivate the molecule and such molecules can serve as a negative control. 
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The average stepwise coupling yields are typically >98% (Wincott et al, 1995 
Nucleic Acids Res. 23, 2677-2684). Those of ordinary skill in the art will recognize that 
the scale of synthesis can be adapted to be larger or smaller than the example described 
above including but not limited to 96-well format, all that is important is the ratio of 
5 chemicals used in the reaction. 

Alternatively, the nucleic acid molecules of the present invention can be synthesized 
separately and joined together post-synthetically, for example, by ligation (Moore et al, 
1992, Science 256, 9923; Draper et al, International PCT publication No. WO 93/23569; 
Shabarova et al, 1991, Nucleic Acids Research 19, 4247; Bellon et al, 1997, Nucleosides 

10 & Nucleotides, 16, 951; Bellon et al, 1997, Bioconjugate Chem. 8, 204). 

The nucleic acid molecules of the present invention are modified extensively to 
enhance stability by modification with nuclease resistant groups, for example, 2'-amino, 2'- 
C-allyl, 2'-flouro, 2'-0-methyl, 2'-H (for a review see Usman and Cedergren, 1992, TIBS 
17, 34; Usman et al, 1994, Nucleic Acids Symp. Ser. 31, 163). Ribozymes are purified by 

1 5 gel electrophoresis using general methods or are purified by high pressure liquid 
chromatography (HPLC; see Wincott et al, supra, the totality of which is hereby 
incorporated herein by reference) and are re-suspended in water. 

The sequences of the ribozymes and antisense constructs that are chemically 
synthesized, useful in this study, are shown in Tables 3-31, 33, 34, 37-43, 56, 58, 59, 62, 

20 63. Those in the art will recognize that these sequences are representative only of many 
more such sequences where the enzymatic portion of the ribozyme (all but the binding 
arms) is altered to affect activity. The ribozyme and antisense construct sequences listed in 
Tables 3-31, 33, 34, 37-43, 56, 58, 59, 62, 63 may be formed of ribonucleotides or other 
nucleotides or non-nucleotides. Such ribozymes with enzymatic activity are equivalent to 

25 the ribozymes described specifically in the Tables. 

Optimizing Activity of the nucleic acid molecule of the invention. 

Chemically synthesizing nucleic acid molecules with modifications (base, sugar 
and/or phosphate) that prevent their degradation by serum ribonucleases may increase their 
potency (see e.g., Eckstein et al, International Publication No. WO 92/07065; Perrault et 

30 al, 1990 Nature 344, 565; Pieken et al., 1991, Science 253, 314; Usman and Cedergren, 
1992, Trends in Biochem. Sci. 17, 334; Usman et al, International Publication No. WO 
93/15187; Rossi et al, International Publication No. WO 91/03162; Sproat, US Patent No. 
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5,334,71 1 ; and Burgin et al, supra; all of these describe various chemical modifications 
that can be made to the base, phosphate and/or sugar moieties of the nucleic acid 
molecules herein and are all hereby incorporated by reference herein). Modifications 
which enhance their efficacy in cells, and removal of bases from nucleic acid molecules to 
5 shorten oligonucleotide synthesis times and reduce chemical requirements are desired. 
There are several examples in the art describing sugar, base and phosphate 
modifications that can be introduced into nucleic acid molecules (e.g., enzymatic nucleic 
acid molecules) without significantly effecting catalysis and with significant enhancement 
in their nuclease stability and efficacy. Enzymatic nucleic acid molecules are modified to 

1 0 enhance stability and/or enhance catalytic activity by modification with nuclease resistant 
groups, for example, 2'-amino, 2'-C-allyl, 2'-fluoro, 2'-Omethyl, 2'-0-allyl, 2'-H, 
nucleotide base modifications (for a review see Usman and Cedergren, 1992 TIBS 17, 34; 
Usman et al, 1994 Nucleic Acids Symp. Ser. 31, 163; Burgin et al, 1996 Biochemistry 35, 
1 4090). Sugar modification of enzymatic nucleic acid molecules have been extensively 

1 5 described in the art (see Eckstein et al, International Publication PCT No. WO 92/07065; 
Perrault et al. Nature 1990, 344, 565-568; Pieken et al. Science 1991, 253, 314-317; 
Usman and Cedergren, Trends in Biochem. Sci. 1992, 1 7, 334-339; Usman et al 
International Publication PCT No. WO 93/15187; Sproat, US Patent No. 5,334,711 and 
Beigelman et al, 1995 J. Biol Chem. 270, 25702; all of the references are hereby 

20 incorporated in their totality by reference herein). Such publications describe general 
methods and strategies to determine the location of incorporation of sugar, base and/or 
phosphate modifications and the like into enzymatic nucleic acid molecules without 
inhibiting catalysis, and are incorporated by reference herein. The 2'-position of the sugar 
in a nucleotide present in the nucleic acid molecules of the instant invention which 

25 tolerates substitution is selected from the group comprising -H, -OH, -COOH, -CONH 2 , - 
CONHR 1 , -CONR'R 2 , -NHz^NHR'^NR'R 2 , -NHCOR 1 , -SH, SR 1 , -F, -ONH 2 , - 
ONHR 1 , -ONR'R 2 , -NHOH, -NHOR 1 , -NR 2 OH, -NR 2 0R', substituted or unsubstituted 
Ci-Cio straight chain or branched alkyl, substituted or unsubstituted C 2 -C t o straight chain 
or branched alkenyl, substituted or unsubstituted C 2 -Ci 0 straight chain or branched alkynyl, 

30 substituted or unsubstituted Ci -Cio straight chain or branched alkoxy, substituted or 
unsubstituted C 2 -Cio straight chain or branched alkenyloxy, and substituted or 
unsubstituted C2-C10 straight chain or branched alkynyloxy. The substituents for sugar 2' 
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position preferably are independently halogen, cyano, amino, carboxy, ester, ether, 
carboxamide, hydroxy, or mercapto. R 1 and R 2 can be substituted or unsubstituted alkyl, 
alkenyl, or alkynyl groups, where the substituents are independently halogen, cyano, 
amino, carboxy, ester, ether, carboxamide, hydroxy, or mercapto. 
5 In view of such teachings, similar modifications can be used as described herein to 

modify the nucleic acid molecules of the instant invention. Such publications describe 
general methods and strategies to determine the location of incorporation of sugar, base 
and/or phosphate modifications and the like into ribozymes without inhibiting catalysis, 
and are incorporated by reference herein. In view of such teachings, similar modifications 
1 0 can be used as described herein to modify the nucleic acid molecules of the instant 
invention. 

Some of the non-limiting examples of base modifications that can be introduced into 
enzymatic nucleic acids without significantly effecting their catalytic activity include, 
inosine, purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 4, 6-trimethoxy 

1 5 benzene, 3-methyluracil, dihydrouridine, naphthyl, aminophenyl, 5-alkylcytidines (e.g., 
5-methylcytidine), 5-alkyluridines (e.g., ribothymidine), 5-halouridine (e.g., 
5-bromouridine) or 6-azapyrimidines or 6-alkylpyrimidines (e.g. 6-methyluridine) and 
others (Burgin et al, 1996, Biochemistry, 35, 14090). By "modified bases" in this aspect is 
meant nucleotide bases other than adenine, guanine, cytosine and uracil at 1' position or 

20 their equivalents; such bases may be used within the catalytic core of the enzyme and/or in 
the substrate-binding regions. 

The nucleic acid bases can be hypoxanthin-9-yl, or a functional equivalent thereof, 
in position 15 'of the ribozyme; the base at other positions may be guanin-9-yl, 
hypoxanthin-9-yl or 7-deazaguanin-9-yl in positions 5, 8 and 12 in the ribozyme; adenin-9- 

25 yl, 2,6-diaminopurin-9-yl, purin-9-yl or 7-deaza adenin-9-yl in positions 6, 9, 13 and 14; 
uracil-l-yl, uracil-5-yl, thymin-l-yl or 5-propynyluracil-l-yl in position 4; cytosin-l-yl, 5- 
methylcytosin-l-yl or 5-propynylcytosin-l-yl in position 3; and adenin-9-yl, cytosin-l-yl, 
guanin-9-yl, uracil-l-yl, uracil-5-yl, hypoxanthin-9-yl, thymin-l-yl, 5-methylcytosin-l-yl, 
2,6-diaminopurin-9-yl, purin-9-yl, 7-deaza adenin-9-yl, 7-deazaguanin-9-yl, 5- 

30 propynylcytosin-l-yl, 5-propynyluracil-l-yl, isoguanin-9-yl, 2-aminopurin-9-yl, 6- 
methyluracil-l-yl, 4-thiouracil-l-yl, 2-pyrimidone-l-yl, quinazoline-2,4-dione-l-yl, 
xanthin-9-yl, N 2 -dimethylguanin-9-yl, or a functional equivalent thereof in position 7. The 
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base at position 15.1 is preferably hypoxanthin-9-yl or an analog where no hydrogen bond 
can form between any group at the 2 position of the base and the 2-oxo group of C 16 ' 1 . 
Preferably, B is not guanin-9-yl in position 15.1. 

In particular, the invention features modified ribozymes having a base substitution 
5 selected from pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 4, 6-trimethoxy 
benzene, 3-methyluracil, dihydrouracil, naphthyl, 6-methyl-uracil and aminophenyl. 

While chemical modification of oligonucleotide internucleotide linkages with 
phosphorothioate, phosphorothioate, and/or 5'-methylphosphonate linkages improves 
stability, too many of these modifications may cause some toxicity. Therefore, when 

1 0 designing nucleic acid molecules, the amount of these internucleotide linkages should be 
minimized. The reduction in the concentration of these linkages should lower toxicity 
resulting in increased efficacy and higher specificity of these molecules. 

Nucleic acid molecules having chemical modifications which maintain or enhance 
activity are provided. Such nucleic acid molecules are also generally more resistant to 

1 5 nucleases than unmodified nucleic acid. Thus, in a cell and/or in vivo the activity may not 
be significantly lowered. Therapeutic nucleic acid molecules delivered exogenously must 
optimally be stable within cells until translation of the target RNA has been inhibited long 
enough to reduce the levels of the undesirable protein. This period of time varies between 
hours to days depending upon the disease state. Clearly, nucleic acid molecules must be 

20 resistant to nucleases in order to function as effective intracellular therapeutic agents. 

Improvements in the chemical synthesis of RNA and DNA (Wincott et al., 1995 Nucleic 
Acids Res. 23, 2677; Caruthers et al, 1992, Methods in Enzymology 21 1,3-19 (all are 
incorporated by reference herein) have expanded the ability to modify nucleic acid 
molecules by introducing nucleotide modifications to enhance their nuclease stability as 

25 described above. 

Use of these the nucleic acid-based molecules of the invention will lead to better 
treatment of the disease progression by affording the possibility of combination therapies 
{e.g., multiple antisense or enzymatic nucleic acid molecules targeted to different genes, 
nucleic acid molecules coupled with known small molecule inhibitors, or intermittent 

30 treatment with combinations of molecules (including different motifs) and/or other 

chemical or biological molecules). The treatment of patients with nucleic acid molecules 
may also include combinations of different types of nucleic acid molecules. 
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Therapeutic nucleic acid molecules (e.g., enzymatic nucleic acid molecules and 
antisense nucleic acid molecules) delivered exogenously must optimally be stable within 
cells until translation of the target RNA has been inhibited long enough to reduce the 
levels of the undesirable protein. This period of time varies between hours to days 
5 depending upon the disease state. Clearly, these nucleic acid molecules must be resistant 
to nucleases in order to function as effective intracellular therapeutic agents. 
Improvements in the chemical synthesis of nucleic acid molecules described in the instant 
invention and in the art have expanded the ability to modify nucleic acid molecules by 
introducing nucleotide modifications to enhance their nuclease stability as described 
1 0 above. 

By "enhanced enzymatic activity" is meant to include activity measured in cells 
and/or in vivo where the activity is a reflection of both catalytic activity and ribozyme 
stability. In this invention, the product of these properties is increased or not significantly 
(less than 10-fold) decreased in vivo compared to an all RNA ribozyme or all DNA 
1 5 enzyme. 

In yet another preferred embodiment, nucleic acid catalysts having chemical 
modifications which maintain or enhance enzymatic activity are provided. Such nucleic 
acid catalysts are also generally more resistant to nucleases than unmodified nucleic acid. 
Thus, in a cell and/or in vivo the activity may not be significantly lowered. As exemplified 
20 herein such ribozymes are useful in a cell and/or in vivo even if activity over all is reduced 
10 fold (Burgin et al, 1996, Biochemistry, 35, 14090). Such ribozymes herein are said to 
"maintain" the enzymatic activity of an all RNA ribozyme. 

In another aspect the nucleic acid molecules comprise a 5' and/or a 3'- cap 
structure. 

25 By "cap structure" is meant chemical modifications, which have been incorporated 

at either terminus of the oligonucleotide (see, for example, Wincott et al, WO 97/26270, 
incorporated by reference herein). These terminal modifications protect the nucleic acid 
molecule from exonuclease degradation, and may help in delivery and/or localization 
within a cell. The cap may be present at the 5'-terminus (5 '-cap) or at the 3 '-terminal (3'- 

30 cap) or may be present on both termini. In non-limiting examples: the 5 '-cap is selected 
from the group comprising inverted abasic residue (moiety); 4',5'-methylene nucleotide; 1- 
(beta-D-erythrofuranosyl) nucleotide, 4'-thio nucleotide; carbocyclic nucleotide; 1,5- 
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anhydrohexitol nucleotide; L-nucleotides; alpha-nucleotides; modified base nucleotide; 
phosphorodithioate linkage; rtreo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; 
acyclic 3,4-dihydroxybutyl nucleotide; acyclic 3, 5-dihydroxypentyl nucleotide, 3'-3'- 
inverted nucleotide moiety; 3 '-3 '-inverted abasic moiety; 3'-2'-inverted nucleotide moiety; 
5 3'-2*-inverted abasic moiety; 1,4-butanediol phosphate; 3'-phosphoramidate; 
hexylphosphate; aminohexyl phosphate; 3'-phosphate; 3'-phosphorothioate; 
phosphorodithioate; or bridging or non-bridging methylphosphonate moiety (for more 
details, see Wincott et at, International PCT publication No. WO 97/26270, incorporated 
by reference herein). 

10 In yet another preferred embodiment, the 3 '-cap is selected from a group 

comprising, 4', 5'-methylene nucleotide; l-(beta-D-erythrofuranosyl) nucleotide; 4'-thio 
nucleotide, carbocyclic nucleotide; 5'-amino-alkyl phosphate; l,3-diamino-2-propyl 
phosphate; 3-aminopropyl phosphate; 6-aminohexyl phosphate; 1 ,2-aminododecyl 
phosphate; hydroxypropyl phosphate; 1 ,5-anhydrohexitol nucleotide; L-nucleotide; alpha- 

1 5 nucleotide; modified base nucleotide; phosphorodithioate; //jreo-pentofuranosyl 
nucleotide; acyclic 3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide; 3,5- 
dihydroxypentyl nucleotide, 5'-5'-inverted nucleotide moiety; 5'-5'-inverted abasic moiety; 
5'-phosphoramidate; 5'-phosphorothioate; 1,4-butanediol phosphate; 5'-amino; bridging 
and/or non-bridging 5'-phosphoramidate, phosphorothioate and/or phosphorodithioate, 

20 bridging or non bridging methylphosphonate and 5'-mercapto moieties (for more details 
see Beaucage and Iyer, 1993, Tetrahedron 49, 1925; incorporated by reference herein). 

An "alkyl" group refers to a saturated aliphatic hydrocarbon, including straight- 
chain, branched-chain, and cyclic alkyl groups. Preferably, the alkyl group has 1 to 12 
carbons. More preferably it is a lower alkyl of from 1 to 7 carbons, more preferably 1 to 4 

25 carbons. The alkyl group may be substituted or unsubstituted. When substituted the 
substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, NO2 or N(CH3)2, 
amino, or SH. The term also includes alkenyl groups which are unsaturated hydrocarbon 
groups containing at least one carbon-carbon double bond, including straight-chain, 
branched-chain, and cyclic groups. Preferably, the alkenyl group has 1 to 12 carbons. 

30 More preferably it is a lower alkenyl of from 1 to 7 carbons, more preferably 1 to 4 

carbons. The alkenyl group may be substituted or unsubstituted. When substituted the 
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substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, N02, halogen, 
N(CH3)2, amino, or SH. The term "alkyl" also includes alkynyl groups which have an 
unsaturated hydrocarbon group containing at least one carbon-carbon triple bond, 
including straight-chain, branched-chain, and cyclic groups. Preferably, the alkynyl group 
5 has 1 to 12 carbons. More preferably it is a lower alkynyl of from 1 to 7 carbons, more 
preferably 1 to 4 carbons. The alkynyl group may be substituted or unsubstituted. When 
substituted the substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =0, =S, N02 
or N(CH3)2, amino or SH. 

Such alkyl groups may also include aryl, alkylaryl, carbocyclic aryl, heterocyclic 

1 0 aryl, amide and ester groups. An "aryl" group refers to an aromatic group which has at 
least one ring having a conjugated pi electron system and includes carbocyclic aryl, 
heterocyclic aryl and biaryl groups, all of which may be optionally substituted. The 
preferred substituent(s) of aryl groups are halogen, trihalomethyl, hydroxyl, SH, OH, 
cyano, alkoxy, alkyl, alkenyl, alkynyl, and amino groups. An "alkylaryl" group refers to an 

1 5 alkyl group (as described above) covalently joined to an aryl group (as described above). 
Carbocyclic aryl groups are groups wherein the ring atoms on the aromatic ring are all 
carbon atoms. The carbon atoms are optionally substituted. Heterocyclic aryl groups are 
groups having from 1 to 3 heteroatoms as ring atoms in the aromatic ring and the 
remainder of the ring atoms are carbon atoms. Suitable heteroatoms include oxygen, 

20 sulfur, and nitrogen, and include furanyl, thienyl, pyridyl, pyrrolyl, N-lower alkyl pyrrolo, 
pyrimidyl, pyrazinyl, imidazolyl and the like, all optionally substituted. An "amide" refers 
to an -C(0)-NH-R, where R is either alkyl, aryl, alkylaryl or hydrogen. An "ester" refers to 
an -C(0)-OR', where R is either alkyl, aryl, alkylaryl or hydrogen. 

By "nucleotide" as used herein is as recognized in the art to include natural bases 

25 (standard), and modified bases well known in the art. Such bases are generally located at 
the T position of a nucleotide sugar moiety. Nucleotides generally comprise a base, sugar 
and a phosphate group. The nucleotides can be unmodified or modified at the sugar, 
phosphate and/or base moiety, (also referred to interchangeably as nucleotide analogs, 
modified nucleotides, non-natural nucleotides, non-standard nucleotides and other; see, for 

30 example, Usman and McSwiggen, supra; Eckstein et al, International PCT Publication 
No. WO 92/07065; Usman et al, International PCT Publication No. WO 93/15187; 
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Uhlman & Peyman, supra, all are hereby incorporated by reference herein). There are 
several examples of modified nucleic acid bases known in the art as summarized by 
Limbach et al, 1994, Nucleic Acids Res. 22, 21 83. Some of the non-limiting examples of 
base modifications that can be introduced into nucleic acid molecules include, inosine, 
5 purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 4, 6-trimethoxy benzene, 3- 
methyl uracil, dihydrouridine, naphthyl, aminophenyl, 5-alkylcytidines (e.g., 
5-methylcytidine), 5-alkyluridines (e.g., ribothymidine), 5-halouridine (e.g., 
5-bromouridine) or 6-azapyrimidines or 6-alkylpyrimidines (e.g. 6-methyluridine), 
propyne, and others (Burgin et al, 1996, Biochemistry, 35, 14090; Uhlman & Peyman, 
1 0 supra). 

By "modified bases" in this aspect is meant nucleotide bases other than adenine, 
guanine, cytosine and uracil at 1' position or their equivalents; such bases may be used at 
any position, for example, within the catalytic core of an enzymatic nucleic acid molecule 
and/or in the substrate-binding regions of the nucleic acid molecule. Such modified 

1 5 nucleotides include dideoxynucleotides which have pharmaceutical utility well known in 
the art, as well as utility in basic molecular biology methods such as sequencing. 

In a preferred embodiment, the invention features modified ribozymes with 
phosphate backbone modifications comprising one or more phosphorothioate, 
phosphorodithioate, methylphosphonate, morpholino, amidate carbamate, carboxymethyl, 

20 acetamidate, polyamide, sulfonate, sulfonamide, sulfamate, formacetal, thioformacetal, 
and/or alkylsilyl, substitutions. For a review of oligonucleotide backbone modifications, 
see Hunziker and Leumann, 1995, Nucleic Acid Analogues: Synthesis and Properties, in 
Modern Synthetic Methods, VCH, 331-417, and Mesmaeker et al, 1994, Novel Backbone 
Replacements for Oligonucleotides, in Carbohydrate Modifications in Antisense Research, 

25 ACS, 24-39. These references are hereby incorporated by reference herein. 

By "abasic" is meant sugar moieties lacking a base or having other chemical groups 
in place of a base at the 1' position, (for more details, see Wincott et al, International PCT 
publication No. WO 97/26270). 

By "unmodified nucleoside" or "unmodified nucleotide" is meant one of the bases 

30 adenine, cytosine, guanine, thymine, uracil joined to the 1' carbon of p-D-ribo-furanose. 
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By "modified nucleoside" or "modified nucleotide" is meant any nucleotide base 
which contains a modification in the chemical structure of an unmodified nucleotide base, 
sugar and/or phosphate. 

In connection with 2'-modified nucleotides as described for the present invention, 
5 by "amino" is meant 2'-NH 2 or T-O- NH 2 , which may be modified or unmodified. Such 
modified groups are described, for example, in Eckstein et al., U.S. Patent 5,672,695 and 
Matulic-Adamic et al., WO 98/28317, which are both incorporated by reference in their 
entireties. 

Various modifications to nucleic acid (e.g., antisense and ribozyme) structure can be 
1 0 made to enhance the utility of these molecules. Such modifications will enhance shelf-life, 
half-life in vitro, stability, and ease of introduction of such oligonucleotides to the target 
site, e.g., to enhance penetration of cellular membranes, and confer the ability to recognize 
and bind to targeted cells. 

Use of these molecules will lead to better treatment of the disease progression by 
1 5 affording the possibility of combination therapies (e.g., multiple ribozymes targeted to 
different genes, ribozymes coupled with known small molecule inhibitors, or intermittent 
treatment with combinations of ribozymes (including different ribozyme motifs) and/or 
other chemical or biological molecules). The treatment of patients with nucleic acid 
molecules may also include combinations of different types of nucleic acid molecules. 
20 Therapies may be devised which include a mixture of ribozymes (including different 
ribozyme motifs), antisense and/or 2-5A chimera molecules to one or more targets to 
alleviate symptoms of a disease. 

Administration of Nucleic Acid Molecules 

25 Methods for the delivery of nucleic acid molecules are described in Akhtar et al, 

1992, Trends Cell Bio., 2, 139; and Delivery Strategies for Antisense Oligonucleotide 
Therapeutics, ed. Akhtar, 1 995 which are both incorporated herein by reference. Sullivan 
et al, PCT WO 94/02595, further describes the general methods for delivery of enzymatic 
RNA molecules. These protocols may be utilized for the delivery of virtually any nucleic 

30 acid molecule. Nucleic acid molecules may be administered to cells by a variety of 

methods known to those familiar to the art, including, but not restricted to, encapsulation 
in liposomes, by iontophoresis, or by incorporation into other vehicles, such as hydrogels, 
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cyclodextrins, biodegradable nanocapsules, and bioadhesive microspheres. For some 
indications, nucleic acid molecules may be directly delivered ex vivo to cells or tissues 
with or without the aforementioned vehicles. Alternatively, the nucleic acid/vehicle 
combination is locally delivered by direct injection or by use of a catheter, infusion pump 
5 or stent. Many examples in the art describe CNS delivery methods of oligonucleotides by 
osmotic pump, (see Chun et al, 1998, Neuroscience Letters, 257, 135-1 38, D'Aldin et al, 
1998, Mol Brain Research, 55, 151-164, Dryden^a/., 1998, J. Endocrinol, 157, 169- 
175, Ghirnikar et al, 1998, Neuroscience Letters, 247, 21-24) or direct infusion (Broaddus 
et al, 1997, Neurosurg. Focus, 3, article 4). Other routes of delivery include, but are not 

1 0 limited to oral (tablet or pill form) and/or intrathecal delivery (Gold, 1997, Neuroscience, 
76, 1 153-1 158). For a comprehensive review on drug delivery strategies including broad 
coverage of CNS delivery, see Jain, Drug Delivery Systems: Technologies and 
Commercial Opportunities, Decision Resources, 1998. Other routes of delivery include, 
but are not limited to, intravascular, intramuscular, subcutaneous or joint injection, aerosol 

1 5 inhalation, oral (tablet or pill form), topical, systemic, ocular, intraperitoneal and/or 
intrathecal delivery. More detailed descriptions of nucleic acid delivery and 
administration are provided in Sullivan et al, supra, Draper et al, PCT W093/23569; 
Beigelman et al, PCT WO99/05094, and Klimuk et al, PCT WO99/04819 all of which 
are incorporated by reference herein. 

20 The molecules of the instant invention can be used as pharmaceutical agents. 

Pharmaceutical agents prevent, inhibit the occurrence, or treat (alleviate a symptom to 
some extent, preferably all of the symptoms) of a disease state in a patient. 

The negatively charged polynucleotides of the invention can be administered (e.g., 
RNA, DNA or protein) and introduced into a patient by any standard means, with or 

25 without stabilizers, buffers, and the like, to form a pharmaceutical composition. When it is 
desired to use a liposome delivery mechanism, standard protocols for formation of 
liposomes can be followed. The compositions of the present invention may also be 
formulated and used as tablets, capsules or elixirs for oral administration; suppositories for 
rectal administration; sterile solutions; suspensions for injectable administration; and the 

30 other compositions known in the art. 
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The present invention also includes pharmaceutically acceptable formulations of the 
compounds described. These formulations include salts of the above compounds, e.g., 
acid addition salts, for example, salts of hydrochloric, hydrobromic, acetic acid, and 
benzene sulfonic acid. 
5 A pharmacological composition or formulation refers to a composition or 

formulation in a form suitable for administration, e.g., systemic administration, into a cell 
or patient, preferably a human. Suitable forms, in part, depend upon the use or the route of 
entry, for example, oral, transdermal, or by injection. Such forms should not prevent the 
composition or formulation from reaching a target cell (i.e., a cell to which the negatively 

1 0 charged polymer is desired to be delivered to). For example, pharmacological 

compositions injected into the blood stream should be soluble. Other factors are known in 
the art, and include considerations such as toxicity and forms which prevent the 
composition or formulation from exerting its effect. 

By "systemic administration" is meant in vivo systemic absorption or accumulation 

15 of drugs in the blood stream followed by distribution throughout the entire body. 

Administration routes which lead to systemic absorption include, without limitations: 
intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary and 
intramuscular. Each of these administration routes expose the desired negatively charged 
polymers, e.g., nucleic acids, to an accessible diseased tissue. The rate of entry of a drug 

20 into the circulation has been shown to be a function of molecular weight or size. The use 
of a liposome or other drug carrier comprising the compounds of the instant invention can 
potentially localize the drug, for example, in certain tissue types, such as the tissues of the 
reticular endothelial system (RES). A liposome formulation which can facilitate the 
association of drug with the surface of cells, such as, lymphocytes and macrophages is also 

25 useful. This approach may provide enhanced delivery of the drug to target cells by taking 
advantage of the specificity of macrophage and lymphocyte immune recognition of 
abnormal cells, such as cancer cells. 

By pharmaceutically acceptable formulation is meant, a composition or formulation 
that allows for the effective distribution of the nucleic acid molecules of the instant 

30 invention in the physical location most suitable for their desired activity. Nonlimiting 

examples of agents suitable for formulation with the nucleic acid molecules of the instant 
invention include: P-glycoprotein inhibitors (such as Pluronic P85) which can enhance 
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entry of drugs into the CNS (Jolliet-Riant and Tillement, 1999, Fundam. Clin. Pharmacol, 
13, 16-26); biodegradable polymers, such as poly (DL-lactide-coglycolide) microspheres 
for sustained release delivery after intracerebral implantation (Emerich, DF et al, 1999, 
Cell Transplant, 8, 47-58) Alkermes, Inc. Cambridge, MA; and loaded nanoparticles, such 
5 as those made of polybutylcyanoacrylate, which can deliver drugs across the blood brain 
barrier and can alter neuronal uptake mechanisms {Prog Neuropsychopharmacol Biol 
Psychiatry, 23, 941-949, 1999). Other non-limiting examples of delivery strategies for the 
nucleic acid molecules of the instant invention include material described in Boado et al, 
1998, J. Pharm. ScL, 87, 1308-1315; Tylers al, 1999, FEB 'S Lett., 421, 280-284; 

1 0 Pardridge et al, 1995, PNAS USA., 92, 5592-5596; Boado, 1995, Adv. Drug Delivery Rev., 
15, 73-107; Aldrian-Herrada et al, 1998, Nucleic Acids Res., 26, 4910-4916; and Tyler et 
al, 1999, PNAS USA., 96, 7053-7058. 

The invention also features the use of the composition comprising surface-modified 
liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long-circulating 

1 5 liposomes or stealth liposomes). These formulations offer a method for increasing the 

accumulation of drugs in target tissues. This class of drug carriers resists opsonization and 
elimination by the mononuclear phagocytic system (MPS or RES), thereby enabling longer 
blood circulation times and enhanced tissue exposure for the encapsulated drug (Lasic et 
al Chem. Rev. 1995, 95, 2601-2627; Ishiwata et al, Chem. Pharm. Bull. 1995, 43, 1005- 

20 1011). Such liposomes have been shown to accumulate selectively in tumors, presumably 
by extravasation and capture in the neovascularized target tissues (Lasic et al, Science 
1995, 267, 1275-1276; Oku et al, 1995, Biochim. Biophys. Acta, 1238, 86-90). The long- 
circulating liposomes enhance the pharmacokinetics and pharmacodynamics of DNA and 
RNA, particularly compared to conventional cationic liposomes which are known to 

25 accumulate in tissues of the MPS (Liu et al., J. Biol. Chem. 1995, 42, 24864-24870; Choi 
et al, International PCT Publication No. WO 96/10391; Ansell et al, International PCT 
Publication No. WO 96/10390; Holland et al, International PCT Publication No. WO 
96/10392; all of which are incorporated herein by reference). Long-circulating liposomes 
are also likely to protect drugs from nuclease degradation to a greater extent compared to 

30 cationic liposomes, based on their ability to avoid accumulation in metabolically 
aggressive MPS tissues such as the liver and spleen. 
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The present invention also includes compositions prepared for storage or 
administration which include a pharmaceutically effective amount of the desired 
compounds in a pharmaceutically acceptable carrier or diluent. Acceptable carriers or 
diluents for therapeutic use are well known in the pharmaceutical art, and are described, 
5 for example, in Remington 's Pharmaceutical Sciences, Mack Publishing Co. (A.R. 

Gennaro edit. 1985) hereby incorporated by reference herein. For example, preservatives, 
stabilizers, dyes and flavoring agents may be provided. These include sodium benzoate, 
sorbic acid and esters of p-hydroxybenzoic acid. In addition, antioxidants and suspending 
agents may be used. 

10 A pharmaceutically effective dose is that dose required to prevent, inhibit the 

occurrence, or treat (alleviate a symptom to some extent, preferably all of the symptoms) 
of a disease state. The pharmaceutically effective dose depends on the type of disease, the 
composition used, the route of administration, the type of mammal being treated, the 
physical characteristics of the specific mammal under consideration, concurrent 

1 5 medication, and other factors which those skilled in the medical arts will recognize. 
Generally, an amount between 0.1 mg/kg and 100 mg/kg body weight/day of active 
ingredients is administered dependent upon potency of the negatively charged polymer. 

The nucleic acid molecules of the present invention may also be administered to a 
patient in combination with other therapeutic compounds to increase the overall 

20 therapeutic effect. The use of multiple compounds to treat an indication may increase the 
beneficial effects while reducing the presence of side effects. 

Alternatively, certain of the nucleic acid molecules of the instant invention can be 
expressed within cells from eukaryotic promoters {e.g., Izant and Weintraub, 1985, 
Science, 229, 345; McGarry and Lindquist, 1986, Proc. Natl. Acad. Sci., USA 83, 399; 

25 Scanlon et al, 1991, Proc. Natl. Acad. Sci. USA, 88, 10591-5; Kashani-Sabet et al, 1992, 
Antisense Res. Dev., 2,3-15; Dropulic et al., 1992, J. Virol., 66, 1432-41; Weerasinghe et 
al., 1991, J. Virol, 65, 5531-4; Ojwangera/., 1992, Proc. Natl. Acad. Sci. USA, 89, 
10802-6; Chen et al, 1992, Nucleic Acids Res., 20, 4581-9; Sarvere/a/., 1990 Science, 
247, 1222-1225; Thompson et al, 1995, Nucleic Acids Res., 23, 2259; Good et al, 1997, 

30 Gene Therapy, 4, 45; all of these references are hereby incorporated herein, in their 
totalities, by reference). Those skilled in the art realize that any nucleic acid can be 
expressed in eukaryotic cells from the appropriate DNA/RNA vector. The activity of such 
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nucleic acids can be augmented by their release from the primary transcript by a ribozyme 
(Draper et al, PCT WO 93/23569, and Sullivan et al, PCT WO 94/02595; Ohkawa et al, 
1992, Nucleic Acids Symp. Ser., 27, 15-6; Taira et al, 1991, Nucleic Acids Res., 19, 5125- 
30; Ventura et al, 1993, Nucleic Acids Res., 21 , 3249-55; Chowrira et al, 1994, J. Biol. 
5 Chem., 269, 25856; all of these references are hereby incorporated in their totality by 
reference herein). 

In another aspect of the invention, RNA molecules of the present invention are 
preferably expressed from transcription units (see, for example, Couture et al, 1996, TIG., 
12, 510) inserted into DNA or RNA vectors. The recombinant vectors are preferably DNA 

1 0 plasmids or viral vectors. Ribozyme expressing viral vectors could be constructed based 
on, but not limited to, adeno-associated virus, retrovirus, adenovirus, or alphavirus. 
Preferably, the recombinant vectors capable of expressing the nucleic acid molecules are 
delivered as described above, and persist in target cells. Alternatively, viral vectors may 
be used that provide for transient expression of nucleic acid molecules. Such vectors 

1 5 might be repeatedly administered as necessary. Once expressed, the nucleic acid molecule 
binds to the target mRNA. Delivery of nucleic acid molecule expressing vectors could be 
systemic, such as by intravenous or intra-muscular administration, by administration to 
target cells ex-planted from the patient followed by reintroduction into the patient, or by 
any other means that would allow for introduction into the desired target cell (for a review 

20 see Couture et al, 1 996, TIG., 12,510). 

In one aspect, the invention features an expression vector comprising a nucleic acid 
sequence encoding at least one of the nucleic acid molecules of the instant invention is 
disclosed. The nucleic acid sequence encoding the nucleic acid molecule of the instant 
invention is operably linked in a manner which allows expression of that nucleic acid 

25 molecule. 

In another aspect the invention features an expression vector comprising: a) a 
transcription initiation region (e.g., eukaryotic pol I, II or HI initiation region); b) a 
transcription termination region (e.g., eukaryotic pol I, II or III termination region); c) a 
nucleic acid sequence encoding at least one of the nucleic acid catalyst of the instant 
30 invention; and wherein said sequence is operably linked to said initiation region and said 
termination region, in a manner which allows expression and/or delivery of said nucleic 
acid molecule. The vector may optionally include an open reading frame (ORF) for a 
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protein operably linked on the 5' side or the 3'-side of the sequence encoding the nucleic 
acid catalyst of the invention; and/or an intron (intervening sequences). 

Transcription of the nucleic acid molecule sequences are driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase HI 
5 (pol ET). Transcripts from pol n or pol HI promoters will be expressed at high levels in all 
cells; the levels of a given pol II promoter in a given cell type will depend on the nature of 
the gene regulatory sequences (enhancers, silencers, etc.) present nearby. Prokaryotic 
RNA polymerase promoters are also used, providing that the prokaryotic RNA polymerase 
enzyme is expressed in the appropriate cells (Elroy-Stein and Moss, 1990, Proc. Natl. 

10 Acad. Sci. USA, 87, 6743-7; Gao and Huang 1993, Nucleic Acids Res., 21, 2867-72; 

Lieber et al., 1993, Methods Enzymol, 217, 47-66; Zhou et al., 1990, Mol. Cell Biol, 10, 
4529-37). All of these references are incorporated by reference herein. Several 
investigators have demonstrated that nucleic acid molecules, such as ribozymes expressed 
from such promoters can function in mammalian cells (e.g. Kashani-Sabet et al., 1992, 

1 5 Antisense Res. Dev., 2, 3-15; Ojwang et al., 1992, Proc. Natl. Acad. Sci. USA, 89, 

10802-6; Chen et al, 1992, Nucleic Acids Res., 20, 4581-9; Yu et al, 1993, Proc. Natl. 
Acad. Sci. U SA, 90, 6340-4; L'Huilliere? al, 1992, EMBO J., 11, 4411-8; Lisziewicz et 
al, 1993, Proc. Natl. Acad. Sci. U. S. A, 90, 8000-4; Thompson et al, 1995, Nucleic 
Acids Res., 23, 2259; Sullenger & Cech, 1993, Science, 262, 1566). More specifically, 

20 transcription units such as the ones derived from genes encoding U6 small nuclear 

(snRNA), transfer RNA (tRNA) and adenovirus VA RNA are useful in generating high 
concentrations of desired RNA molecules such as ribozymes in cells (Thompson et al, 
supra; Couture and Stinchcomb, 1996, supra; Noonberg et al, 1994, Nucleic Acid Res., 
22, 2830; Noonberg et al, US Patent No. 5,624,803; Good et al, 1997, Gene Ther., 4, 45; 

25 Beigelman et al, International PCT Publication No. WO 96/18736; all of these 

publications are incorporated by reference herein. The above ribozyme transcription units 
can be incorporated into a variety of vectors for introduction into mammalian cells, 
including but not restricted to, plasmid DNA vectors, viral DNA vectors (such as 
adenovirus or adeno-associated virus vectors), or viral RNA vectors (such as retroviral or 

30 alphavirus vectors) (for a review see Couture and Stinchcomb, 1996, supra). 



WO 01/16312 PCT/US00/23998 
44 

In yet another aspect, the invention features an expression vector comprising nucleic 
acid sequence encoding at least one of the nucleic acid molecules of the invention, in a 
manner which allows expression of that nucleic acid molecule. The expression vector 
comprises in one embodiment; a) a transcription initiation region; b) a transcription 
5 termination region; c) a nucleic acid sequence encoding at least one said nucleic acid 
molecule; and wherein said sequence is operably linked to said initiation region and said 
termination region, in a manner which allows expression and/or delivery of said nucleic 
acid molecule. In another preferred embodiment the expression vector comprises: a) a 
transcription initiation region; b) a transcription termination region; c) an open reading 

1 0 frame; d) a nucleic acid sequence encoding at least one said nucleic acid molecule, 

wherein said sequence is operably linked to the 3 '-end of said open reading frame; and 
wherein said sequence is operably linked to said initiation region, said open reading frame 
and said termination region, in a manner which allows expression and/or delivery of said 
nucleic acid molecule. In yet another embodiment, the expression vector comprises: a) a 

1 5 transcription initiation region; b) a transcription termination region; c) an intron; d) a 

nucleic acid sequence encoding at least one said nucleic acid molecule; and wherein said 
sequence is operably linked to said initiation region, said intron and said termination 
region, in a manner which allows expression and/or delivery of said nucleic acid molecule. 
In another embodiment, the expression vector comprises: a) a transcription initiation 

20 region; b) a transcription termination region; c) an intron; d) an open reading frame; e) a 
nucleic acid sequence encoding at least one said nucleic acid molecule, wherein said 
sequence is operably linked to the 3'-end of said open reading frame; and wherein said 
sequence is operably linked to said initiation region, said intron, said open reading frame 
and said termination region, in a manner which allows expression and/or delivery of said 

25 nucleic acid molecule. 



Examples : 

The following are non-limiting examples showing the selection, isolation, synthesis 
and activity of nucleic acids of the instant invention. 



30 
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Example 1: Telomerase 

The ribonucleoprotein enzyme telomerase consists of an RNA template subunit and 
one or more protein subunits including telomerase reverse transcriptase (TERT), which 
function together to direct the synthesis of telomeres. Telomeres exist as non-nucleosome 
5 DNA/protein complexes at the physical ends of eukaryotic chromosomes. These capping 
structures maintain chromosome stability and replicative potential (Zakian, V. A., 1995, 
Science, 270, 1601-1607). Telomere structure is characterized by tandem repeats of 
conserved DNA sequences rich in G-C base pairs. Additional conserved telomere 
elements include a terminal 3 '-overhang in the G-rich strand and non-histone structural 

1 0 proteins that are complexed with telomeric DNA in the nucleus. (Blackburn, "E., 1990, 
JBC, 265, 5919-5921 .). Observed shortening of telomeres coincides with the onset of 
cellular senescence in most somatic cell lines lacking significant levels of telomerase. 
This finding has had a profound impact on our views concerning the mechanisms of aging, 
age related disease, and cancer. 

1 5 Conventional DNA polymerases are unable to fully replicate the ends of linear 

chromosomes (Watson, J. D., 1972, Nature, 239, 197-201). This inability stems from the 
3' G-rich overhang that is a product of ribonuclease cleavage of the RNA primer used in 
DNA replication. The overhang prevents DNA polymerase replication since the recessed 
C-rich parent strand cannot be used as a template. Telomerase overcomes this limitation 

20 by extending the 3' end of the chromosome using deoxyribonucleotides as substrates and a 
sequence within the telomerase RNA subunit as a template. (Lingner, J., 1995, Science, 
269, 1533-1534). As such, telomerase is considered a reverse transcriptase that is 
responsible for telomere maintenance. 

Telomerase was first discovered by in Tetrahymena thermophila in 1985 (Greider, 

25 C. W., 1995, Cell, 43, 405-413). The RNA subunits and their respective genes were later 
discovered and characterized in protozoa, budding yeast, and mammals. Genetic studies 
of these genes confirmed the role of telomerase RNA (TR) in determining telomere 
sequence by mutating genes which encode the telomeric RNA (Yu, G. L., 1990, Nature, 
344, 126-132), (Singer, M. S., 1994, Science, 266, 404-409), (Blasco, M. A., 1995, 

30 Science, 269, 1267-1270). These studies showed that telomerase activity parallels TR 
expression in protozoa, yeast and mice. However, the expression of human telomerase 
RNA (hTR) does not correlate well with telomerase activity in mammalian cells. Many 
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human tissues express hTR but are devoid of telomerase activity (Feng, J., 1995, Science, 
269, 1236-1241). Knockout mice, in which the mTR gene has been deleted from germline 
cells, have been shown to be viable for at least six generations. Cells from later 
generations of these mice showed chromosomal abnormalities consistent with telomere 
5 degradation, indicating that mTR is necessary for telomere length maintenance, but is not 
required for embryonic development, oncogenic transformation, or tumor formation in 
mice (Blasco, M. A., 1997, Cell, 91, 25-34). 

The first catalytically active subunit of telomerase (pi 23) was isolated from Euplotes 
aediculatus along with another subunit (p43) and a 66-kD RNA subunit (Linger, J., 1996, 

1 0 Proc. Natl. Acad. Sci., 93, 10712-10717). Subsequent studies revealed telomerase 
catalytic subunit homo logs from fission yeast (Est2p) and human genes (TRT1). The 
human homolog, TRT1 encoding hTERT, expressed mRNA with a strong correlation to 
telomerase activity in human cells (Nakamura, T. M., 1997, Science, 277, 955-959). 
Reconstitution of telomerase activity with in vitro transcribed and translated hTERT and 

1 5 hTR, either co-synthesized or simply mixed, demonstrated that hTERT and hTR represent 
the minimal components of telomerase. Furthermore, transient expression of hTERT in 
normal diploid human cells restored telomerase activity, demonstrating that hTERT is the 
only component necessary to restore telomerase activity in normal human cells (Weinrich, 
S. L., 1997, Nature Genetics, 17, 498-502). The introduction of telomerase into normal 

20 human cells using hTERT expression via transfection has resulted in the extension of life 
span in these cells. Such findings indicate that telomere loss in the absence of telomerase 
is the "mitotic clock" that controls the replicative potential of a cell prior to senescence 
(Bodnar, A. G., 1998, Science, 279, 349-352). 

Expression of telomerase is observed in germ cell and most cancer cell lines. These 

25 "immortal" cell lines continue to divide without shortening of their telomeres (Kim, N. 
W., 1994, Science, 266, 201 1-2015). A model of tumor progression has evolved from 
these findings, suggesting a role for telomerase expression in malignant transformation. 
Successful malignant transformation in human cells was accomplished for the first time by 
ectopic expression of hTERT in combination with two oncogenes, SV40 large-T and H- 

30 ras. Injection of nude mice with cells expressing these oncogenes and hTERT resulted in 
rapid growth of tumors. These observations indicate that hTERT mediated telomere 



WO 01/16312 PCT/US00/23998 
47 

maintenance is essential for the formation of human tumor cells (Hahn, W. C, 1999, 
Nature, 400, 464-468). 

Various methods have been developed to assay telomerase activity in vitro. The 
most widely used method to characterize telomerase activity is the telomeric repeat 
5 amplification protocol (TRAP). TRAP utilizes RT-PCR of cellular extracts to measure 
telomerase activity by making the amount of PCR target dependant upon the biochemical 
activity of the enzyme (Kim, N. W., 1997, Nucleic Acids Research, 25, 2595-2597, which 
is incorporated by reference herein). 

A method based on Kim is as follows. Briefly, for the telomerase assay, 2ug of 

1 0 protein extract is used. The extract is assayed in 50nl of reaction mixture containing 0. 1 
ug TS substrate primer (5 -AATCCGTCGAGCAGAGTT-3', end-labeled using alpha- 32 P- 
ATP and T4 polynucleotide kinase), O.lug ACX return primer(5'-GCGCGG[CTTACC] 3 
CTAACC-3'), 0.1 ug NT internal control primer (5'-ATCGCTTCTCGGCCTTTT-3'), 0.01 
micromol TSNT internal control template (5'- 

1 5 AATCCGTCGAGCAGAGTTAAAAGGCCGAGAACGAT-3), 50 uM each 

deoxynucleoside triphosphate, 2 U of Taq DNA polymerase, and 2 ul CHAPS protein 
extract, all in IX TRAP buffer (20 mM Tris (pH 8.3), 68 mM KC1, 1.5 mM MgCl 2 , 1 mM 
EGTA, 0.05% Tween 20). Each reaction is placed in a thermocycler block preheated to 30 
C and incubated at 30 C for 10 minutes, then cycled for 27 cycles of 94 degrees C for 30 

20 seconds, 60 degrees C for 30 seconds. Reaction products are separated on a denaturing 8% 
polyacrylamide gel, followed by drying of the gel and autoradiography. The internal 
control (to control for possible Taq polymerase inhibition) generates a band of 36 nt. 
Comparison of radioactive signal integrated (e.g., by phorphorimager analysis) for 
telomerase-extended bands with the radioactive signal from a reaction performed with a 

25 known amount of quantification standard template (termed R8; 5'- 

AATCCGTCGAGCAGAGTTAG [GGTTAG] 7 -3) allows expression of telomerase 
activity as an absolute value. The absolute value = TPG (total product generated) =[(TP- 
TPi)/TI]/[(R8-B)/RI)] x 100, where TP = telomerase products from test extract, TPi = 
telomerase products from a heat-inactivated (75 C, 10 minutes) extract reaction, TI = the 

30 signal from the internal control, R8 = the signal from the R8 qualification standard 

template reaction, B = signal from a lysis buffer-only blank reaction, and RI = the internal 
control value for the reaction containing R8 template and NT and TSNT control primers. 
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TPG values of 0-10,000 are possible, with the linear range being from approximately 1 to 
1000 TPG. The range of 1 to 1000 TPG encompasses the minimum and maximum levels 
of telomerase activity in most tumor samples tested, while non-tumor cells most often 
have no telomerase activity (TPG approximately zero). 
5 Telomerase activity may also be assayed as follows. Samples to be assayed for 

telomerase activity are prepared by extraction into CHAPS lysis buffer (lOmM Tris pH 
7.5, ImM MgCl 2 , ImM EGTA, 0.1 mM PMSF, 5mM -mercaptoethanol, ImM DTT, 
0.5% 3-[(3-cholamidopropyl)-dimethyl-amino]-l- propanesulfonate (CHAPS), 10% 
glycerol and 40 U/ml RNAse inhibitor (Promega, Madison, WI, U.S.A.). Cells are 

1 0 suspended in CHAPS lysis buffer and incubated on ice for 30 minutes, which allows lysis 
of 90-100% of cells. Lysate is then transferred to polyallomer centrifuge tubes and spun at 
100,000 x g for 1 hour at 4 degrees C. The supernatant is the protein extract, and 
concentration ranges of 4-10 ug/ul are suitable for telomerase assay. Extracts may be 
concentrated if necessary using a Microcon Micro filter 30 (Amicron, Beverly, MA U.S.A.) 

1 5 according to the manufactureris instructions. Extracts may be stored frozen at -80 degrees 
C until assayed. 

A variety of animal models have been designed to assay telomerase activity in vivo. 
Inhibition of telomerase activity has been analyzed in rats via cell proliferation studies 
with MNU (N-methyl-N-nitosurea) induced mammary carcinomas in response to treatment 

20 with 4-(hydroxyphenyl)retinamide (4-HPR), a known inhibitor of mammary 

carcinogenesis in animal models and premenopausal women (Bednarek, A., 1999, 
Carcinogenesis, 20, 879-883). Additional studies have focused on the up-regulation of 
telomerase in transformed cell lines from animal and human model systems (Zhang, P. B., 
1998, Leuk. Res., 22, 509-516), (Chadeneau, C, 1995, Oncogene, 11, 893-898), 

25 (Greenberg, R., 1999, Oncogene, 18, 1219-1226). 

Human cell culture studies have been established to assay inhibition of telomerase 
activity in human carcinomas responding to various therapeutics. A human breast cancer 
model for studying telomerase inhibitors is described (Raymond, E., 1999, Br. J. Cancer, 
80, 1332-1341). Human studies of telomerase expression as related to various other 

30 cancers are described including cervical cancer (Nakano, K., 1998, Am. J. Pathol, 153, 
857-864), endometrial cancer (Kyo, S., 1999, Int. J. Cancer, 80, 60-63), meningeal 
carcinoma (Kleinschmidt-DeMasters, B. K., 1998, J. Neurol. Sci., 161, 124-134), lung 
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carcinoma (Yashima, K., 1997, Cancer Reseach, 57, 2372-2377), testicular cancer in 
response to cisplatin (Burger, A. M., 1997, Eur. J. Cancer, 33, 638-644), and ovarian 
carcinoma (Counter, C. M., 1994, Proc. Natl. Acad. Sci, 91, 2900-2904). 

Particular degenerative and disease states that can be associated with telomerase 
5 expression modulation include but are not limited to: 

Cancer : Almost all human tumors have detectable telomerase activity (Shay, J. W., 1997, 
Eur. J. Cancer, 33, 787-791). Treatment with telomerase inhibitors may provide effective 
cancer therapy with minimal side effects in normal somatic cells that lack telomerase 
activity. The therapeutic potential exists for the treatment of a wide variety of cancer 
10 types. 

Restinosis : Telomerase inhibition in vascular smooth muscle cells may inhibit 
restinosis by limiting proliferation of these cells. 

Infectious disease : Telomerase inhibition in infectious cell types that express 
telomerase activity may provide selective anti-infectious agent activity. Such treatment 
1 5 may prove especially effective in protozoan-based infection such as Giardia and Lesh 
Meniesis. 

Transplant rejection : Telomerase inhibition in endothelial cell types may 
demonstrate selective immunnosuppressant activity. Activation of telomerase in 
transplant cells could benefit grafting success through increased proliferative potential. 

20 Autoimmune disease : Telomerase modulation in various immune cells may prove 

beneficial in treating diseases such as multiple sclerosis, lupus, and AIDS. 

Age related disease : Activation of telomerase expression in cells at or nearing 
senescence as a result of advanced age or premature aging could benefit conditions such as 
macular degeneration, skin ulceration, and rheumatoid arthritis. 

25 The present body of knowledge in telomerase research indicates the need for 

methods to assay telomerase activity and for compounds that can regulate telomerase 
expression for research, diagnostic, trait alteration, animal health and therapeutic use. 

Gemcytabine and cyclophosphamide are non-limiting examples of chemotherapeutic 
agents that can be combined with or used in conjunction with the nucleic acid molecules 

30 (e.g. ribozymes and antisense molecules) of the instant invention. Those skilled in the art 
will recognize that other drugs such as anti-cancer compounds and therapies can be 
similarly be readily combined with the nucleic acid molecules of the instant invention 
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(e.g. ribozymes and antisense molecules) and are hence within the scope of the instant 
invention. Such compounds and therapies are well known in the art (see for example 
Cancer: Principles and Pranctice of Oncology, Volumes 1 and 2, eds Devita, V.T., 
Hellman, S., and Rosenberg, S.A., J.B. Lippincott Company, Philadelphia, USA; 
5 incorporated herein by reference) and include, without limitations, antifolates; 
fluoropyrimidines; cytarabine; purine analogs; adenosine analogs; amsacrine; 
topoisomerase I inhibitors; anthrapyrazoles; retinoids; antibiotics such as bleomycin, 
anthacyclins, mitomycin C, dactinomycin, and mithramycin; hexarnethylmelamine; 
dacarbazine; 1-asperginase; platinum analogs; alkylating agents such as nitrogen mustard, 

1 0 melphalan, chlorambucil, busulfan, ifosfamide, 4-hydroperoxycyclophosphamide, 
nitrosoureas, thiotepa; plant derived compounds such as vinca alkaloids, 
epipodophyllotoxins, taxol; Tomaxifen; radiation therapy; surgery; nutritional 
supplements; gene therapy; radiotherapy such as 3D-CRT; immunotoxin therapy such as 
ricin, monoclonal antibodies herceptin; and the like. For combination therapy, the nucleic 

1 5 acids of the invention are prepared in one of two ways. First, the agents are physically 
combined in a preparation of nucleic acid and chemotherapeutic agent, such as a mixture 
of a nucleic acid of the invention encapsulated in liposomes and ifosfamide in a solution 
for intravenous administration, wherein both agents are present in a therapeutically 
effective concentration (e.g., ifosfamide in solution to deliver 1000-1250 mg/m 2 /day and 

20 liposome-associated nucleic acid of the invention in the same solution to deliver 0. 1 - 1 00 
mg/kg/day). Alternatively, the agents are administered separately but simultaneously in 
their respective effective doses (e.g., 1000-1250 mg/m 2 /d ifosfamide and 0.1 to 100 
mg/kg/day nucleic acid of the invention). 

Gaeta et al, US patents No. 5,760,062; 5,767,278; 5,770,613 have described small 

25 molecule inhibitors of human telomerase RNA (hTR) subunit. 

Blasco et al, 1995, Science, 269, 1267-1270 describe the synthesis and testing of 
antisense oligonucleotides targeted against a specific region of the mouse telomerase RNA 
(mTR) subunit and reported reduction in telomerase activity in mice. 

Bisoffi et al, 1998, Eur. J. Cancer, 34, 1242-1249 have studied the down 

30 regulation of human telomerase activity by a retrovirus vector expressing antisense RNA 
targeted against the hTR RNA. 
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Norton et al, 1996, Nature Biotechnology, 14, 615-619 have reported the use of a 
peptide nucleic acid (PNA) molecule targeting hTR RNA to down regulate telomerase 
activity in human immortal breast epithelial cells. 

Yokoyama et al, 1998, Cancer Research, 58, 5406-5410 have reported the 
5 synthesis and testing of hammerhead ribozyme constructs targeting hTR RNA resulting in 
a decrease in the telomerase activity in Ishikawa cells. 

Henderson, European Patent Application No. 666,3 13-A2 describes methods of 
identifying and cloning hTR gene for use in gene therapy approaches for creating aberrant 
telomeric sequences in transfected human tumor cells. A ribozyme based gene therapy 
1 0 approach to inhibit the expression of hTR gene is described as well. The intended result 
of such therapies involves incurred genetic instability based on non-native telomeric 
sequences resulting in rapid cell death of the treated cells. 

West et al, US patent No. 5,489,508 describe methods for determining telomere 
length and telomerase activity in cells. Inhibitors of hTR RNA, including oligonucleotides 
1 5 and/or small molecules are described. 

These foregoing approaches of targeting the telomerase RNA subunit (TR) may not 
be very beneficial, because as demonstrated by Feng et al, (Feng, J., 1995, Science, 269, 
1236-1241), telomerase activity in humans does not correlate well to hTR concentration. 
Collins et al, International PCT publication No. WO 98/01542 describes assays for 
20 the detection of telomerase activity. Four human telomerase subunit proteins are described 
called pi 40, pi 05, p48 and p43. In addition, hybridization probes and primers are 
described as inhibitors of telomerase gene function. Antibody based inhibitors of 
telomerase protein subunits are described. 

A more attractive approach to telomerase regulation would involve the regulation of 
25 human telomerase by modulating the expression of the protein subunits of the enzyme, 
preferably the reverse transcriptase (hTERT) subunit. Based of reconstitution 
experiments, hTERT and hTR represent the minimal components of telomerase. Since 
hTR expression does not correlate well with telomerase activity in human cells and since 
many human cells express hTR without telomerase activity, targeting hTERT may prove 
30 more beneficial than targeting hTR. hTERT is the only component necessary to restore 
telomerase activity in normal human cells. A study in which the three major subunits of 
telomerase (hTR, TP 1, and hTERT were assayed in normal and malignant endometrial 
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tissues determined that hTERT is a rate limiting determinant of enzymatic activity of 
human telomerase (Kyo, S., 1999, Int. J. Cancer, 80, 60-63). Additional protein subunits 
that have been isolated most likely serve only a structural role in telomerase activity, but 
may be important in enhancing the activity of the telomerase enzyme. As such, hTERT is 
5 one of the better targets for the ectopic regulation of telomerase activity. 

Cech et al, International PCT publication No. WO 98/14593 describe 
compositions and methods related to hTERT for diagnosis, prognosis and treatment of 
human diseases, for altering proliferative capacity in cells and organisms, and for 
screening compounds and treatments with potential use as human therapeutics. 

1 0 Cech et al, International PCT publication No. WO 98/14592 describe nucleic acid 

and amino acid sequences encoding various telomerase protein subunits and motifs of 
Euplotes aediculatus, and related sequences from Schizosaccharomyces, Saccharomyces 
sequences, and human telomerase. The polypeptides comprising telomeric subunits and 
functional polypeptides and ribonucleoproteins that contain these subunits are described as 

1 5 well. Cech et al, International PCT Publication No. WO 98/14592, mentions in general 
terms the the possibility of using antisense and ribozymes to down regulate the expression 
of human telomerase reverse transcriptase enzyme. 

Identification of Potential Target Sites in Human TERT RNA 
20 The sequence of human TERT was screened for accessible sites using a computer 

folding algorithm. Regions of the RNA that did not form secondary folding structures and 
contained potential ribozyme and/or antisense binding/cleavage sites were identified. The 
sequences of these cleavage sites are shown in Tables 13-17. 

25 Selection of Enzymatic Nucleic Acid Cleavage Sites in Human TERT RNA 

To test whether the sites predicted by the computer-based RNA folding algorithm 
corresponded to accessible sites in TERT RNA, 10 hammerhead ribozyme and three G- 
Cleaver ribozyme sites were selected for further analysis (Table 17). Ribozyme target 
sites were chosen by analyzing sequences of Human TERT (Nakamura et al, 1997 

30 Science 277, 955-959; Genbank sequence accession number: NM_003219) and 

prioritizing the sites on the basis of folding. Ribozymes were designed that could bind 
each target and were individually analyzed by computer folding (Christoffersen et al, 
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19947. Mol Struc. Theochem, 311, 273; Jaeger et al, 1989, Proc. Natl. Acad. Sci. USA, 
86, 7706) to assess whether the ribozyme sequences fold into the appropriate secondary 
structure. Those ribozymes with unfavorable intramolecular interactions between the 
binding arms and the catalytic core were eliminated from consideration. As noted below, 
5 varying binding arm lengths can be chosen to optimize activity. Generally, at least 5 bases 
on each arm are able to bind to, or otherwise interact with, the target RNA. 

Chemical Synthesis and Purification of Ribozymes for Efficient Cleavage of TERT RNA 
Ribozymes were designed to anneal to various sites in the RNA message. The 

1 0 binding arms are complementary to the target site sequences described above. The 
ribozymes were chemically synthesized. The method of synthesis used followed the 
procedure for normal RNA synthesis as described above and in Usman et al., (1 987 J. 
Am. Chem. Soo, 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and 
Wincott et al., supra, and made use of common nucleic acid protecting and coupling 

1 5 groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. The 
average stepwise coupling yields were >98%. 

Ribozymes were also synthesized from DNA templates using bacteriophage T7 
RNA polymerase (Milligan and Uhlenbeck, 1989, Methods Enzymol. 180, 51). Ribozymes 
were purified by gel electrophoresis using general methods or were purified by high 

20 pressure liquid chromatography (HPLC; See Wincott et al., supra; the totality of which is 
hereby incorporated herein by reference) and were resuspended in water. The sequences of 
the chemically synthesized ribozymes used in this study are shown below in Table 13-17. 

Ribozvme Cleavage of TERT RNA Target in vitro 
25 Ribozymes targeted to the human TERT RNA are designed and synthesized as 

described above. These ribozymes can be tested for cleavage activity in vitro, for example 

using the following procedure. The target sequences and the nucleotide location within the 

TERT RNA are given in Tables 13-17. 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
30 RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 

[a- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and used as 
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substrate RNA without further purification. Alternately, substrates are 5'- 32 P-end labeled 
using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming 15 |xl of a 
2X concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 
7.5 at 37°C, 10 mM MgCl2) and the cleavage reaction was initiated by adding the 2X 
5 ribozyme mix to an equal volume (15 nl) of substrate RNA (maximum of 1-5 nM; 5 x 10 5 
to 1 x 10 7 cpm) that was also pre-warmed in cleavage buffer. As an initial screen, assays 
are carried out for 1 hour at 37°C using a final concentration of either 40 nM or 1 mM 
ribozyme, i.e., ribozyme excess. The reaction is quenched by the addition of an equal 
volume (30 ul) of 95% formamide, 20 mM EDTA, 0.05% bromophenol blue and 0.05% 
1 0 xylene cyanol after which the sample is heated to 95 C for 2 minutes, quick chilled and 
loaded onto a denaturing polyacrylamide gel. Substrate RNA and the specific RNA 
cleavage products generated by ribozyme cleavage are visualized on an autoradiograph of 
the gel. The percentage of cleavage is determined by Phosphor Imager® quantitation of 
bands representing the intact substrate and the cleavage products. 

15 

Example 2: PTP-1B 

Protein tyrosine phosphorylation and dephosphorylation are important mechanisms 
in the regulation of signal transduction pathways that control the processes of cell growth, 
proliferation, and differentiation (Fantl, W. J., 1993, Annu. Rev. Biochem., 62, 453-481). 

20 Cooperative enzyme classes regulate protein tyrosine phosphorylation and 

dephosphorylation events. These broad classes of enzymes consist of the protein tyrosine 
kinases (PTKs) and protein tyrosine phosphatases (PTPs). PTKs and PTPs can exist as 
both receptor-type transmembrane proteins and as cytoplasmic protein enzymes. Receptor 
tyrosine kinases propagate signal transduction events via extracellular receptor-ligand 

25 interactions that result in the activation of the tyrosine kinase portion of the PTK in the 
cytoplasmic domain. Receptor-like transmembrane PTPs function through extracellular 
ligand binding that modulates dephosphorylation of intracellular phosphotyrosine proteins 
via cytoplasmic phosphatase domains. Cytoplasmic PTKs and PTPs exert enzymatic 
activity without receptor-mediated ligand interactions, however, phosphorylation can 

30 regulate the activity of these enzymes. 
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Protein tyrosine phosphatase IB, a cytoplasmic PTP, was the first PTP to be isolated 
in homogeneous form (Tonks, N. K., 1988, J. Biol. Chem., 263, 6722-6730), characterized 
(Tonks, N. K., 1988, J. Biol. Chem., 263, 6731-6737), and sequenced (Charbonneau, H., 
1989, Biochemistry, 86, 5252-5256). Cytoplasmic and receptor-like PTPs both share a 
5 catalytic domain characterized by eleven conserved amino acids containing cysteine and 
arginine residues that are critical for phosphatase activity (Streuli, M., 1990, EMBO, 9, 
2399-2407). A cysteine residue at position 215 is responsible for the covalent attachment 
of phosphate to the enzyme (Guan, K., 1991, J. Biol. Chem., 266, 17026-17030). The 
crystal structure of human PTP1B defined the phosphate binding site of the enzyme as a 

1 0 glycine rich cleft at the surface of the molecule with cysteine 2 1 5 positioned at the base of 
this cleft. The location of cysteine 215 and the shape of the cleft provide specificity of 
PTPase activity for tyrosine residues but not for serine or threonine residues (Barford, D., 
1994, Science, 263, 1397-1404). 

Receptor tyrosine kinase and protein tyrosine phosphatase localization plays a key 

1 5 role in the regulation of phosphotyrosine mediated signal transduction. PTP- IB activity 
and specificity against a panel of receptor tyrosine kinases demonstrated clear differences 
between substrates, suggesting that cellular compartmentalization is a determinant in 
defining the activity and function of the enzyme (Lammers, R.,1993, J. Biol. Chem., 268, 
22456-22462). Experiments have indicated that PTP-1B is localized predominantly in the 

20 endoplasmic reticulum via its 35 amino acid carboxyterminal sequence. PTP- IB is also 
tightly associated with microsomal membranes with its catalytic phosphatase domain 
oriented towards the cytoplasm (Frangioni, J. V., 1992, Cell, 68, 545-560). 

PTP-1B has been identified as a negative regulator of the insulin response. PTP-1B 
is widely expressed in insulin sensitive tissues (Goldstein, B. J., 1993, Receptor, 3, 1-15). 

25 Isolated PTP-1B dephosphorylates the insulin receptor in vitro (Tonks, N. K., 1988, J. 
Biol. Chem., 263, 673 1-6737). PTP- IB dephosphorylation of multiple phosphotyrosine 
residues of the insulin receptor proceeds sequentially and with specificity for the three 
tyrosine residues that are critical for receptor autoactivation (Ramachandran, C, 1992, 
' Biochemistry, 31, 4232-4238). In addition to insulin receptor dephosphorylation, PTP-1B 

30 also dephosphorylates the insulin related subtrate 1 (IRS-1), a principal substrate of the 
insulin receptor (Lammers, R., 1993, J. Biol. Chem., 268, 22456-22462). 
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Microinjection of PTP1B into Xenopus oocytes results in the inhibition of insulin 
stimulated tyrosine phosphorylation of endogenous proteins, including the P-subunit of the 
insulin and insulin-like growth factor receptor proteins. The resulting 3 to 5 fold increase 
over endogenous PTPase activity also blocks the activation of an S6 peptide kinase 
5 (Cicirelli, M. F., 1990, Proc, Natl. Acad. Sci., 87, 5514-5518). Inactivation of 
recombinant rat PTP-1B with antibody immunoprecipitation results in the dramatic 
increase in insulin stimulated DNA synthesis and phosphatidylinositol 3 '-kinase activity. 
Insulin stimulated receptor autophosphorylation and insulin receptor substrate 1 tyrosine 
phosphorylation are increased dramatically as well through PTP-1B inhibition (Ahmad, F., 

1 0 1995, J. Biol. Chem., 270, 20503-20508). 

Increased PTP-1B expression correlates with insulin resistance in hyperglycemic 
cultured fibroblasts. In this study, desensitized insulin receptor function was observed via 
impaired insulin-induced autophosphorylation of the receptor. Treatment with insulin 
sensitivity normalizing thiazolidine derivatives resulted in the amelioration of the 

1 5 hyperglycemic insulin resistance via a normalization in PTP-1B expression (Maegawa, H., 
1995, J. Biol. Chem., 270, 7724-7730). A murine model of insulin resistance with a 
knockout of the hetrerotrimeric GTP-binding protein subunit Gia2 provides a type 2 
diabetis phenotype that correlates with the increased expression of PTP-1B (Moxam, C. 
M., 1996, Nature, 379, 840-844). 

20 PTP-1B interacts directly with the activated insulin receptor p-subunit. An inactive 

homolog of PTP-1B was used to precipitate the activated insulin receptor in both purified 
receptor preparations and whole-cell lysates. Phosphorylation of the insulin receptor's 
triple tyrosine residues in the kinase domain is necessary for PTP-1B interaction. 
Furthermore, insulin stimulates tyrosine phosphorylation of PTP-1B (Seely, B. L., 1996, 

25 Diabetes, 45, 1 379- 1 3 85). A similar study confirmed the direct interaction of PTP- 1 B 
with the insulin receptor p-subunit as well as the required multiple phosphorylation sites 
within the receptor and PTP-1B (Bandyopadhyay, D., J. Biol. Chem., 272, 1639-1645). 

Knockout mice lacking the PTP- IB gene (both homozygous PTP-1B" / - and 
heterozygous PTP-1B +/ ") have been used to study the specific role of PTP-1B relating to 

30 insulin action in vivo. The resulting PTP-1B deficient mice were healthy and, in the fed 
state, had lower blood glucose and circulating insulin levels that were half that of their 
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PTP-1B +/+ expressing littermates. These PTP-1B deficient mice demonstrated enhanced 
insulin sensitivity in glucose and insulin tolerance tests. At the physiological level, the 
PTP-1B deficient mice showed increased phosphorylation of the insulin receptor after 
insulin administration. When fed a high fat diet, the PTP-1B deficient mice were resistant 
5 to weight gain and remained insulin sensitive as opposed to normal PTP-1B expressing 
mice, who rapidly gained weight and become insulin resistant (Elchebly, M., 1999, 
Science, 283, 1544-1548). As such, modulation of PTP-1B expression could be used to 
regulate autophosphorylation of the insulin receptor and increase insulin sensitivity in vivo. 
This modulation could prove beneficial in the treatment of insulin related disease states. 
10 In light of the above findings, particular disease states that involve PTP-1B 

expression include but are not limited to: 

Diabetes : Both type 1 and type 2 diabetes may be treated by modulation of PTP-1B 
expression. Type 2 diabetes correlates to desensitized insulin receptor function (White et 
al, 1994). Disruption of the PTP-1B dephosphorylation of the insulin receptor in vivo 
1 5 manifests in insulin sensitivity and increased insulin receptor autophosphorylation 
(Elchebly et al, 1999). Insulin dependant diabetes, type 1, may respond to PTP-1B 
modulation through increased insulin sensitivity. 

Obesity : Elchebly et al, 1999, demonstrated that PTP-1B deficient mice were resistant to 
weight gain when fed a high fat diet compared to normal PTP-1B expressing mice. This 

20 finding suggests that PTP-1B modulation may be beneficial in the treatment of obesity. 
Ahmad et al, 1997, Metab. Clin. Exp., 46, 1 140-1 145, describe reduced PTPs in adipose 
tissue and improved insulin sensitivity in obese subjects following weight loss. 

Troglitazone is a non-limiting example of a pharmaceutical agent that can be 
combined with or used in conjunction with the nucleic acid molecules (e.g. ribozymes and 

25 antisense molecules) of the instant invention. Those skilled in the art will recognize that 
other drugs such as anti-diabetes and anti-obesity compounds and therapies can be 
similarly be readily combined with the nucleic acid molecules of the instant invention 
(e.g. ribozymes and antisense molecules) are hence within the scope of the instant 
invention. 

30 Methods have been developed to assay PTP- 1 B activity. 

Maegawa et al, 1995, J. Biol. Chem., 270, 7724-7730, describe a tissue culture 
model in which Rat 1 fibroblasts expressing human insulin receptors can be used to model 
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hyperglycemia induced insulin resistance. Maegawa et al. also describe assays to measure 
PTPase activity using labeled phosphorylated insulin receptors and by immunoenzymatic 
techniques. 

Moxham et al, 1996, Nature, 379, 840-844, describe a murine animal and tissue 
5 culture model employing Gia2 deficiency to study hyperinsulinaemia, impaired glucose 
tolerance and resistance to insulin in vivo. Assays for PTPase activity and tyrosine 
phosphorylation of insulin-receptor substrate 1 are described. 

Khandelwal et al, 1995, Molecular and Cellular Biochemistry, 153, 87-94, describe 
four different animal models for studying insulin dependent and insulin resistant diabetes 
1 0 mellitus. These models were used to study the effect of vanadate, an insulin mimetic and 
PTPase inhibitor, on the insulin-stimulated phosphorylation of the insulin receptor and its 
tyrosine kinase acitivity. 

Wang et al, 1999, Biochim. Biophys. Acta, 1431, 14-23, describe fluorescein 
monophosphates as fluorogenic substrates for PTPs. 
1 5 Various methods and compounds have been developed to inhibit protein tyrosine 

phosphatase activity. 

Wrobel et al, 1999, J. Med. Chem., 42, 3199-3202, describe PTP-1B inhibition and 
antihyperglycemic activity in the ob/ob mouse model by 1 l-arylbenzo[b]naphtho[2,3- 
d]furans and arylbenzo[b]naphtho[2,3-d]thiophenes. 
20 Andersen et al, International PCT publication No. WO 98/DK407 describe the 

preparation of thienopyridzinones and thienochromenones as modulators of PTPases. 

Taing et al, 1999, Biochemistry, 38, 3793-3803, describe potent and highly selective 
inhibitors of PTP-1B comprising an array of bis(aryldifluorophosphonates). 

Ham etal, 1999, Bioorg. Med. Chem. Lett., 9, 185-186, describe selective 
25 inactivation of PTP- 1 B by a sulfone analog of naphthoquinone. 

Desmarais et al, 1999, Biochem, J., 337, 219-223, describe 
[Difluro(phosphono)methyl]phenylalanine-containing peptide inhibitors of PTPs. 

Taylor et al, 1998, Bioorg. Med. Chem., 6, 2235, describe potent non-peptidyl 
inhibitors of PTP-1B. 

30 Kotoris et al, 1998, Bioorg. Med. Chem. Lett., 8, 3275-3280, describe novel 

phosphate mimetics for the design of non-peptidyl inhibitors of PTPs. 
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Groves et al, 1998, Biochemistry, 37, 17773-17783, describe the structural basis for 
PTP-1B inhibition by the phosphotyrosine peptide mimetics 

(difluoronaphthylmethyl)phosphonic acid and the fluoromalonyl tyrosines with complexed 
crystal structures. 

5 Yao et al, 1998, Bioorgl Med. Chem., 6, 1 799-1 810, describe the structure-based 

design and synthesis of small molecule PTP-1B inhibitors comprising novel 
naphthyldifluoromethyl phosphonic acids 1 and 2. 

Taylor et al, 1998, Bioorg. Med. Chem., 6, 1457-1468, describe potent non-peptidyl 
inhibitors of PTP-1B. 

1 0 Desmarais et al, 1998, Arch. Biochem. Biophys., 354, 225-231, describe inhibition 

of PTP-1B and CD45 by sulfotyrosyl peptides. 

Mjalli et al, application US 96-7661 14, cont. in part of US patent No. 543,630, 
describe the preparation of heterocyclic compounds as modulators of proteins with 
phosphotyrosine recognition units. 
1 5 Wang et al, 1998, Bioorg. Med. Chem. Lett., 8, 345-350, describe 

naphthalenebis[a,a-difluoromethylenephosphonates] as potent inhibitors of PTPs. 

Rice et al, 1997, Biochemistry, 36, 15965-15974, describe a targeted library of 
small molecule tyrosine and dual-specificity phosphatase inhibitors with random side 
chain variation from a rational core design. 
20 Olefsky, International PCT publication No. WO 97/US2752 describes a method and 

phosphopeptides used for the treatment of insulin resistance based on the association of 
PTP-1B with the activated insulin receptor. Also included is a method for determining 
whether a compound inhibits PTP-1B binding to the insulin receptor. 

Huyer et al, 1997, J. Biol. Chem., 272, 843-851, describe the mechanism of 
25 inhibition of PTPases by vanadate and pervanadate. 

Burke et al, 1996, Biochemistry, 35, 15989-15996, describe the structure-based 
design of PTP-1B inhibitors. 

Tonks etal, International PCT publication No. WO 97/US13016, describe 
substrate-trapping protein PTPase mutants for identification of tyrosine-phosphorylated 
30 protein substrates and their clinical uses. 
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The human genome is thought to contain up to 100 PTPases, each varying slightly in 
chemistry but vastly in function. Compounds designed to inhibit PTP-1B activity 
specifically by covalent binding to or modification of PTP-1B have the potential for 
multiple side effects. Conventional drug substances that will potently suppress PTP-1B 
5 activity with few or no side effects from interaction with other PTPs are difficult to 

envision. A more attractive approach to PTP-1B modulation would involve the specific 
regulation of PTP-1B expression with oligonucleotides. 

Identification of Potential Target Sites in Human PTP-1B RNA 
1 0 The sequence of human PTP- 1 B was screened for accessible sites using a computer 

folding algorithm. Regions of the RNA that did not form secondary folding structures and 
contained potential ribozyme and/or antisense binding/cleavage sites were identified. The 
sequences of these cleavage sites are shown in Tables 3-8. 

15 Selection of Enzymatic Nucleic Acid Cleavage Sites in Human PTP- IB RNA 

To test whether the sites predicted by the computer-based RNA folding algorithm 
corresponded to accessible sites in PTP- IB RNA, 10 hammerhead ribozyme, five NCH 
and three G-Cleaver ribozyme sites were selected for further analysis (Table 8). 
Ribozyme target sites were chosen by analyzing sequences of Human PTP-1B (Genbank 

20 accession number M33689) and prioritizing the sites on the basis of folding. Ribozymes 
were designed that could bind each target and were individually analyzed by computer 
folding (Christoffersen etal, \99AJ.Mol. Struc. Theochem, 311, 273; Jaeger et al, 1989, 
Proc. Natl. Acad. Sci. USA, 86, 7706) to assess whether the ribozyme sequences fold into 
the appropriate secondary structure. Those ribozymes with unfavorable intramolecular 

25 interactions between the binding arms and the catalytic core were eliminated from 

consideration. As noted below, varying binding arm lengths can be chosen to optimize 
activity. Generally, at least 5 bases on each arm are able to bind to, or otherwise interact 
with, the target RNA. 
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Chemical Synthesis and Purification of Ribozvmes for Efficient Cleavage of PTP-1B RNA 

Ribozymes were designed to anneal to various sites in the RNA message. The 
binding arms are complementary to the target site sequences described above. The 
ribozymes were chemically synthesized. The method of synthesis used followed the 
5 procedure for normal RNA synthesis as described above and in Usman et al., (1987 J. 
Am. Chem. Soc, 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and 
Wincott et al., supra, and made use of common nucleic acid protecting and coupling 
groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. The 
average stepwise coupling yields were >98%. 

1 0 Ribozymes were also synthesized from DNA templates using bacteriophage T7 

RNA polymerase (Milligan and Uhlenbeck, 1989, Methods Enzymol. 180, 51). Ribozymes 
were purified by gel electrophoresis using general methods or were purified by high 
pressure liquid chromatography (HPLC; see Wincott et al., supra; the totality of which is 
hereby incorporated herein by reference) and were resuspended in water. The sequences of 

1 5 the chemically synthesized ribozymes used in this study are shown below in Tables 3-8. 



Ribozvme Cleavage of PTP-1B RNA Target in vitro 

Ribozymes targeted to the human PTP-1B RNA are designed and synthesized as 
described above. These ribozymes can be tested for cleavage activity in vitro, for 

20 example, using the following procedure. The target sequences and the nucleotide location 
within the PTP-1B RNA are given in Tables 3-8. 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 
[ct- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and used as 

25 substrate RNA without further purification. Alternately, substrates are 5'- 32 P-end labeled 
using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X 
concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 7.5 
at 37°C, 10 mM MgCl2) and the cleavage reaction was initiated by adding the 2X 
ribozyme mix to an equal volume of substrate RNA (maximum of 1-5 nM) that was also 

30 pre-warmed in cleavage buffer. As an initial screen, assays are carried out for 1 hour at 
37°C using a final concentration of either 40 nM or 1 mM ribozyme, i.e., ribozyme excess. 
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The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 
heated to 95°C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel. Substrate RNA and the specific RNA cleavage products generated by ribozyme 
5 cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage is 
determined by Phosphor Imager® quantitation of bands representing the intact substrate 
and the cleavage products. 



Example 3: MetAP-2 

1 0 Methionyl aminopeptidases are metalloproteases that are known to possess post- 

translational enzymatic activity by hydrolytically cleaving amino-terminal methionine 
residues from nascent peptide substrates in a non-processive manner (Kendall, R. L., 1992, 
J. Biol. Chem., 267, 20667-20673). This family of enzymes is divided into two classes 
(type 1 and type 2) based on differences in sequence, although the overall structure of the 

1 5 two classes are similar (Liu, S., 1998, Science, 282, 1324-1327). Methionine 

aminopeptidase expression appears to be involved in the control of cellular proliferation. 
Deletion of the MetAP gene from E. Coli is lethal (Chang, S. Y., 1989, J. Bacterid., 171, 
4071-4072). In Saccharomyces cerevisiae, deletion of the gene that codes for either 
MetAP- 1 or 2 results in a slow growth phenotype while deletion of both genes is lethal (Li, 

20 X., 1995, Proc. Natl. Acad. Sci., 92, 12357-12361). (Human methionine aminopeptidase- 
1, MetAP-1, accession No. P53582). 

The aminopeptidase function of this class of enzymes may serve a regulatory role in 
activating signal peptides in conjunction with N-myristoyl transferase (NMT) activity. 
NMT is expressed from a lethal gene in yeast (Duronio, R. J., 1989, Science, 243, 796- 

25 800). NMT is responsible for amino-terminal ligation of myristic acid onto nascent 

peptides and cannot act on peptides with an amino-terminal methionine residue (Resh, M. 
D., 1996, Cell. Signal., 8, 403-412). Myristoylation of proteins correlates to intracellular 
localization events that may determine why certain signaling proteins are dependent on 
NMT for activity (Taunton, J., 1997, Chemistry & Biology, 4, 493-496). Protein tyrosine 

30 kinase Src is dependant on myristoylation for activity and has been identified as an 
upstream regulator of human vascular endothelial growth factor (VEGF) expression 
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through hypoxic induction in solid tumors (Mukhopadhyay, D., 1995, Nature, 375, 577- 
581). MetAPs may therefore regulate the activation of signal peptides (such as VEGF) 
through cotranslational modification of nascent peptides with NMT. Disruption of protein 
myristoylation by MetAP inhibition could result in the improper localization of signaling 
5 proteins resulting in inhibition of cell growth. (Human N-myristoyltransferase, hNMT, 
accession No. AF043324.) 

Fumagillin, a sesquiterpene diepoxide metabolite of the fungus Aspergillus 
fumigatus, and a related compound TNP-470, are strong inhibitors of growth in cultured 
endothelial cells. The antiproliferative and angiostatic activity of fumagillin was originally 

1 0 discovered by the serendipitous contamination of Aspergillus fumigatus in an endothelial 
cell culture dish in which cells closest to the fungal colony displayed growth inhibition. 
Synthetic analogs of fumagillin were later synthesized resulting in the discovery of TNP- 
470, which is 50 times more potent of an inhibitor than fumagillin and is less toxic in mice 
(Ingber, D., 1990, Nature, 348, 555-557). Treatment of endothelial cells with these 

1 5 compounds results in late Gl phase arrest. TNP-470 inhibits the signaling pathway of 
retinoblastoma gene product phosphorylation, cyclin dependent kinases cdk2 and cdk4 
activation, and cyclins E and A expression (Abe, J., 1994, Cancer Res., 54, 3407-3412). 
TNP-470 has also been shown to potently inhibit endothelial cell proliferation induced by 
the growth factors VEGF and bFGF (Toi, M., 1994, Oncology Reports, 1, 423-426). 

20 The bifunctional protein MetAP-2 has been identified as the molecular target for 

fumagillin and related compounds that demonstrate antiproliferative activity in endothelial 
cells. The use of affinity chromatography with a fumagillin-biotin conjugate resulted in 
the isolation of a 67-kDa mammalian protein through covalent interaction with the bound 
substrate. Analysis of digested peptide fragments from the isolated protein revealed 

25 MetAP-2 as the covalently bound substrate. Subsequent growth inhibition studies in yeast 
utilizing MetAP- 1 and MetAP-2 deletion strains determined that MetAP-2 is selectively 
inhibited by fumagillin in vivo (Sin, N., 1997, Proc. Natl. Acad. Sci., 94, 6099-6103). A 
similar study with TNP-470 and ovalicin, another potent inhibitor of neovascularization, 
determined that MetAP-2 is the molecular target for these fumagillin-related compounds 

30 (Griffith, E. C, 1997, Chemistry & Biology, 4, 461-471). 
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MetAP-2 expression correlates with cellular growth. Non-dividing cells in culture 
have no detectable levels of the 67-kDa MetAP-2 protein by immunoassay. MetAP-2 has 
been shown to affect translational initiation by association with eukaryotic initiation factor 
2a (eIF-2a) (Ray, M. K., 1992, Proc. Natl. Acad. Sci., 89, 539-543). The binding of 
5 MetAP-2 with eIF-2a inhibits the heme-regulated inhibitor kinase (HPJ) phosphorylation 
of eIF-2a in vitro in reticulocyte lysates (Datta, B., 1988, Proc. Natl. Acad. Sci., 85, 3324- 
3328). MetAP-2/eIF-2a binding results in the partial reversal of protein synthesis 
inhibition by double stranded RNA dependent kinase mediated phosphorylation in vivo 
(Wu, S., 1996, Biochemistry, 35, 8275-8280). Griffith et al. also determined that covalent 

1 0 binding of TNP-470 and ovalicin, while potently inhibiting methionine aminopeptidase 
type 2 activity specifically, did not affect the regulatory activity of MetAP-2 on eIF-2a. 
This finding by Griffith et al. rules out the possibility that control of eIF-2a 
phosphorylation by MetAP-2 is responsible for the inhibition of endothelial cell 
proliferation by fumagillin related compounds. 

1 5 Particular angiogenesis related degenerative and disease states that can be associated 

with MetAP expression modulation include but are not limited to: 
Cancer : Solid tumors are unable to grow or metastasize without the formation of new 
blood vessels (Hanahan, D., 1996, Cell, 86, 353-364). Inhibition of angiogenesis via 
MetAP modulation can potentially be used to treat a wide variety of cancers. 

20 

Diabetic retinopathy and age related macular degeneration : Ocular 
neovascularization is observed in diabetic retinopathy, which is mediated by up-regulation 
of VEGF (Adamis, A. P., 1994, Amer. J. Ophthal., 118, 445-450). The requirement of 
protein kinase Src in hypoxia induced VEGF expression (Mukhopadhyay, D., 1995, 

25 Nature, 375, 577-581) indicates that MetAP modulation of aminopeptidase activity can 
potentially be used to treat conditions involving ocular neovascularization. 

Arthritis : The ingrowth of a vascular pannus in arthritis may be mediated by the 
overexpression of angiogenic factors from infiltrating inflammatory cells, macrophages, 
and immune cells (Peacock, D. J., 1992, J. exp. Med., 175, 1135-1138). Angiogenesis 

30 inhibition through MetAP modulation can potentially be used to treat arthritis. 
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Psoriasis : Angiogenesis has been implicated in psoriasis due to overexpression of 
the angiogenic polypeptide interleukin-8 and decreased expression of the angiogenesis 
inhibitor thrombospondin (Nickoloff, B. J., 1994, Amer. J. Pathol. 44, 820-828). 
Angiogenesis inhibition through MetAP modulation can potentially be used to treat 
5 psoriasis. 

Female reproduction : Angiogenesis in the female reproductive system has been 
implicated in several disorders of the reproductive tract (Reynolds, L. P., 1992, FASEB, 6, 
886-892). Modulation of angiogenesis through control of MetAP may have various 
applications in the area of female reproduction and fertility. 
1 0 Various methods have been developed to assay MetAP activity. 

Griffith etal, 1998, Proc. Natl. Acad. Sci., 95, 15183-15188, describe an enzymatic 
assay for MetAP-2 activity in vitro and an endothelial cell culture proliferation assay for 
MetAP-2 activity in vivo. 

Weber et al, 1999, International PCT publication No. WO 98/US-21231 describe 
1 5 novel fluorescent reporter molecules and an enzymatic assay that can be used for 
determining the activity of MetAP-2 for drug screening and determining the 
chemosensitivity of human cancer cells to treatment with chemotherapeutic drugs. 

Larrabee, J. A. et al, 1999, Anal. Biochem, 269, 194-198, describe the use of a high- 
pressure liquid chromatographic (HPLC) method for assaying MetAP-2 activity with 
20 application to the study of enzymic inactivation. 

Quantitative methods have been developed to assay the efficacy of antiangiogenic 
therapies. 

Wantanabe et al, 1992, Molec. Biol. Cell, 3, 324a, describe the quantitation of 
angiogenic peptides (bFGF) in human serum as a prognostic test for breast cancer. 
25 Nguyen et al, 1994, J. Nam. Cancer Inst., 86, 356-361, describe the quantitation of 

angiogenic peptides (bFGF) in the urine of patients with a wide spectrum of cancers. 

Li et al, 1994, The Lancet, 344, 82-86, describe the quantitation of angiogenic 
peptides (bFGF) in the cerebrospinal fluid of children with brain tumors. This work also 
describes determining the extent of neovascularization in histological sections by utilizing 
30 microvessel count. 
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The present body of knowledge in angiogenesis research indicates the need for 
compounds that can modulate MetAP activity for research, diagnostic, trait alteration, 
animal health and therapeutic use. 

Griffith et al, International PCT publication No. WO 9856372 describe small 
5 molecule inhibitors of MetAP2 and uses thereof. 

D' Amato et al, International PCT publication No. WO 9805293 describe the use of 
AGM-1470 (TNP-470) as an angiogenesis inhibitor for use in regulating the female 
reproductive system and for treating diseases of the reproductive tissue. 

Davidson et al, US patent No. 5,801,146 describe a compound and method for 
1 0 inhibiting angiogenesis using mammalian kringle 5 protein. 

Cao et al, US patent No. 5,854,221 describe a protein-based endothelial cell 
proliferation inhibitor and its method of use. 

Chang et al, US patent No. 5,888,796 describe a clone of a nucleotide sequence 
encoding a protein having two functions comprising methionine aminopeptidase activity 
1 5 and anti eIF-2 phosphorylation activity. 

Wang et al, 1998, Proc. Am. Assoc. Cancer Res., 39, 98 (abstr.) describe blocked 
proliferation of human endothelial cells by human MetAP-2 antisense oligonucleotides. 

A rat corneal model has been developed to study ribozyme inhibition of VEGF 
receptor-mediated angiogenesis (Pavco, P. A., 1999, Nucleic Acids Research, 27, 2569- 
20 2577). A similar study employing MetAP-2 inhibition could be used to study ribozyme 
based inhibition of MetAP-2 induced angiogenesis in vivo. 

Identification of Potential Target Sites in Human MetAP-2 RNA 

The sequence of human MetAP-2 was screened for accessible sites using a 
25 computer- folding algorithm. Regions of the RNA that did not form secondary folding 
structures and contained potential ribozyme and/or antisense binding/cleavage sites were 
identified. The sequences of these cleavage sites are shown in Tables 9-12. 

Selection of Enzymatic Nucleic Acid Cleavage Sites in Human MetAP-2 RNA 
30 To test whether the sites predicted by the computer-based RNA folding algorithm 

corresponded to accessible sites in MetAP-2 RNA, 1 1 hammerhead ribozyme, 4 NCH and 
three G-Cleaver ribozyme sites were selected for further analysis (Table 12). Ribozyme 
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target sites were chosen by analyzing sequences of Human MetAP-2 (Genbank accession 
number HSU29607) and prioritizing the sites on the basis of folding. Ribozymes were 
designed that could bind each target and were individually analyzed by computer folding 
(Christoffersen et al., 19947. Mol. Struc. Theochem, 311, 273; Jaeger et al, 1989, Proc. 
5 Natl. Acad. Sci. USA, 86, 7706) to assess whether the ribozyme sequences fold into the 
appropriate secondary structure. Those ribozymes with unfavorable intramolecular 
interactions between the binding arms and the catalytic core were eliminated from 
consideration. As noted below, varying binding arm lengths can be chosen to optimize 
activity. Generally, at least 5 bases on each arm are able to bind to, or otherwise interact 
1 0 with, the target RNA. 

Chemical Synthesis and Purification of Ribozymes for Efficient Cleavage of MetAP-2 
RNA 

Ribozymes were designed to anneal to various sites in the RNA message. The 
1 5 binding arms are complementary to the target site sequences described above. The 
ribozymes were chemically synthesized. The method of synthesis used followed the 
procedure for normal RNA synthesis as described above and in Usman et al., (1987 J. 
Am. Chem. Soc, 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and 
Wincott et al., supra, and made use of common nucleic acid protecting and coupling 
20 groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. The 
average stepwise coupling yields were >98%. 

Ribozymes were also synthesized from DNA templates using bacteriophage T7 
RNA polymerase (Milligan and Uhlenbeck, 1989, Methods Enzymol. 180, 51). Ribozymes 
were purified by gel electrophoresis using general methods or were purified by high 
25 pressure liquid chromatography (HPLC; see Wincott et al, supra; the totality of which is 

hereby incorporated herein by reference) and were resuspended in water. The sequences of 
the chemically synthesized ribozymes used in this study are shown below in Table 9-12. 

Ribozvme Cleavage of MetAP-2 RNA Target in vitro 
30 Ribozymes targeted to the human MetAP-2 RNA are designed and synthesized as 

described above. These ribozymes can be tested for cleavage activity in vitro, for 
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example, using the following procedure. The target sequences and the nucleotide location 
within the MetAP-2 RNA are given in Tables 9-12. 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 
5 [a- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and used as 
substrate RNA without further purification. Alternately, substrates are 5'-3 2 P-end labeled 
using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X 
concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 7.5 
at 37°C, 10 mM MgCh) and the cleavage reaction was initiated by adding the 2X 

1 0 ribozyme mix to an equal volume of substrate RNA (maximum of 1-5 nM) that was also 
pre-warmed in cleavage buffer. As an initial screen, assays are carried out for 1 hour at 
37°C using a final concentration of either 40 nM or 1 mM ribozyme, i.e., ribozyme excess. 
The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 

1 5 heated to 95°C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel. Substrate RNA and the specific RNA cleavage products generated by ribozyme 
cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage is 
determined by Phosphor Imager® quantitation of bands representing the intact substrate 
and the cleavage products. 

20 

Example 4: BACE. ps-1, ps-2 

Alzheimer's disease (AD) is a progressive, degenerative disease of the brain which 
affects approximately 4 million people in the United States alone. An estimated 14 
million Americans will have Alzheimer's disease by the middle of the next century if no 

25 cure or definitive prevention of the disease is found. Nearly one out of ten people over age 
65 and nearly half of those over 85 have Alzheimer's disease. Alzheimer's disease is not 
confined to the elderly, a small percentage of people in their 30's and 40's are afflicted 
with early onset AD. Alzheimer's disease is the most common form of dementia, and 
amounts to the third most expensive disease in the US following heart disease and cancer. 

30 An estimated 1 00 billion dollars are spent annually on Alzheimer' s disease (National 
Alzheimer's Association, 1999). 
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Alzheimer's disease is characterized by the progressive formation of insoluble 
plaques and vascular deposits in the brain consisting of the 4 kD amyloid P peptide (AP). 
These plaques are characterized by dystrophic neurites that show profound synaptic loss, 
neurofibrillary tangle formation, and gliosis. Ap arises from the proteolytic cleavage of 
5 the large type I transmembrane protein, p-amyloid precursor protein (APP) (Kang et al, 
1987, Nature, 325, 733). Processing of APP to generate Ap requires two sites of cleavage 
by a p-secretase and a y-secretase. P-secretase cleavage of APP results in the cytoplasmic 
release of a 100 kD soluble amino-terminal fragment, APPsP, leaving behind a 12 kD 
transmembrane carboxy-terminal fragment, C99. Alternately, APP can be cleaved by a a- 

1 0 secretase to generate cytoplasmic APPsa and transmembrane C83 fragments. Both 
remaining transmembrane fragments, C99 and C83, can be further cleaved by a y- 
secretase, leading to the release and secretion of Alzheimer's related AP and a non- 
pathogenic peptide, p3, respectively (Vassar et al, 1999, Science, 286, 735-741). Early 
onset familial Alzheimer's disease is characterized by mutant APP protein with a Met to 

1 5 Leu substitution at position PI, characterized as the "Swedish" familial mutation (Mullan 
et al, 1992, Nature Genet., 1, 345). This APP mutation is characterized by a dramatic 
enhancement in p-secretase cleavage (Citron et al, 1992, Nature, 360, 672). 

The identification of p-secretase, and y-secretase constituents involved in the release 
of p-amyloid protein is of primary importance in the development of treatment strategies 

20 for Alzheimer's disease. Characterization of a-secretase is also important in this regard 
since a-secretase cleavage may compete with p-secretase cleavage resulting in non- 
pathogenic vs. pathogenic protein production. Involvement of the two metalloproteases, 
ADAM 10, and TACE has been demonstrated in a-cleavage of AAP (Buxbaum et al, 
1999, / Biol Chem., 273, 27765, and Lammich et al, 1999, Proc. Natl Acad. Sci. U.S.A., 

25 96, 3922). Studies of y-secretase activity have demonstrated presenilin dependence (De 
Stooper et al, 1998, Nature, 391, 387, and De Stooper et al, 1999, Nature, 398, 518), and 
as such, presenilins have been proposed as y-secretase even though presenilin does not 
present proteolytic activity (Wolfe et al, 1999, Nature, 398, 513). 
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Recently, Vassar et al., 1999, supra reported p-secretase cleavage of AAP by the 
transmembrane aspartic protease beta site APP cleaving enzyme, BACE. While other 
potential candidates for p-secretase have been proposed (for review see Evin et al, 1999, 
Proc. Natl. Acad. Sci. U.S.A., 96, 3922), none have demonstrated the full range of 
5 characteristics expected from this enzyme. Vassar et al, supra, demonstrate that BACE 
expression and localization are as expected for p-secretase, that BACE overexpression in 
cells results in increased P-secretase cleavage of APP and Swedish APP, that isolated 
BACE demonstrates site specific proteolytic activity on APP derived peptide substrates, 
and that antisense mediated endogenous BACE inhibition results in dramatically reduced 

10 P-secretase activity. 

Current treatment strategies for Alzheimer's disease rely on either the prevention or 
the alleviation of symptoms and/or the slowing down of disease progression. Two drugs 
approved in the treatment of Alzheimer's, donepezil (Aricept®) and tacrine (Cognex®), 
both cholinomimetics, attempt to slow the loss of cognitive ability by increasing the 

1 5 amount of acetylcholine available to the brain. Antioxidant therapy through the use of 
antioxidant compounds such as alpha-tocopherol (vitamin E), melatonin, and selegeline 
(Eldepryl®) attempt to slow disease progression by minimizing free radical damage. 
Estrogen replacement therapy is thought to incur a possible preventative benefit in the 
development of Alzheimer's disease based on limited data. The use of anti-inflammatory 

20 drugs may be associated with a reduced risk of Alzheimer's as well. Calcium channel 
blockers such as Nimodipine® are considered to have a potential benefit in treating 
Alzheimer's disease due to protection of nerve cells from calcium overload, thereby 
prolonging nerve cell survival. Nootropic compounds, such as acetyl-L-carnitine (Alcar®) 
and insulin, have been proposed to have some benefit in treating Alzheimer's due to 

25 enhancement of cognitive and memory function based on cellular metabolism. 

Whereby the above treatment strategies may all improve quality of life in 
Alzheimer's patients, there exists an unmet need in the comprehensive treatment and 
prevention of this disease. As such, there exists the need for therapeutics effective in 
reversing the physiological changes associated with Alzheimer's disease, specifically, 

30 therapeutics that can eliminate and/or reverse the deposition of amyloid P peptide. The 
use of compounds to modulate the expression of proteases that are instrumental in the 
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release of amyloid p peptide, namely P-secretase (BACE), and y-secretase (presenilin), is 
of therapeutic significance. 

Tsai et al, 1999, Book of Abstrasts, 218 th ACS National Meeting, New Orleans, 
Aug 22-26, describe substrate-based alpha-aminoisobutyric acid derivatives of difluoro 
5 ketone peptidomimetic inhibitors of amyloid P peptide through y-secretase inhibition. 

Czech et al, International PCT publication No. WO/9921886, describe peptides 
capable of inhibiting the interaction between presenilins and the P-amyloid peptide or its 
precursor for therapeutic use. 

Foumier et al, International PCT publication No. WO/9916874, describe human 
1 0 brain proteins capable of interacting with presenilins and cDNAs encoding them toward 
therapeutic use. 

St. George-Hyslop et al, International PCT publication No. WO/9727296, describe 
genes for proteins that interact with presenilins and their role in Alzheimer's disease 
toward therapeutic use. 

1 5 Vassar et al, 1999, Science, 286, 735-74 1 , describe specific antisense 

oligonucleotides targeting BACE, used for inhibition studies of endogenous BACE 
expression in 101 cells and APPsw cells via lipid mediated transfection. 

Vassar et al, 1999, Science, 286, 735-741, describe a cell culture model for studying 
BACE inhibition. Specific antisense nucleic acid molecules targeting BACE mRNA were 

20 used for inhibition studies of endogenous BACE expression in 101 cells and APPsw 

(Swedish type amyloid precursor protein expressing) cells via lipid mediated transfection. 
Antisense treatment resulted in dramatic reduction of both BACE mRNA by Northern blot 
analysis, and APPspsw ("Swedish" type P-secretase cleavage product) by ELISA, with 
maximum inhibition of both parameters at 75-80%. This model was also used to study the 

25 effect of BACE inhibition on amyloid p-peptide production in APPsw cells. 

Games et al, 1995, Nature, 373, 523-527, describe a transgenic mouse model in 
which mutant human familial type APP (Phe 717 instead of Val) is overexpressed. This 
model results in mice that progressively develop many of the pathological hallmarks of 
Alzheimer's disease, and as such, provides a model for testing therapeutic drugs. 

30 Particular degenerative and disease states that can be associated with BACE 

expression modulation include but are not limited to Alzheimer's disease and dementia. 
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Donepezil, tacrine, selegeline, and acetyl-L-carnitine are non-limiting examples of 
pharmaceutical agents that can be combined with or used in conjunction with the nucleic 
acid molecules (e.g. ribozymes and antisense molecules) of the instant invention. Those 
skilled in the art will recognize that other drugs such as diuretic and antihypertensive 
5 compounds and therapies can be similarly be readily combined with the nucleic acid 
molecules of the instant invention (e.g. ribozymes and antisense molecules) are hence 
within the scope of the instant invention. 



Identification of Potential Target Sites in Human BACE RNA 
1 0 The sequence of human BACE was screened for accessible sites using a computer- 

folding algorithm. Regions of the RNA that did not form secondary folding structures and 
contained potential ribozyme and/or antisense binding/cleavage sites were identified. The 
sequences of these cleavage sites are shown in Tables 18-23. 

15 Selection of Enzymatic Nucleic Acid Cleavage Sites in Human BACE RNA 

Ribozyme target sites were chosen by analyzing sequences of Human BACE 
(Genbank sequence accession number: AF190725) and prioritizing the sites on the basis 
of folding. Ribozymes were designed that could bind each target and were individually 
analyzed by computer folding (Christoffersen et al, 1 994 J. Mol Struc. Theochem, 311, 

20 273; Jaeger et al, 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess whether the 

ribozyme sequences fold into the appropriate secondary structure. Those ribozymes with 
unfavorable intramolecular interactions between the binding arms and the catalytic core 
were eliminated from consideration. As noted below, varying binding arm lengths can be 
chosen to optimize activity. Generally, at least 5 bases on each arm are able to bind to, or 

25 otherwise interact with, the target RNA. 



Chemical Synthesis and Purification of Ribozymes and Antisense for Efficient Cleavage 
and/or blocking of BACE RNA 

Ribozymes and antisense constructs were designed to anneal to various sites in the 
30 RNA message. The binding arms of the ribozymes are complementary to the target site 
sequences described above, while the antisense constructs are fully complimentary to the 
target site sequences described above. The ribozymes and antisense constructs were 



WO 01/16312 PCT/US00/23998 
73 

chemically synthesized. The method of synthesis used followed the procedure for normal 
RNA synthesis as described above and in Usman et al., (1987 J. Am. Chem. Soc, 109, 
7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and Wincott et al., supra, and 
made use of common nucleic acid protecting and coupling groups, such as dimethoxytrityl 
5 at the 5'-end, and phosphoramidites at the 3'-end. The average stepwise coupling yields 
were >98%. 

Ribozymes and antisense constructs were also synthesized from DNA templates 
using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 1 989, Methods 
Enzymol. 180, 51). Ribozymes and antisense constructs were purified by gel 
1 0 electrophoresis using general methods or were purified by high pressure liquid 
chromatography (HPLC; See Wincott et al., supra; the totality of which is hereby 
incorporated herein by reference) and were resuspended in water. The sequences of the 
chemically synthesized ribozymes and antisense constructs used in this study are shown 
below in Table 18-23. 

15 

Ribozvme Cleavage of B ACE RNA Target in vitro 

Ribozymes targeted to the human BACE RNA are designed and synthesized as 
described above. These ribozymes can be tested for cleavage activity in vitro, for 
example, using the following procedure. The target sequences and the nucleotide location 

20 within the BACE RNA are given in Tables 1 8-23 . 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 
[a- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and used as 
substrate RNA without further purification. Alternately, substrates are 5'- 32 P-end labeled 

25 using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X 

concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 7.5 
at 37°C, 10 mM MgCl2) and the cleavage reaction was initiated by adding the 2X 
ribozyme mix to an equal volume of substrate RNA (maximum of 1-5 nM) that was also 
pre-warmed in cleavage buffer. As an initial screen, assays are carried out for 1 hour at 

30 37°C using a final concentration of either 40 nM or 1 mM ribozyme, i.e., ribozyme excess. 
The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
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EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 
heated to 95°C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel. Substrate RNA and the specific RNA cleavage products generated by ribozyme 
cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage is 
5 determined by Phosphor Imager® quantitation of bands representing the intact substrate 
and the cleavage products. 

Example 5: Phospholamban 

Cardiac disease leading to heart failure is the leading cause of combined morbidity 

1 0 and mortality in the developed world. Nearly twenty million people worldwide suffer 

from heart failure related disease. An estimated five million Americans are afflicted with 
congestive heart failure (CHF), with 400,000 new cases diagnosed each year. In the US, 
cardiac disease associated failure results in approximately 40,000 deaths per year, and is 
associated with an additional 250,000 deaths (Harnish, 1999, Drug & Market 

1 5 Development, 10, 1 14-1 19). Heart failure related disease represents a major public health 
issue due to an overall increase in prevalence and incidence in aging populations with a 
greater proportion of survivors of acute myocardial infarction (AMI) (Kannel et al, 199 A, 
Br. Heart. J., 72 (suppl), 3). Heart failure related disease represents the most common 
reason for hospitalization of elderly patients in the US. The resulting life expectancy of 

20 these patients is less than that of many common cancers, with five year survival rates for 
men and women at only 25% and 38% respectively, and with one year mortality rates for 
severe heart failure at 50% (Ho et al, 1993, Circulation, 88, 107). 

Heart disease is characterized by a progressive decrease in cardiac output resulting 
from insufficient pumping activity of the diseased heart. The resulting venous back- 

25 pressure results in peripheral and pulmonary dysfunctional congestion. The heart responds 
to a variety of mechanical, hemodynamic, hormonal, and pathological stimuli by 
increasing muscle mass in response to an increased demand for cardiac output. The 
resulting transformation of heart tissue (myocardial hypertrophy) can arise as a result of 
genetic, physiologic, and environmental factors, and represents an early indication of 

30 clinical heart disease and an important risk factor for subsequent heart failure (Hunter and 
Chien, 1999, New England J. of Medicine, 99, 313-322). 
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Coronary heart disease is a predominant factor in the development of the cardiac 
disease state, along with prior AMI, hypertension, diabetes mellitus, and valvular heart 
disease. Diagnosis of cardiac disease includes determination of coronary heart disease 
associated left ventricular systolic dysfunction (LVSD) and/or left ventricular diastolic 
5 dysfunction (LVDD) by echocaardiographic imaging (Cleland, 1 997, Dis Management 
Health Outcomes, 1, 169). Promising diagnosis may also rely on assaying atrial natriuretic 
peptide (ANP) and brain natriuretic peptide (BNP) concentrations. ANP and BNP levels 
are indicative of the level of ventricular dysfunction (Davidson et al, 1996, Am. J. 
Cardiol, 77, 828). 

1 0 Current treatment strategies for cardiac disease associated failure are varied. 

Diuretics are often used to reduce pulmonary edema and dyspnea in patients with fluid 
overload, and are usually used in conjunction with angiotensin converting enzyme (ACE) 
inhibitors for vasodilation. Digoxin is another popular choice for treating cardiac disease 
as an ionotropic agent, however, doubts remain concerning the long-term efficacy and 

1 5 safety of Digoxin (Harnish, 1 999, Drug & Market Development, 1 0, 1 1 4- 1 1 9). Carvedilol, 
a beta-blocker, has been introduced to complement the above treatments in order to slow 
down the progression of cardiac disease. Antiarrhythmic agents can be used in order to 
reduce the risk of sudden death in patients suffering from cardiac disease. Lastly, heart 
transplants have been effective in the treatment of patients with advanced stages of cardiac 

20 disease, however, the limited supply of donor hearts greatly limits the scope of this 

treatment to the broad population (Harnish, 1999, Drug & Market Development, 10, 1 14- 
119). 

Whereby the above treatment strategies can all improve morbidity and mortality 
associated with cardiac disease, the only existing definitive approach to curing the diseased 
25 heart is replacement by transplant. Even a healthy, transplanted heart can become diseased 
in response to the various stresses of mechanical, hemodynamic, hormonal, and 
pathological stimuli associated with extrinsic risk factors. As such there exists the need 
for therapeutics effective in reversing the physiological changes associated with cardiac 
disease. 

30 Myocardial hypertrophy and apoptosis are the underlying degenerative process 

associated with cardiac hypertrophy and failure. A variety of signaling pathways are 
involved in the progression of myocardial hypertrophy and myocardial apoptosis. Genetic 
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studies have been instrumental in elucidating these pathways and their involvement in 
cardiac disease through in vitro assays of cardiac muscle cells and in vivo studies of 
genetically engineered animals ; 

Studies in which the expression of specific genes have been altered in cardiac 
5 myocytes have shown that specific peptide hormones, growth factors, and cytokines can 
activate various features of the hypertrophic response (Hunter and Chien, 1999, New 
England J of Medicine, 99, 313-322). Particular substances that have been characterized 
from these studies include potential therapeutic and molecular targets involved in heart 
failure. Hunter et ai, in Chien, KR, ed. Molecular basis of heart disease: a companion 
10 to Braunwald's Heart Disease, Philadelphia: W.B. Saunders, 1999:21 1-250, describe 
classes of therapeutic and molecular targets involved in heart failure including: 

1. Endothelin 1 and angiotensin II receptor antagonists, and antagonists of ras, p38, and 
c-jun N-terminal kinase (JNK) for inhibition of pathologic hypertrophy. 

2. Insulin like growth factor I and growth hormone receptor stimulation for promotion of 
1 5 physiologic hypertrophy. 

3 . beta- 1 -adrenergic receptor blockers for inhibition of neurohumoral over stimulation. 

4. Phospholamban and Sarcolipin small molecule inhibitors for relief of sarcoplasmic 
reticulum calcium ATPase inhibition to provide enhancement of myocardial contractile 
and relaxation responses. 

20 5. Small molecule inhibitors of 0-adrenergic receptor kinase to counteract the 

desensitization of G protein coupled receptor kinases in order to provide enhancement 

of myocardial contractile and relaxation responses. 
6. Enhancement of angiogenic growth factors (VEGF, FGF-5) for relief of energy 

deprivation in cardiac tissues. 
25 7. Promoters of myocyte survival including gp 130 ligands (cardiotrophin 1), and 

Neuregulin for the inhibition of apoptosis of myocytes. 

8. Inhibitors of apoptosis such as Caspase inhibitors for the inhibition of apoptosis of 
myocytes. 

9. Inhibitors of cytokines such as TNF-alpha for the inhibition of apoptosis of myocytes. 
30 Congestive heart failure, heart failure, dilated cardiomyopathy and pressure overload 

hypertrophy are nonlimiting examples of disorders and disease states that can be 
associated with the above classes of molecular targets. 
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The failure of cardiac contractile performance leading to cardiac disorders and 
disease, governed by impairment of cardiac excitation/contraction coupling, points to the 
importance of the signaling pathways involved in this process. The release and uptake of 
cytosolic Ca 2+ by the sarcoplasmic reticulum plays an integral role in each cycle of cardiac 
5 contraction and excitation (Minamisawa et al, 1999, Cell, 99, 313-322). The process of 
Ca 2+ reuptake is mediated by the cardiac sarcoplasmic reticulum Ca 2+ ATPase 
(SERCA2a). SERCA2a activity is regulated by phospholamban, a p52 muscle specific 
sarcoplasmic reticulum phosphoprotein (Koss et al, 1996, Circ. Res., 79, 1059-1063, and 
Simmerman et al, 1998, Physiol. Rev., 78, 921-947). In its active, unphosphorylated state, 

1 0 phospholamban is a potent inhibitor of SERCA2a activity. Phosphorylation of 
phospholamban at serine 16 by cyclic AMP-dependent protein kinase (PKA) or 
calmodulin kinase, results in the inhibition of phospholamban interaction with SERCA2a. 
This phosphorylation event is predominantly responsible for the proportional increase in 
the rate of Ca 2+ uptake into the sarcoplasmic reticulum and resultant ventricular relaxation 

1 5 (Tada et al, 1982, Mol. Cell. Biochem., 46, 73-95, and Luo et al, 1998, J. Biol. Chem., 
273, 4734-4739). 

Since a proportional decrease in Ca 2+ uptake is a hallmark feature of heart failure 
(Sordahl et al, 1973, Am. J. Physiol., 224, 497-502) and since an increase in the relative 
ratio of phospholamban to SERCA2a is an important determinant of sarcoplasmic 
20 reticulum dysfunction in heart failure (Hasenfuss, 1998, Cardiovasc. Res., 37, 279-289), 
the targeting of phospholamban and related regulatory factors as therapeutic targets for 
heart disorders should prove valuable for cardiac indications. 

Pystynen et al, International PCT publication No. WO 99/00132, describe bisethers 
of 1-oxa, aza and thianaphthalen-2-ones as small molecule inhibitors of phospholamban 
25 for increasing coronary flow via direct dilation of the coronary arteries. 

Pystynen et al, International PCT publication No. WO 99/15523, describe bisethers 
of 1-oxa, aza and thianaphthalen-2-ones as small molecule inhibitors of phospholamban 
that are useful for treating heart failure. 

The efficacy of the above mentioned treatment strategies is limited. Small molecule 
30 inhibition of a molecular target is often limited by toxicity, which can restrict dosing and 
overall efficacy. 
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He et al, 1999, Circulation, 100, 974-980, describe endogenous expression of 
mutant phospholamban and phospholamban antisense RNA to investigate the 
corresponding effect on SERCA2a activity and cardiac myocyte contractility. 

A more attractive approach to the treatment of heart disease would involve the use of 
5 ribozymes and/or antisense constructs to modulate the expression of target molecules 
involved in heart failure. The use of nucleic acid molecules of the instant invention 
permits highly specific regulation of the molecular targets of interest, including 
phospholamban (PLN) (GenBank accession No. NM 002667), sarcolipin (SLN) 
(GenBank accession No. NM_003063), angiotensin II receptor (GenBank accession No. 

1 0 U20860), endothelin 1 receptor (GenBank accession No. NM_001957), K-ras (GenBank 
accession No. NM_004985), p38 (GenBank accession No. AF092535), c-jun N-terminal 
kinase (GenBank accession No. NM_002750, L31951, NM_002753), growth hormone 
receptor (GenBank accession No. NM_000163), insulin-like growth factor I receptor 
(GenBank accession No. NM_000875), beta- 1 -adrenergic receptor (GenBank accession 

15 No. NM_000024), (31-adrenergic receptor kinase (GenBank accession No. NM_001619, 
NM_005160), VEGF receptor (GenBank accession No. U43368, M27281 XI 5997), 
fibroblast growth factor 5 (GenBank accession No. NM 004464), cardiotrophin I 
(GenBank accession No. NM_001330), neuregulin (GenBank accession No. AF009227), 
TNF-alpha (GenBank accession No. X02910 X02159), PI3 kinase (GenBank accession 

20 No. NM_006218, NM_006219, U86453, NM_002649, M61906), and AKT kinase 
(GenBank accession No. NM_005163, M77198). 

Various methods have been developed to assay phospholamban activity in vitro and 
in vivo. Holt et al, 1999, J. Mol. Cell. Cardiol., 31, 645-656, describe a cell culture model 
in which thyroid hormone control of contraction and the Ca 2+ -ATPase/phospholamban 

25 complex is studied in adult rat ventricular myocytes. Slack et al. 1997, J. Biol. Chem., 
272, 18862-18868, describe studies in which the ectopic expression of phospholamban in 
mouse fast-twitch skeletal muscle cells alters sarcoplasmic reticulum Ca 2+ transport and 
muscle relaxation. MacLennan et al, 1996, Soc. Gen. Physiol. Ser., 51, 89-103, in a 
review of regulatory interactions between calcium ATPases and phospholamban describe 

30 phospholamban/ Ca 2+ -ATPase interactions in protein expressed in heterologous cell 
culture experiments. Cornwell et al, 1991, Mol. Pharmacol., 40,923-931, describe the 
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regulation of sarcoplasmic reticulum protein phosphorylation by localized cyclic GMP- 
dependent protein kinase in vascular smooth muscle cells. 

Minamisawa et al, 1999, Cell, 99, 313-322, describe a phospholamban knockout 
mouse model which affords protection from induced dilated cardiomyopathy. Dillmann et 
5 al, 1999, Am. J. Cardiol, 83, 89H-91H, describe a transgenic rat model for the study of 
altered expression of calcium regulatory proteins, including phospholamban, and their 
effect on myocyte contractile response. LekanneDeprez et al, 1998, J. Mol Cell. 
Cardiol, 30, 1 877-1888, describe a rat pressure-overload model to investigate alterations 
in gene expression of phospholamban, atrial natriuretic peptide (ANP), sarcoplasmic 

1 0 endoplasmic reticular calcium ATPase 2 (SERCA2), collagen Hied, and calsequestrin 
(CSQ). Jones et al, 1998, J. Clin. Invest., 101, 1385-1393, describe a mouse model for 
investigating the regulation of calcium signaling in transgenic mouse cardiac myocytes 
overexpressing calsequestrin. In this study, the upregulation and downregulation of 
calcium uptake and release proteins were determined, including phospholamban. Lorenz 

1 5 et al, 1997, Am J. Physiol, 273, 6, describe a mouse model for the study of regulatory 
effects of phospholamban on cardiac function in intact mice. This study makes use of 
animal models with altered levels of phospholamban to permit in vivo evaluation of the 
physiological role of phospholamban. Arai et al, 1996, Saishin Igaku, 51, 1095-1104, 
presents a review article of gene targeted animal models expressing cardiovascular 

20 abnormalities. The study of phospholamban and other protein expression modification 
effects in mice is presented. Wankerl et al, 1995, J. Mol Med., 73, 487-496, presents a 
review article describing the study of calcium transport proteins in the nonfailing and 
failing heart. Animal models investigating the major calcium handling myocardial 
proteins, including phospholamban, are described. These models, as well as others, may 

25 be used to evaluate the effect of treatment with nucleic acid molecules of the instant 

invention on cardiac function. Endpoints may be, but are not limited to, left ventricular 
pressure, left ventricular pressure as a function of time (LVdP/dt), and mean arterial blood 
pressure. Endpoints will be evaluated under basal and stimulated (cardiac load) 
conditions. 

30 Particular degenerative and disease states that can be associated with 

phospholamban expression modulation include but are not limited to congestive heart 
failure, heart failure, dilated cardiomyopathy and pressure overload hypertrophy: 
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Digoxin, Bendrofluazide, Dofetilide, and Carvedilol are non-limiting examples of 
pharmaceutical agents that can be combined with or used in conjunction with the nucleic 
acid molecules (e.g. ribozymes and antisense molecules) of the instant invention. Those 
skilled in the art will recognize that other drugs such as diuretic and antihypertensive 
5 compounds and therapies can be similarly be readily combined with the nucleic acid 
molecules of the instant invention (e.g. ribozymes and antisense molecules) are hence 
within the scope of the instant invention. 

Identification of Potential Target Sites in Human phospholamban RNA 
1 0 The sequence of human phospholamban was screened for accessible sites using a 

computer folding algorithm. Regions of the RNA that did not form secondary folding 
structures and contained potential ribozyme and/or antisense binding/cleavage sites were 
identified. The sequences of these cleavage sites are shown in Tables 24-30. 

15 Selection of Enzymatic Nucleic Acid Cleavage Sites in Human phospholamban RNA 
Ribozyme target sites were chosen by analyzing sequences of Human 
phospholamban (Genbank sequence accession number: NM_002667) and prioritizing the 
sites on the basis of folding. Ribozymes were designed that could bind each target and 
were individually analyzed by computer folding (Christoffersen et al., 1994 J. Mol. Struc. 

20 Theochem, 311, 273; Jaeger et al, 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess 
whether the ribozyme sequences fold into the appropriate secondary structure. Those 
ribozymes with unfavorable intramolecular interactions between the binding arms and the 
catalytic core were eliminated from consideration. As noted below, varying binding arm 
lengths can be chosen to optimize activity. Generally, at least 5 bases on each arm are able 

25 to bind to, or otherwise interact with, the target RNA. 

Chemical Synthesis and Purification of Ribozymes and Antisense for Efficient Cleavage 
and/or blocking of phospholamban RNA 

Ribozymes and antisense constructs were designed to anneal to various sites in the 
30 RNA message. The binding arms of the ribozymes are complementary to the target site 
sequences described above, while the antisense constructs are fully complimentary to the 
target site sequences described above. The ribozymes and antisense constructs were 
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chemically synthesized. The method of synthesis used followed the procedure for normal 
RNA synthesis as described above and in Usman et al., (1987 J. Am. Chem. Soc, 109, 
7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and Wincott et al., supra, and 
made use of common nucleic acid protecting and coupling groups, such as dimethoxytrityl 
5 at the 5'-end, and phosphoramidites at the 3'-end. The average stepwise coupling yields 
were >98%. 

Ribozymes and antisense constructs were also synthesized from DNA templates 
using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 1989, Methods 
Enzymol. 180, 51). Ribozymes and antisense constructs were purified by gel 
0 electrophoresis using general methods or were purified by high pressure liquid 
chromatography (HPLC; see Wincott et al., supra; the totality of which is hereby 
incorporated herein by reference) and were resuspended in water. The sequences of the 
chemically synthesized ribozymes and antisense constructs used in this study are shown 
below in Table 24-30. 

5 

Ribozvme Cleavage of phospholamban RNA Target in vitro 

Ribozymes targeted to the human phospholamban RNA are designed and 
synthesized as described above. These ribozymes can be tested for cleavage activity in 
vitro, for example using the following procedure. The target sequences and the nucleotide 

0 location within the phospholamban RNA are given in Tables 24-30. 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 
[a- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and used as 
substrate RNA without further purification. Alternately, substrates are 5'-32p. e nd labeled 

5 using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X 

concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 7.5 
at 37°C, 10 mM MgCb) and the cleavage reaction was initiated by adding the 2X 
ribozyme mix to an equal volume of substrate RNA (maximum of 1-5 nM) that was also 
pre-warmed in cleavage buffer. As an initial screen, assays are carried out for 1 hour at 

0 37 C using a final concentration of either 40 nM or 1 mM ribozyme, i.e., ribozyme excess. 
The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
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EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 
heated to 95°C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel. Substrate RNA and the specific RNA cleavage products generated by ribozyme 
cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage is 
determined by Phosphor Imager® quantitation of bands representing the intact substrate 
and the cleavage products. 

Tissue distribution of BrdU-labeled antisense in mice 

CD1 mice were injected with a single bolus (30 mg/kg) of a BrdU-labeled antisense 
oligonucleotide or a similar molar amount of BrdU (as a control). At various time points 
(30 min, 2h and 6 h), mice were sacrificed and major tissues isolated and fixed. 
Distribution of antisense oligonucleotides was determined by probing with an anti-BrdU 
antibody and immunohistochemical staining. Tissue slices were probed with an anti-BrdU 
antibody followed by a reporter enzyme-conjugated second antibody and finally an enzyme 
substrate. Visualization of the colored product by microscopy indicated nuclear staining, 
demonstrating effective distribution of antisense oligonucleotide in cardiac tissue. 

Tissue distribution of BrdU-labeled ribozymes in monkey 

Rhesus monkeys were dosed with BrdU-labeled ribozyme by intravenous bolus 
injection at 0. 1 , 1 .0, and 1 0 mg/kg once daily over five days. Saline injection was used in 
control animals. Animals were sacrificed and major tissues isolated and fixed. Tissue 
samples were probed with an anti-BrdU antibody followed by a reporter enzyme- 
conjugated second antibody and finally an enzyme substrate. Significant quantities of 
chemically modified ribozyme are detected in cardiac tissue following this dosing regimen. 

Example 6: HBV 

Chronic hepatitis B is caused by an enveloped virus, commonly known as the 
hepatitis B virus or HBV. HBV is transmitted via infected blood or other body fluids, 
especially saliva and semen, during delivery, sexual activity, or sharing of needles 
contaminated by infected blood. Individuals may be "carriers" and transmit the infection 
to others without ever having experienced symptoms of the disease. Persons at highest 
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risk are those with multiple sex partners, those with a history of sexually transmitted 
diseases, parenteral drug users, infants born to infected mothers, "close" contacts or sexual 
partners of infected persons, and healthcare personnel or other service employees who 
have contact with blood. Transmission is also possible via tattooing, ear or body piercing, 
and acupuncture; the virus is also stable on razors, toothbrushes, baby bottles, eating 
utensils, and some hospital equipment such as respirators, scopes and instruments. There 
is no evidence that HBsAg positive food handlers pose a health risk in an occupational 
setting, nor should they be excluded from work. Hepatitis B has never been documented 
as being a food-borne disease. The average incubation period is 60 to 90 days, with a 
range of 45 to 180; the number of days appears to be related to the amount of virus to 
which the person was exposed. However, determining the length of incubation is difficult, 
since onset of symptoms is insidious. Approximately 50% of patients develop symptoms 
of acute hepatitis that last from 1 to 4 weeks. Two percent or less of these individuals 
develop fulminant hepatitis resulting in liver failure and death. 

The determinants of severity include: (1) The size of the dose to which the person 
was exposed; (2) the person's age with younger patients experiencing a milder form of the 
disease; (3) the status of the immune system with those who are immunosuppressed 
experiencing milder cases; and (4) the presence or absence of co-infection with the Delta 
virus (hepatitis D), with more severe cases resulting from co-infection. In symptomatic 
cases, clinical signs include loss of appetite, nausea, vomiting, abdominal pain in the right 
upper quadrant, arthralgia, and tiredness/loss of energy. Jaundice is not experienced in all 
cases, however, jaundice is more likely to occur if the infection is due to transfusion or 
percutaneous serum transfer, and it is accompanied by mild pruritus in some patients. 
Bilirubin elevations are demonstrated in dark urine and clay-colored stools, and liver 
enlargement may occur accompanied by right upper-quadrant pain. The acute phase of the 
disease may be accompanied by severe depression, meningitis, Guillain-Barre syndrome, 
myelitis, encephalitis, agranulocytosis, and/or thrombocytopenia. 

Hepatitis B is generally self-limiting and will resolve in approximately 6 months. 
Asymptomatic cases can be detected by serologic testing, since the presence of the virus 
leads to production of large amounts of HBsAg in the blood. This antigen is the first and 
most useful diagnostic marker for active infections. However, if HBsAg remains positive 
for 20 weeks or longer, the person is likely to remain positive indefinitely and is now a 
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carrier. While only 10% of persons over age 6 who contract HBV become carriers, 90% of 
infants infected during the first year of life do so. 

Hepatitis B virus (HBV) infects over 300 million people worldwide (Imperial, 1999, 
Gastroenterol. Hepatol, 14 (suppl), Sl-5). In the United States approximately 1.25 

5 million individuals are chronic carriers of HBV as evidenced by the fact that they have 
measurable hepatitis B virus surface antigen HBsAg in their blood. The risk of becoming 
a chronic HBsAg carrier is dependent upon the mode of acquisition of infection as well as 
the age of the individual at the time of infection. For those individuals with high levels of 
viral replication, chronic active hepatitis with progression to cirrhosis, liver failure and 

0 hepatocellular carcinoma (HCC) is common, and liver transplantation is the only treatment 
option for patients with end-stage liver disease from HBV. 

The natural progression of chronic HBV infection over a 10 to 20 year period leads 
to cirrhosis in 20-to-50% of patients and progression of HBV infection to hepatocellular 
carcinoma has been well documented. There have been no studies that have determined 

5 sub-populations that are most likely to progress to cirrhosis and/or hepatocellular 
carcinoma, thus all patients have equal risk of progression. 

It is important to note that the survival for patients diagnosed with hepatocellular 
carcinoma is only 0.9 to 12.8 months from initial diagnosis (Takahashi et ah, 1993, 
American Journal of Gastroenterology, 88, 240-243). Treatment of hepatocellular 

0 carcinoma with chemotherapeutic agents has not proven effective and only 10% of patients 
will benefit from surgery due to extensive tumor invasion of the liver (Trinchet et al, 
1994, Presse Medicine, 23, 831-833). Given the aggressive nature of primary 
hepatocellular carcinoma, the only viable treatment alternative to surgery is liver 
transplantation (Pichlmayr et al, 1994, Hepatology., 20, 33S-40S). 

5 Upon progression to cirrhosis, patients with chronic HCV infection present with 

clinical features, which are common to clinical cirrhosis regardless of the initial cause 
(D'Amico et al, 1986, Digestive Diseases and Sciences, 31, 468-475). These clinical 
features may include: bleeding esophageal varices, ascites, jaundice, and encephalopathy 
(Zakim D, Boyer TD. Hepatology a textbook of liver disease, Second Edition Volume 1 . 

0 1990 W.B. Saunders Company. Philadelphia). In the early stages of cirrhosis, patients are 
classified as compensated, meaning that although liver tissue damage has occurred, the 
patient's liver is still able to detoxify metabolites in the blood-stream. In addition, most 
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patients with compensated liver disease are asymptomatic and the minority with symptoms 
report only minor symptoms such as dyspepsia and weakness. In the later stages of 
cirrhosis, patients are classified as decompensated meaning that their ability to detoxify 
metabolites in the bloodstream is diminished and it is at this stage that the clinical features 

5 described above will present. 

In 1986, D'Amico et al. described the clinical manifestations and survival rates in 
1 155 patients with both alcoholic and viral associated cirrhosis (D'Amico supra). Of the 
1 155 patients, 435 (37%) had compensated disease although 70% were asymptomatic at 
the beginning of the study. The remaining 720 patients (63%) had decompensated liver 

1 0 disease with 78% presenting with a history of ascites, 3 1 % with jaundice, 1 7% had 
bleeding and 16% had encephalopathy. Hepatocellular carcinoma was observed in six 
(0.5%) patients with compensated disease and in 30 (2.6%) patients with decompensated 
disease. 

Over the course of six years, the patients with compensated cirrhosis developed 
5 clinical features of decompensated disease at a rate of 10% per year. In most cases, ascites 
was the first presentation of decompensation. In addition, hepatocellular carcinoma 
developed in 59 patients who initially presented with compensated disease by the end of 
the six-year study. 

With respect to survival, the D'Amico study indicated that the five-year survival rate 
!0 for all patients on the study was only 40%. The six-year survival rate for the patients who 
initially had compensated cirrhosis was 54% while the six-year survival rate for patients 
who initially presented with decompensated disease was only 2 1 %. There were no 
significant differences in the survival rates between the patients who had alcoholic 
cirrhosis and the patients with viral related cirrhosis. The major causes of death for the 
!5 patients in the D'Amico study were liver failure in 49%; hepatocellular carcinoma in 22%; 
and, bleeding in 13% (D'Amico supra). 

Hepatitis B virus is a double-stranded circular DNA virus. It is a member of the 
Hepadnaviridae family. The virus consists of a central core that contains a core antigen 
(HBcAg) surrounded by an envelope containing a surface protein/surface antigen (HBsAg) 
[0 and is 42 nm in diameter. It also contains an e antigen (HBeAg) which, along with HBcAg 
and HBsAg, is helpful in identifying this disease 
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In HBV virions, the genome is found in an incomplete double-stranded form. HBV 
uses a reverse transcriptase to transcribe a positive-sense full length RNA version of its 
genome back into DNA. This reverse transcriptase also contains DNA polymerase activity 
and thus begins replicating the newly synthesized minus-sense DNA strand. However, it 

5 appears that the core protein encapsidates the reverse-transcriptase/polymerase before it 
completes replication. 

From the free-floating form, the virus must first attach itself specifically to a host 
cell membrane. Viral attachment is one of the crucial steps which determines host and 
tissue specificity. However, currently there are no in vitro cell-lines that can be infected by 

0 HBV. There are some cells lines, such as HepG2, which can support viral replication only 
upon transient or stable transfection using HBV DNA. 

After attachment, fusion of the viral envelope and host membrane must occur to 
allow the viral core proteins containing the genome and polymerase to enter the cell. Once 
inside, the genome is translocated to the nucleus where it is repaired and cyclized. 

5 The complete closed circular DNA genome of HBV remains in the nucleus and 

gives rise to four transcripts. These transcripts initiate at unique sites but share the same 
3 '-ends. The 3.5-kb pregenomic RNA serves as a template for reverse transcription and 
also encodes the nucleocapsid protein and polymerase. A subclass of this transcript with a 
5' -end extension codes for the precore protein that, after processing, is secreted as HBV e 

!0 antigen. The 2.4-kb RNA encompasses the pre-Sl open reading frame (ORF) that encodes 
the large surface protein. The 2. 1 -kb RNA encompasses the pre-S2 and S ORFs that 
encode the middle and small surface proteins, respectively. The smallest transcript (-0.8- 
kb) codes for the X protein, a transcriptional activator. 

Multiplication of the HBV genome begins within the nucleus of an infected cell. 

!5 RNA polymerase II transcribes the circular HBV DNA into greater-than-full length 
mRNA. Since the mRNA is longer than the actual complete circular DNA, redundant 
ends are formed. Once produced, the pregenomic RNA exits the nucleus and enters the 
cytoplasm. 

The packaging of pregenomic RNA into core particles is triggered by the binding of 
0 the HBV polymerase to the 5' epsilon stem-loop. RNA encapsidation is believed to occur 
as soon as binding occurs. The HBV polymerase also appears to require associated core 
protein in order to function. The HBV polymerase initiates reverse transcription from the 
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5' epsilon stem-loop three to four base pairs at which point the polymerase and attached 
nascent DNA are transferred to the 3' copy of the DR1 region. Once there, the (-)DNA is 
extended by the HBV polymerase while the RNA template is degraded by the HBV 
polymerase RNAse H activity. When the HBV polymerase reaches the 5' end, a small 

5 stretch of RNA is left undigested by the RNAse H activity. This segment of RNA is 
comprised of a small sequence just upstream and including the DR1 region. The RNA 
oligomer is then translocated and annealed to the DR2 region at the 5' end of the (-)DNA. 
It is used as a primer for the (+)DNA synthesis which is also generated by the HBV 
polymerase. It appears that the reverse transcription as well as plus strand synthesis may 

0 occur in the completed core particle. 

Since the pregenomic RNA is required as a template for DNA synthesis, this RNA is 
an excellent target for ribozyme cleavage. Nucleoside analogues that have been 
documented to inhibit HBV replication target the reverse transcriptase activity needed to 
convert the pregenomic RNA into DNA. Ribozyme cleavage of the pregenomic RNA 

5 template would be expected to result in a similar inhibition of HBV replication. Further, 
targeting the 3'-end of the pregenomic RNA that is common to all HBV transcripts could 
result in reduction of all HBV gene products and an additional level of inhibition of HBV 
replication. 

As previously mentioned, HBV does not infect cells in culture. However, 
!0 transfection of HBV DNA (either as a head-to-tail dimer or as an "overlength" genome of 
>100%) into HuH7 or Hep G2 hepatocytes results in viral gene expression and production 
of HBV virions released into the media. Thus, HBV replication competent DNA would be 
co-transfected with ribozymes in cell culture. Such an approach has been used to report 
intracellular ribozyme activity against HBV (zu Putlitz, et al, 1999, J. Virol., 73, 5381- 
:5 5387, and Kim et al, 1999, Biochem. Biophys. Res. Commun., 257, 759-765). In addition, 
stable hepatocyte cell lines have been generated that express HBV. In these cells only 
ribozyme would need to be delivered; however, a delivery screen would need to be 
performed. In addition, stable hepatocyte cell lines have been generated that express HBV. 
Intracellular HBV gene expression can be assayed by a Taqman® assay for HBV 
10 RNA or by ELISA for HBV protein. Extracellular virus can be assayed by PCR for DNA 
or ELISA for protein. Antibodies are commercially available for HBV surface antigen and 
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core protein. A secreted alkaline phosphatase expression plasmid can be used to 
normalize for differences in transfection efficiency and sample recovery. 

There are several small animal models to study HBV replication. One is the 
transplantation of HBV-infected liver tissue into irradiated mice. Viremia (as evidenced 

5 by measuring HBV DNA by PCR) is first detected 8 days after transplantation and peaks 
between 18-25 days (Ilan et al, 1999, Hepatology, 29, 553-562). 

Transgenic mice that express HBV have also been used as a model to evaluate 
potential anti-virals. HBV DNA is detectable in both liver and serum (Morrey et al, 1999, 
Antiviral Res., 42, 97-108). 

1 0 An additional model is to establish subcutaneous tumors in nude mice with Hep G2 

cells transfected with HBV. Tumors develop in about 2 weeks after inoculation and 
express HBV surface and core antigens. HBV DNA and surface antigen is also detected in 
the circulation of tumor-bearing mice (Yao et al, 1996, J. Viral Hepat., 3, 19-22). 

Woodchuck hepatitis virus (WHV) is closely related to HBV in its virus structure, 

1 5 genetic organization, and mechanism of replication. As with HBV in humans, persistent 
WHV infection is common in natural woodchuck populations and is associated with 
chronic hepatitis and hepatocellular carcinoma (HCC). Experimental studies have 
established that WHV causes HCC in woodchucks and woodchucks chronically infected 
with WHV have been used as a model to test a number of anti-viral agents. For example, 

!0 the nucleoside analogue 3T3 was observed to cause dose dependent reduction in virus 
(50% reduction after two daily treatments at the highest dose) (Hurwitz et al, 1998. 
Antimicrob. Agents Chemother., 42, 2804-2809). 

Current therapeutic goals of treatment are three-fold: to eliminate infectivity and 
transmission of HBV to others, to arrest the progression of liver disease and improve the 

!5 clinical prognosis, and to prevent the development of hepatocellular carcinoma (HCC). 
Interferon alpha use is the most common therapy for HBV; however, recently 
Lamivudine (3TC) has been approved by the FDA. Interferon alpha (IFN-alpha) is one 
treatment for chronic hepatitis B. The standard duration of IFN-alpha therapy is 16 weeks, 
however, the optimal treatment length is still poorly defined. A complete response (HBV 

10 DNA negative HBeAg negative) occurs in approximately 25% of patients. Several factors 
have been identified that predict a favorable response to therapy including: High ALT , 
low HBV DNA , being female, and heterosexual orientation. 
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There is also a risk of reactivation of the hepatitis B virus even after a successful 
response, this occurs in around 5% of responders and normally occurs within 1 year. 

Side effects resulting from treatment with type 1 interferons can be divided into four 
general categories including: Influenza-like symptoms, neuropsychiatric, laboratory 

5 abnormalities, and other miscellaneous side effects. Examples of influenza-like 

symptoms include, fatigue, fever; myalgia, malaise, appetite loss, tachycardia, rigors, 
headache and arthralgias. The influenza-like symptoms are usually short-lived and tend to 
abate after the first four weeks of dosing (Dusheiko et al, 1994, Journal of Viral Hepatitis, 
1, 3-5). Neuropsychiatric side effects include irritability, apathy, mood changes, insomnia, 

0 cognitive changes, and depression. Laboratory abnormalities include the reduction of 
myeloid cells, including granulocytes, platelets and to a lesser extent, red blood cells. 
These changes in blood cell counts rarely lead to any significant clinical sequellae. In 
addition, increases in triglyceride concentrations and elevations in serum alaine and 
aspartate aminotransferase concentration have been observed. Finally, thyroid 

5 abnormalities have been reported. These thyroid abnormalities are usually reversible after 
cessation of interferon therapy and can be controlled with appropriate medication while on 
therapy. Miscellaneous side effects include nausea, diarrhea, abdominal and back pain, 
pruritus, alopecia, and rhinorrhea. In general, most side effects will abate after 4 to 8 
weeks of therapy (Dushieko et al, supra ). 

!0 Lamivudine (3TC) is a nucleoside analogue, which is a very potent and specific 

inhibitor of HBV DNA synthesis. Lamivudine has recently been approved for the 
treatment of chronic Hepatitis B. Unlike treatment with interferon, treatment with 3TC 
does not eliminate the HBV from the patient. Rather, viral replication is controlled and 
chronic administration results in improvements in liver histology in over 50% of patients. 

!5 Phase m studies with 3TC, showed that treatment for one year was associated with 

reduced liver inflammation and a delay in scarring of the liver. In addition, patients treated 
with Lamivudine (lOOmg per day) had a 98 percent reduction in hepatitis B DNA and a 
significantly higher rate of seroconversion, suggesting disease improvements after 
completion of therapy. However, stopping of therapy resulted in a reactivation of HBV 

10 replication in most patients. In addition recent reports have documented 3TC resistance in 
approximately 30% of patients. 
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Particular degenerative and disease states that can be associated with HBV 
expression modulation include but are not limited to, HBV infection, hepatitis, cancer, 
tumorigenesis, cirrhosis, liver failure and others. 

Lamivudine (3TC), L-FMAU, adefovir dipivoxil, type 1 Interferon, therapeutic 
5 vaccines, steriods, and 2 '-5' Oligoadenylates are non-limiting examples of pharmaceutical 
agents that can be combined with or used in conjunction with the nucleic acid molecules 
(e.g. ribozymes and antisense molecules) of the instant invention. Those skilled in the art 
will recognize that other drugs such as diuretic and antihypertensive compounds or other 
therapies can similarly and readily be combined with the nucleic acid molecules of the 
1 0 instant invention (e.g. ribozymes and antisense molecules) and are, therefore, within the 
scope of the instant invention. 

Current therapies for treating HBV infection, including interferon and nucleoside 
analogues, are only partially effective. In addition, drug resistance to nucleoside analogues 
is now emerging, making treatment of chronic Hepatitis B more difficult. Thus, a need 
1 5 exists for effective treatment of this disease which utilizes antiviral inhibitors which work 
by mechanisms other than those currently utilized in the treatment of both acute and 
chronic hepatitis B infections. 

Draper, US patent No. 6,017,756, describes the use of ribozymes for the inhibition 
of Hepatitis B Virus. 

10 Passman et al, 2000, Biochem. Biophys. Res. Commun., 268(3), 728-733.; Gan et 

al, 1998, J. Med. Coll. PLA, 13(3), 157-159.; Li et al., 1999, Jiefangjun YixueZazhi, 
24(2), 99-101.; Putlitzef al., 1999, J. Virol., 73(7), 5381-5387.; Kim et al, 1999, 
Biochem. Biophys. Res. Commun., 257(3), 759-765.; Xuetal, 1998, Bingdu Xuebao, 
14(4), 365-369.; Welch et al, 1997, Gene Ther., 4(7), 736-743.; Goldenberg et al, 1997, 

>5 International PCT publication No. WO 97/08309, Wands et al, 1 997, of 
Gastroenterology and Hepatology, 12(suppl.), S354-S369.; Ruiz et al, 1997, 
BioTechniques, 22(2), 338-345.; Gan etal, 1996, J. Med. Coll PLA, 11(3), 171-175.; 
Beck and Nassal, 1995, Nucleic Acids Res., 23(24), 4954-62.; Goldenberg, 1995, 
International PCT publication No. WO 95/22600.; Xu et al, 1993, Bingdu Xuebao, 9(4), 

>0 33 1-6.; Wang et al, 1993, Bingdu Xuebao, 9(3), 278-80, all describe ribozymes that are 
targeted to cleave a specific HBV target site. 
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The enzymatic nucleic acid molecules of the instant invention exhibit a high degree 
of specificity for only the viral mRNA in infected cells. Nucleic acid molecules of the 
instant invention targeted to highly conserved sequence regions allow the treatment of 
many strains of human HBV with a single compound. No treatment presently exists which 
5 specifically attacks expression of the viral gene(s) that are responsible for transformation 
ofhepatocytes by HBV. 

The methods of this invention can be used to treat human hepatitis B virus 
infections, which include productive virus infection, latent or persistent virus infection, 
and HBV-induced hepatocyte transformation. The utility can be extended to other species 
1 0 of HBV which infect non-human animals where such infections are of veterinary 
importance. 

Preferred target sites are genes required for viral replication, a non-limiting example 
includes genes for protein synthesis, such as the 5' most 1 500 nucleotides of the HBV 
pregenomic mRNAs. For sequence references, see Renbao et al., 1987, Sci. Sin., 30, 507. 

1 5 This region controls the translational expression of the core protein (C), X protein (X) and 
DNA polymerase (P) genes and plays a role in the replication of the viral DNA by serving 
as a template for reverse transcriptase. Disruption of this region in the RNA results in 
deficient protein synthesis as well as incomplete DNA synthesis (and inhibition of 
transcription from the defective genomes). Target sequences 5' of the encapsidation site 

20 can result in the inclusion of the disrupted 3' RNA within the core virion structure and 
targeting sequences 3' of the encapsidation site can result in the reduction in protein 
expression from both the 3' and 5' fragments. 

Alternative regions outside of the 5' most 1500 nucleotides of the pregenomic 
mRNA also make suitable targets of enzymatic nucleic acid mediated inhibition of HBV 

25 replication. Such targets include the mRNA regions that encode the viral S gene. 

Selection of particular target regions will depend upon the secondary structure of the 
pregenomic mRNA. Targets in the minor mRNAs can also be used, especially when 
folding or accessibility assays in these other RNAs reveal additional target sequences that 
are unavailable in the pregenomic mRNA species. 

30 A desirable target in the pregenomic RNA is a proposed bipartite stem-loop structure 

in the 3'-end of the pregenomic RNA which is believed to be critical for viral replication 
(Kidd and Kidd-Ljunggren, 1996. Nuc. Acid Res. 24:3295-3302). The 5'end of the HBV 



WO 01/16312 PCT/US00/23998 
92 

pregenomic RNA carries a cw-acting encapsidation signal, which has inverted repeat 
sequences that are thought to form a bipartite stem-loop structure. Due to a terminal 
redundancy in the pregenomic RNA, the putative stem-loop also occurs at the 3'-end. 
While it is the 5' copy which functions in polymerase binding and encapsidation, reverse 

5 transcription actually begins from the 3' stem-loop. To start reverse transcription, a 4 nt 
primer which is covalently attached to the polymerase is made, using a bulge in the 5' 
encapsidation signal as template. This primer is then shifted, by an unknown mechanism, 
to the DR1 primer binding site in the 3' stem-loop structure, and reverse transcription 
proceeds from that point. The 3' stem-loop, and especially the DR1 primer binding site, 

0 appear to be highly effective targets for ribozyme intervention. 

Sequences of the pregenomic RNA are shared by the mRNAs for surface, core, 
polymerase, and X proteins. Due to the overlapping nature of the HBV transcripts, all 
share a common 3'-end. Ribozyme targeting this common 3'-end will thus cleave the 
pregenomic RNA as well as all of the mRNAs for surface, core, polymerase and X 

5 proteins. 

In preferred embodiments, the invention features a method for the analysis of HBV 
proteins. This method is useful in determining the efficacy of HBV inhibitors. 
Specifically, the instant invention features an assay for the analysis of HBsAg proteins and 
secreted alkaline phosphatase (SEAP) control proteins to determine the efficacy of agents 

!0 used to modulate HBV expression. 

The method consists of coating a micro-titer plate with an antibody such as anti- 
HBsAg Mab (for example, Biostride B88-95-31ad,ay) at 0.1 to 10 ug/ml in a buffer (for 
example, carbonate buffer, such as Na 2 C0 3 15 mM, NaHC0 3 35 mM, pH 9.5) at 4°C 
overnight. The microtiter wells are then washed with PBST or the equivalent thereof, (for 

15 example, PBS, 0.05% Tween 20) and blocked for 0.1-24 hr at 37° C with PBST, 1% BSA 
or the equivalent thereof. Following washing as above, the wells are dried (for example, at 
3 7° C for 30 min). Biotinylated goat anti-HBsAg or an equivalent antibody (for example, 
Accurate YVS1807) is diluted (for example at 1:1000) in PBST and incubated in the wells 
(for example, 1 hr. at 37° C). The wells are washed with PBST (for example, 4x). A 

0 conjugate, (for example, Streptavidin/Alkaline Phosphatase Conjugate, Pierce 21324) is 
diluted to 10-10,000 ng/ml in PBST, and incubated in the wells (for example, 1 hr. at 37° 
C). After washing as above, a substrate (for example, p-nitrophenyl phosphate substrate, 
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Pierce 37620) is added to the wells, which are then incubated (for example, 1 hr. at 37° C). 
The optical density is then determined (for example, at 405 ran). SEAP levels are then 
assayed, for example, using the Great EscAPe® Detection Kit (Clontech K2041-1), as per 
the manufacturers instructions. In the above example, incubation times and reagent 
5 concentrations may be varied to achieve optimum results, a non-limiting example is 
described in Example 6. 

Comparison of this HBsAg ELISA method to a commercially available assay from 
World Diagnostics, Inc. 15271 NW 60 th Ave, #201, Miami Lakes, FL 33014 (305) 827- 
3304 (Cat. No. EL10018) demonstrates an increase in sensitivity (signal :noise) of 3-20 



Identification of Potential Target Sites in Human HBV RNA 

The sequence of human HBV was screened for accessible sites using a computer- 
folding algorithm. Regions of the RNA that did not form secondary folding structures and 
5 contained potential ribozyme and/or antisense binding/cleavage sites were identified. The 
sequences of these cleavage sites are shown in Tables 36-43. 



Selection of Enzymatic Nucleic Acid Cleavage Sites in Human HBV RNA 

Ribozyme target sites were chosen by analyzing sequences of Human HBV 

!0 (accession number: AF100308.1) and prioritizing the sites on the basis of folding. 

Ribozymes were designed that could bind each target and were individually analyzed by 
computer folding (Christoffersen et al, 1994 /. Mol. Struc. Theochem. 31 1, 273; Jaeger et 
al, 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess whether the ribozyme sequences 
fold into the appropriate secondary structure. Those ribozymes with unfavorable 

!5 intramolecular interactions between the binding arms and the catalytic core were 
eliminated from consideration. As noted herein, varying binding arm lengths can be 
chosen to optimize activity. Generally, at least 5 bases on each arm are able to bind to, or 
otherwise interact with, the target RNA. 
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Chemical Synthesis and Purification of Ribozvmes and Antisense for Efficient Cleavage 
and/or blocking of HBV RNA 

Ribozymes and antisense constructs were designed to anneal to various sites in the 
RNA message. The binding arms of the ribozymes are complementary to the target site 

5 sequences described above, while the antisense constructs are fully complementary to the 
target site sequences described above. The ribozymes and antisense constructs were 
chemically synthesized. The method of synthesis used followed the procedure for normal 
RNA synthesis as described above and in Usman et al, (1987 /. Am. Chem. Soc, 109, 
7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and Wincott et al., supra, and 

0 made use of common nucleic acid protecting and coupling groups, such as dimethoxytrityl 
at the 5 '-end, and phosphoramidites at the 3 '-end. The average stepwise coupling yields 
were typically >98%. 

Ribozvmes and antisense constructs were also synthesized from DNA templates 
using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 1989, Methods 

5 Enzymol. 180, 51). Ribozymes and antisense constructs were purified by gel 
electrophoresis using general methods or were purified by high pressure liquid 
chromatography (HPLC; see Wincott et al., supra; the totality of which is hereby 
incorporated herein by reference) and were resuspended in water. The sequences of the 
chemically synthesized ribozymes used in this study are shown below in Table 43. 

!0 

Ribozvme Cleavage of HBV RNA Target in vitro 

Ribozymes targeted to the human HBV RNA are designed and synthesized as 
described above. These ribozymes can be tested for cleavage activity in vitro, for example 
using the following procedure. The target sequences and the nucleotide location within the 
!5 HBV RNA are given in Tables 36-43. 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 
[a- 32 p] CTP, passed over a G 50 Sephadex® column by spin chromatography and used as 

substrate RNA without further purification. Alternately, substrates are 5'- 32 P-end labeled 
0 using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X 

concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 7.5 
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at 37°C, 10 mM MgCl2) and the cleavage reaction was initiated by adding the 2X 
ribozyme mix to an equal volume of substrate RNA (maximum of 1-5 nM) that was also 
pre-warmed in cleavage buffer. As an initial screen, assays are carried out for 1 hour at 
37 C using a final concentration of either 40 nM or 1 mM ribozyme, i.e., ribozyme excess. 

5 The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 
heated to 95 C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel. Substrate RNA and the specific RNA cleavage products generated by ribozyme 
cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage is 

0 determined by Phosphor Imager® quantitation of bands representing the intact substrate 
and the cleavage products. 

Transfection of HepG2 Cells with psHBV-1 and Ribozymes 

The human hepatocellular carcinoma cell line Hep G2 was grown in Dulbecco's 

5 modified Eagle media supplemented with 10% fetal calf serum, 2 mM glutamine, 0. 1 mM 
nonessential amino acids, 1 mM sodium pyruvate, 25 mM Hepes, 100 units penicillin, and 
100 ug/ml streptomycin. To generate a replication competent cDNA, prior to transfection 
the HBV genomic sequences are excised from the bacterial plasmid sequence contained in 
the psHBV-1 vector (Those skilled in the art understand that other methods may be used 

10 to generate a replication competent cDNA). This was done with an EcoRI and Hind III 
restriction digest. Following completion of the digest, a ligation was performed under 
dilute conditions (20 ug/ml) to favor intermolecular ligation. The total ligation mixture 
was then concentrated using Qiagen spin columns. 

Secreted alkaline phosphatase (SEAP) was used to normalize the HBsAg levels to 

:5 control for transfection variability. The pSEAP2-TK control vector was constructed by 
ligating a Bgl II-Hind III fragment of the pRL-TK vector (Promega), containing the herpes 
simplex virus thymidine kinase promoter region, into Bgl U/Hind HI digested pSEAP2- 
Basic (Clontech). Hep G2 cells were plated (3 x 10 4 cells/well) in 96-well microtiter plates 
and incubated overnight. A lipidTDNA/ribozyme complex was formed containing (at final 

0 concentrations) cationic lipid (15 ug/ml), prepared psHBV-1 (4.5 ug/ml), pSEAP2-TK 
(0.5 ug/ml), and ribozyme (100 uM). Following a 15 min. incubation at 37° C, the 



WO 01/16312 PCT7US00/23998 
96 

complexes were added to the plated Hep G2 cells. Media was removed from the cells 96 
hr. post-transfection for HBsAg and SEAP analysis. 

Transfection of the human hepatocellular carcinoma cell line, Hep G2, with 
replication competent HBV DNA results in the expression of HBV proteins and the 
5 production of virions. To investigate the potential use of ribozymes for the treatment of 
chronic HBV infection, a series of ribozymes that target the 3' terminus of the HBV 
genome have been synthesized. Ribozymes targeting this region have the potential to 
cleave all four major HBV RNA transcripts as well as the potential to block the production 
of HBV DNA by cleavage of the pregenomic RNA. To test the efficacy of these HBV 

1 0 ribozymes, they were co-transfected with HBV genomic DNA into Hep G2 cells, and the 
subsequent levels of secreted HBV surface antigen (HBsAg) were analyzed by ELISA. To 
control for variability in transfection efficiency, a control vector which expresses secreted 
alkaline phosphatase (SEAP), was also co-transfected. The efficacy of the HBV 
ribozymes was determined by comparing the ratio of HBsAg:SEAP and/or HBeAg:SEAP 

15 to that of a scrambled attenuated control (SAC) ribozyme. Twenty-five ribozymes 

(RPI18341, RPI18356, RPI18363, RPI18364, RPI18365, RPI18366, RPI18367, RPI18368, 
RPI18369, RPI18370, RPI18371, RPI18372, RPI18373, RPI18374, RPI18303, RPI18405, 
RPI18406, RPI18407, RPI18408, RPI18409, RPI18410, RPI18411, RPI18418, RPI18419, 
and RPI18422) have been identified which cause a reduction in the levels of HBsAg 

20 and/or HBeAg as compared to the corresponding SAC ribozyme. 



Example 6: Analysis of HBsAg and SEAP Levels Following R ibozvme Treatment 

Immulon 4 (Dynax) microtiter wells were coated overnight at 4° C with anti-HBsAg 
Mab (Biostride B88-95-31ad,ay) at 1 ug/ml in Carbonate Buffer (Na 2 C0 3 15 mM, 

25 NaHCOs 35 mM, pH 9.5). The wells were then washed 4x with PBST (PBS, 0.05% 
Tween® 20) and blocked for 1 hr at 37° C with PBST, 1% BSA. Following washing as 
above, the wells were dried at 37° C for 30 min. Biotinylated goat ant-HBsAg (Accurate 
YVS1807) was diluted 1:1000 in PBST and incubated in the wells for 1 hr. at 37° C. The 
wells were washed 4x with PBST. Streptavidin/Alkaline Phosphatase Conjugate (Pierce 

30 21324) was diluted to 250 ng/ml in PBST, and incubated in the wells for 1 hr. at 37° C. 
After washing as above, p-nitrophenyl phosphate substrate (Pierce 37620) was added to 
the wells, which were then incubated for 1 hr. at 37° C. The optical density at 405 nm was 
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then determined. SEAP levels were assayed using the Great EscAPe® Detection Kit 
(Clontech K2041-1), as per the manufacturers instructions. 

Example 7: X-gene Reporter Assay 

The effect of ribozyme treatment on the level of transactivation of a SV40 promoter 
driven firefly luciferase gene by the HBV X-protein was analyzed in transfected Hep G2 
cells. As a control for variability in transfection efficiency, a Renilla luciferase reporter 
driven by the TK promoter, which is not transactivated by the X protein, was used. Hep 
G2 cells were plated (3 x 10 4 cells/well) in 96-well microtiter plates and incubated 
overnight. A lipid/DNA/ribozyme complex was formed containing (at final 
concentrations) cationic lipid (2.4 ug/ml), the X-gene vector pSBDR(2.5 ug/ml), the firefly 
reporter pSV40HCVluc (0.5 ug/ml), the Renilla luciferase control vector pRL-TK (0.5 
Ug/ml), and ribozyme (100 uM). Following a 15 min. incubation at 37° C, the complexes 
were added to the plated Hep G2 cells. Levels of firefly and Renilla luciferase were 
analyzed 48 hr. post transfection, using Promega's Dual-Luciferase Assay System. 

The HBV X protein is a transactivator of a number of viral and cellular genes. 
Ribozymes which target the X region were tested for their ability to cause a reduction in X 
protein transactivation of a firefly luciferase gene driven by the SV40 promoter in 
transfected Hep G2 cells. As a control for transfection variability, a vector containing the 
Renilla luciferase gene driven by the TK promotor, which is not activated by the X protein, 
was included in the co-transfections. The efficacy of the HBV ribozymes was determined 
by comparing the ratio of firefly luciferase: Renilla luciferase to that of a scrambled 
attenuated control (SAC) ribozyme. Eleven ribozymes (RPI18365, RPI18367, RPI18368, 
RPI18371, RPI18372, RPI18373, RPI18405, RPI18406, RPI1841 1, RPI18418, RPI18423) 
were identified which cause a reduction in the level of transactivation of a reporter gene by 
the X protein, as compared to the corresponding SAC ribozyme. 

Example 8: HBV transgenic mouse study 

A transgenic mouse strain (founder strain 1.3.32 with a C57B1/6 background) that 
expresses HBV RNA and forms HBV viremia (Morrey et al, 1999, Antiviral Res., 42, 97- 
108; Guidotti et al., 1995, J. Virology, 69, 10, 6158-6169) was utilized to study the in vivo 
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activity of ribozymes of the instant invention. This model is predictive in screening for 
anti-HBV agents. Ribozyme or the equivalent volume of saline was administered via a 
continuous s.c. infusion using Alzet® mini-osmotic pumps for 14 days. Alzet® pumps 
were filled with test material(s) in a sterile fashion according to the manufacturer's 

5 instructions. Prior to in vivo implantation, pumps were incubated at 37°C overnight (> 18 
hours) to prime the flow modulators. On the day of surgery, animals were lightly 
anesthetized with a ketamine/xylazine cocktail (94 mg/kg and 6 mg/kg, respectively; 0.3 
ml, IP). Baseline blood samples (200 ul) were obtained from each animal via a retro- 
orbital bleed. A 2 cm area near the base of the tail was shaved and cleansed with betadine 

0 surgical scrub and sequentially with 70% alcohol. A 1 cm incision in the skin was made 
with a #15 scalpel blade or a blunt pair of scissors near the base of the tail. Forceps were 
used to open a pocket rostrally (i.e., towards the head) by spreading apart the subcutaneous 
connective tissue. The pump was inserted with the delivery portal pointing away from the 
incision. Wounds were closed with sterile 9-mm stainless steel clips or with sterile 4-0 

5 suture. Animals were then allowed to recover from anesthesia on a warm heating pad 
before being returned to their cage. Wounds were checked daily. Clips or sutures were 
replaced as needed. Incisions typically healed completely within 7 days post-op. Animals 
were then deeply anesthetized with the ketamine/xylazine cocktail (150 mg/kg and 10 
mg/kg, respectively; 0.5 ml, IP) on day 14 post pump implantation. A midline 

!0 thoracotomy/ laparatomy was performed to expose the abdominal cavity and the thoracic 
cavity. The left ventricle was cannulated at the base and animals exsanguinated using a 
23G needle and 1 ml syringe. Serum was separated, frozen and analyzed for HBV DNA 
and antigen levels. Experimental groups were compared to the saline control group in 
respect to percent change from day 0 to day 14. HBV DNA was assayed by quantitative 

!5 PCR 

Results 

Table 44 is a summary of the group designation and dosage levels used in the HBV 
transgenic mouse study. Baseline blood samples were obtained via a retroorbital bleed and 
0 animals (N=10/group) received anti-HBV ribozymes (100 mg/kg/day) as a continuous SC 
infusion. After 14 days, animals treated with a ribozyme targeting site 273 (RPI. 18341) of 
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the HBV RNA showed a significant reduction in serum HBV DNA concentration, 
compared to the saline treated animals as measured by a quantitative PCR assay. More 
specifically, the saline treated animals had a 69% increase in serum HBV DNA 
concentrations over this 2-week period while treatment with the 273 ribozyme 
5 (RPI. 18341) resulted in a 60% decrease in serum HBV DNA concentrations. Ribozymes 
directed against sites 1833 (RPI. 18371), 1873 (RPI. 18418), and 1874 (RPI.18372) 
decreased serum HBV DNA concentrations by 49%, 15% and 16%, respectively. 

Example 7: Activity of NCH Ribozyme to inhibit HER2 gene expression 

10 HER2 (also known as neu, erbB2 and c-erbB2) is an oncogene that encodes a 185- 

kDa transmembrane tyrosine kinase receptor. HER2 is a member of the epidermal growth 
factor receptor (EGFR) family and shares partial homology with other family members. In 
normal adult tissues HER2 expression is low. However, HER2 is overexpressed in at least 
25-30% of breast (McGuire & Greene, 1989) and ovarian cancers (Berchuck, et al, 1990). 

5 Furthermore, overexpression of HER2 in malignant breast tumors has been correlated with 
increased metastasis, chemoresistance and poor survival rates (Slamon et al, 1987 Science 
235: 177-182). Because HER2 expression is high in aggressive human breast and ovarian 
cancers, but low in normal adult tissues, it is an attractive target for ribozyme-mediated 
therapy (Thompson et al, supra). 

!0 The greatest HER2 specific effects have been observed in cancer cell lines that 

express high levels of HER2 protein (as measured by ELISA). Specifically, in one study 
that treated five human breast cancer cell lines with the HER2 antibody (anti-erbB2-sFv), 
the greatest inhibition of cell growth was seen in three cell lines (MDA-MB-361, SKBR-3 
and BT-474) that express high levels of HER2 protein. No inhibition of cell growth was 

!5 observed in two cell lines (MDA-MB-23 1 and MCF-7) that express low levels of HER2 
protein (Wright et al, 1997). Another group successfully used SKBR-3 cells to show 
HER2 antisense oligonucleotide-mediated inhibition of HER2 protein expression and 
HER2 RNA knockdown (Vaughn et al, 1995). Other groups have also demonstrated a 
decrease in the levels of HER2 protein, HER2 mRNA and/or cell proliferation in cultured 

i0 cells using anti-HER2 ribozymes or antisense molecules (Suzuki, T. et al, 1997; Weichen, 
et al, 1997; Czubayko, F. et al, 1997; Colomer, et al, 1994; Betram et al, 1994). 
Because cell lines that express higher levels of HER2 have been more sensitive to anti- 
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HER2 agents, we are pursuing several medium to high expressing cell lines, including 
SKBR-3 and T47D, for ribozyme screens in cell culture. 

A variety of endpoints have been used in cell culture models to look at HER2- 
mediated effects after treatment with anti-HER2 agents. Phenotypic endpoints include 

5 inhibition of cell proliferation, apoptosis assays and reduction of HER2 protein expression. 
Because overexpression of HER2 is directly associated with increased proliferation of 
breast and ovarian tumor cells, a proliferation endpoint for cell culture assays will be our 
primary screen. There are several methods by which this endpoint can be measured. 
Following treatment of cells with ribozymes, cells are allowed to grow (typically 5 days) 

0 after which either the cell viability, the incorporation of [ 3 H] thymidine into cellular DNA 
and/or the cell density can be measured. The assay of cell density is very straightforward 
and can be done in a 96-well format using commercially available fluorescent nucleic acid 
stains (such as Syto 13 or CyQuant). The assay using CyQuant is in place at RPI and is 
currently being employed to screen -100 ribozymes targeting HER2 (details below). 

5 As a secondary, confirmatory endpoint a ribozyme-mediated decrease in the level of 

HER2 protein expression can be evaluated using a HER2-specific ELISA. 

Validation of Cell Lines and Ribozyme Treatment Conditions 

Two human breast cancer cell lines (T47D and SKBR-3) that are known to express 

!0 medium to high levels of HER2 protein, respectively, were considered for ribozyme 
screening. In order to validate these cell lines for HER2 -mediated sensitivity, both cell 
lines were treated with the HER2 specific antibody, Herceptin® (Genentech) and its effect 
on cell proliferation was determined. Herceptin was added to cells at concentrations 
ranging from 0-8 uM in medium containing either no serum (OptiMem), 0.1% or 0.5% 

5 FBS and efficacy was determined via cell proliferation. Maximal inhibition of 

proliferation (-50%) in both cell lines was observed after addition of Herceptin at 0.5 nM 
in medium containing 0.1% or no FBS. The fact that both cell lines are sensitive to an 
anti-HER2 agent (Herceptin) supports their use in experiments testing anti-HER2 
ribozymes. 

0 Prior to ribozyme screening, the choice of the optimal lipid(s) and conditions for 

ribozyme delivery was determined empirically for each cell line. Applicant has established 
a panel of proprietary lipids that can be used to deliver ribozymes to cultured cells and are 
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very useful for cell proliferation assays that are typically 3-5 days in length. Initially, this 
panel of proprietary lipid delivery vehicles was screened in SKBR-3 and T47D cells using 
previously established control oligonucleotides. Specific lipids and conditions for optimal 
delivery were selected for each cell line based on these screens. These conditions were 
5 used to deliver HER2 specific ribozymes to cells for primary (inhibition of cell 
proliferation) and secondary (decrease in HER2 protein) efficacy endpoints. 

Primary Screen: Inhibition of Cell Proliferation 

Although optimal ribozyme delivery conditions were determined for two cell lines, 

0 the SKBR-3 cell line were be used for the initial screen because it has the higher level of 
HER2 protein, and thus should be most susceptible to a HER2 -specific ribozyme. Follow- 
up studies can be carried out in T47D cells to confirm leads as necessary. 

Ribozyme screens were be performed using an automated, high throughput 96 -well 
cell proliferation assay. Cell proliferation were measured over a 5-day treatment period 

5 using the nucleic acid stain CyQuant for determining cell density. The growth of cells 
treated with ribozyme/lipid complexes were compared to both untreated cells and to cells 
treated with £crambled-arm Attenuated core Controls (SAC; or IA; Figure 8). SACs can 
no longer bind to the target site due to the scrambled arm sequence and have nucleotide 
changes in the core that greatly diminish ribozyme cleavage. These SACs are used to 

!0 determine non-specific inhibition of cell growth caused by ribozyme chemistry (i.e. 

multiple 2' 0-Me modified nucleotides, a single 2'C-allyl uridine, 4 phosphorothioates 
and a 3' inverted abasic). Lead ribozymes are chosen from the primary screen based on 
their ability to inhibit cell proliferation in a specific manner. Dose response assays are 
carried out on these leads and a subset was advanced into a secondary screen using the 

!5 level of HER2 protein as an endpoint. 

Secondary Screen; Decrease in HER2 Protein 

A secondary screen that measures the effect of anti-HER2 ribozymes on HER2 
protein levels is used to support preliminary findings. A robust HER2 ELISA for both 
10 T47D and SKBR-3 cells has been established and is available for use as an additional 
endpoint. 
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Ribozvme Mechanism Assays 

A Taqman assay for measuring the ribozyme-mediated decrease in HER2 RNA has 
also been established. This assay is based on PCR technology and can measure in real 
time the production of HER2 mRNA relative to a standard cellular mRNA such as 
5 GAPDH. This RNA assay is used to establish proof that lead ribozymes are working 
through an RNA cleavage mechanism and result in a decrease in the level of HER2 
mRNA, thus leading to a decrease in cell surface HER2 protein receptors and a subsequent 
decrease in tumor cell proliferation. 

0 Animal Models 

Evaluating the efficacy of anti-HER2 agents in animal models is an important 
prerequisite to human clinical trials. As in cell culture models, the most HER2 sensitive 
mouse tumor xenografts are those derived from human breast carcinoma cells that express 
high levels of HER2 protein. In a recent study, nude mice bearing BT-474 xenografts 

5 were sensitive to the anti-HER2 humanized monoclonal antibody Herceptin, resulting in 
an 80% inhibition of tumor growth at a 1 mg kg dose (ip, 2 X week for 4-5 weeks). Tumor 
eradication was observed in 3 of 8 mice treated in this manner (Baselga et ah, 1998). This 
same study compared the efficacy of Herceptin alone or in combination with the 
commonly used chemotherapeutics, paclitaxel or doxorubicin. Although, all three anti- 

!0 HER2 agents caused modest inhibition of tumor growth, the greatest antitumor activity 
was produced by the combination of Herceptin and paclitaxel (93% inhibition of tumor 
growth vs 35% with paclitaxel alone). The above studies provide proof that inhibition of 
HER2 expression by anti-HER2 agents causes inhibition of tumor growth in animals. 
Lead anti-HER2 ribozymes chosen from in vitro assays are further tested in mouse 

5 xenograft models. Ribozymes are first tested alone and then in combination with standard 
chemotherapies. 

Animal Model Development 

Three human breast tumor cell lines (T47D, SKBR-3 and BT-474) were 
0 characterized to establish their growth curves in mice. These three cell lines have been 
implanted into the mammary papillae of both nude and SCID mice and primary tumor 
volumes are measured 3 times per week. Growth characteristics of these tumor lines using 
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a Matrigel implantation format will also be established. In addition, the use of two other 
breast cell lines that have been engineered to express high levels of HER2 are also being 
used. The tumor cell line(s) and implantation method that supports the most consistent 
and reliable tumor growth is used in animal studies testing the lead HER2 ribozyme(s). 

5 Ribozyme are administered by daily subcutaneous injection or by continuous subcutaneous 
infusion from Alzet mini osmotic pumps beginning 3 days after tumor implantation and 
continuing for the duration of the study. Group sizes of at least 10 animals are employed. 
Efficacy is determined by statistical comparison of tumor volume of ribozyme-treated 
animals to a control group of animals treated with saline alone. Because the growth of 

0 these tumors is generally slow (45-60 days), an initial endpoint will be the time in days it 
takes to establish an easily measurable primary tumor (i.e. 50-100 mm 3 ) in the presence or 
absence of ribozyme treatment. 

Clinical Summary 

5 Breast cancer is a common cancer in women and also occurs in men to a lesser 

degree. The incidence of breast cancer in the United States is -180,000 cases per year and 
-46,000 die each year of the disease. In addition, 21,000 new cases of ovarian cancer per 
year lead to -13,000 deaths (data from Hung et al., 1995 and the Surveillance, 
Epidemiology and End Results Program, NCI). Ovarian cancer is a potential secondary 

0 indication for anti-HER2 ribozyme therapy. 

A full review of breast cancer is given in the NCI PDQ for Breast Cancer. A brief 
overview is given here. Breast cancer is evaluated or "staged" on the basis of tumor size, 
and whether it has spread to lymph nodes and/or other parts of the body. In Stage I breast 
cancer, the cancer is no larger than 2 centimeters and has not spread outside of the breast. 

5 In Stage n, the patient's tumor is 2-5 centimeters but cancer may have spread to the 

axillary lymph nodes. By Stage HI, metastasis to the lymph nodes is typical, and tumors 
are 5 centimeters. Additional tissue involvement (skin, chest wall, ribs, muscles etc) 
may also be noted. Once cancer has spread to additional organs of the body, it is classed 
as Stage IV. 

0 Almost all breast cancers (>90%) are detected at Stage I or n, but 3 1 % of these are 

already lymph node positive. The 5-year survival rate for node negative patients (with 
standard surgery/radiation/chemotherapy /hormone regimens) is 97%; however, 
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involvement of the lymph nodes reduces the 5-year survival to only 77%. Involvement of 
other organs ( Stage HI) drastically reduces the overall survival, to 22% at 5 years. Thus, 
chance of recovery from breast cancer is highly dependent on early detection. Because up 
to 10% of breast cancers are hereditary, those with a family history are considered to be at 

5 high risk for breast cancer and should be monitored very closely. 

Breast cancer is highly treatable and often curable when detected in the early stages. 
(For a complete review of breast cancer treatments, see the NCI PDQ for Breast Cancer.) 
Common therapies include surgery, radiation therapy, chemotherapy and hormonal 
therapy. Depending upon many factors, including the tumor size, lymph node involvement 

0 and location of the lesion, surgical removal varies from lumpectomy (removal of the tumor 
and some surrounding tissue) to mastectomy (removal of the breast, lymph nodes and 
some or all of the underlying chest muscle). Even with successful surgical resection, as 
many as 21% of the patients may ultimately relapse (10-20 years). Thus, once local 
disease is controlled by surgery, adjuvant radiation treatments, chemotherapies and/or 

5 hormonal therapies are typically used to reduce the rate of recurrence and improve 

survival. The therapy regimen employed depends not only on the stage of the cancer at its 
time of removal, but other variables such the type of cancer (ductal or lobular), whether 
lymph nodes were involved and removed, age and general health of the patient and if other 
organs are involved. 

!0 Common chemotherapies include various combinations cytotoxic drugs to kill the 

cancer cells. These drugs include paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil etc. Significant toxicities are associated with 
these cytotoxic therapies. Well-characterized toxicities include nausea and vomiting, 
myelosuppression, alopecia and mucosity. Serious cardiac problems are also associated 

!5 with certain of the combinations, e.g. doxorubin and paclitaxel, but are less common. 

Testing for estrogen and progesterone receptors helps to determine whether certain 
anti-hormone therapies might be helpful in inhibiting tumor growth. If either or both 
receptors are present, therapies to interfere with the action of the hormone ligands, can be 
given in combination with chemotherapy and are generally continued for several years. 

0 These adjuvant therapies are called SERMs, selective estrogen receptor modulators, and 
they can give beneficial estrogen-like effects on bone and lipid metabolism while 
antagonizing estrogen in reproductive tissues. Tamoxifen is one such compound. The 
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primary toxic effect associated with the use of tamoxifen is a 2 to 7-fold increase in the 
rate of endometrial cancer. Blood clots in the legs and lung and the possibility of stroke 
are additional side effects. However, tamoxifen has been determined to reduce breast 
cancer incidence by 49% in high-risk patients and an extensive, somewhat controversial, 

5 clinical study is underway to expand the prophylactic use of tamoxifen. Another SERM, 
raloxifene, was also shown to reduce the incidence of breast cancer in a large clinical trial 
where it was being used to treat osteoporosis. In additional studies, removal of the ovaries 
and/or drugs to keep the ovaries from working are being tested. 

Bone marrow transplantation is being studied in clinical trials for breast cancers that 

0 have become resistant to traditional chemotherapies or where >3 lymph nodes are 

involved. Marrow is removed from the patient prior to high-dose chemotherapy to protect 
it from being destroyed, and then replaced after the chemotherapy. Another type of 
"transplant" involves the exogenous treatment of peripheral blood stem cells with drugs to 
kill cancer cells prior to replacing the treated cells in the bloodstream. 

5 One biological treatment, a humanized monoclonal anti-HER2 antibody, Herceptin 

(Genentech) has been approved by the FDA as an additional treatment for HER2 positive 
tumors. Herceptin binds with high affinity to the extracellular domain of HER2 and thus 
blocks its signaling action. Herceptin can be used alone or in combination with 
chemotherapeutics (i.e. paclitaxel, docetaxel, cisplatin, etc.) (Pegram, et al, 1998). hi 

0 Phase IE studies, Herceptin significantly improved the response rate to chemotherapy as 
well as improving the time to progression (Ross & Fletcher, 1998). The most common 
side effects attributed to Herceptin are fever and chills, pain, asthenia, nausea, vomiting, 
increased cough, diarrhea, headache, dyspnea, infection, rhinitis, and insomnia. Herceptin 
in combination with chemotherapy (paclitaxel) can lead to cardiotoxicity (Sparano, 1999), 

5 leukopenia, anemia, diarrhea, abdominal pain and infection. 

HER2 Protein Levels for Patient Screening and as a Potential Endpoint 

Because elevated HER2 levels can be detected in at least 30% of breast cancers, 
breast cancer patients can be pre-screened for elevated HER2 prior to admission to initial 
0 clinical trials testing an anti-HER2 ribozyme. Initial HER2 levels can be determined (by 
ELISA) from tumor biopsies or resected tumor samples. 
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During clinical trials, it may be possible to monitor circulating HER2 protein by 
ELISA (Ross and Fletcher, 1998). Evaluation of serial blood/serum samples over the 
course of the anti-HER2 ribozyme treatment period could be useful in determining early 
indications of efficacy. In fact, the clinical course of Stage IV breast cancer was correlated 

5 with shed HER2 protein fragment following a dose-intensified paclitaxel monotherapy. In 
all responders, the HER2 serum level decreased below the detection limit (Luftner et al.). 

Two cancer-associated antigens, CA27.29 and CA15.3, can also be measured in the 
serum. Both of these glycoproteins have been used as diagnostic markers for breast 
cancer. CA27.29 levels are higher than CA15.3 in breast cancer patients; the reverse is 

0 true in healthy individuals. Of these two markers, CA27.29 was found to better 

discriminate primary cancer from healthy subjects. In addition, a statistically significant 
and direct relationship was shown between CA27.29 and large vs small tumors and node 
postive vs node negative disease (Gion, et al., 1999). Moreover, both cancer antigens were 
found to be suitable for the detection of possible metastases during follow-up (Rodriguez 

5 de Paterna et al., 1999). Thus, blocking breast tumor growth may be reflected in lower 
CA27.29 and/or CA15.3 levels compared to a control group. FDA submissions for the use 
of CA27.29 and CA15.3 for monitoring metastatic breast cancer patients have been filed 
(reviewed in Beveridge, 1999). Fully automated methods for measurement of either of 
these markers are commercially available. 

!0 
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Applicant has designed, synthesized and tested several NCH ribozymes and HH 
ribozymes targeted against HER2 RNA (see for example Tables 31 and 34) in cell 
proliferation assays. 

5 

Proliferation assay: 

The model proliferation assay used in the study can require a cell plating density of 
2000 cells/well in 96-well plates and at least 2 cell doublings over a 5-day treatment 
period. To calculate cell density for proliferation assays, the FIPS (fluoro-imaging 
0 processing system) method well in the art was used. This method allows for cell density 
measurements after nucleic acids are stained with CyQuant dye, and has the advantage of 
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accurately measuring cell densities over a very wide range 1,000-100,000 cells/well in 96- 
well format. 

Ribozymes (50-200 nM) were delivered in the presence of cationic lipid at 2.0 
(ag/mL and inhibition of proliferation was determined on day 5 post-treatment. Two full 

5 ribozyme screens were completed and 4 lead HH and 1 1 lead NCH ribozymes were chosen 
for further testing. Of the 15 lead Rzs chosen from primary screens, 4 NCH and 1 HH 
Rzs continued to inhibit cell proliferation in subsequent experiments. NCH Rzs against 
sites, 2001 (RPI No. 17236), 2783 (RPI No. 17249), 2939 (RPI No. 17251) or 3998 (RPI 
No. 17262) caused inhibition of proliferation ranging from 25-60% as compared to a 

0 scrambled control Rz (IA; RPI No. 17263). Of the five lead Rzs, the most efficacious is 
the NCH Rz (RPI No. 17251) against site 2939 of HER2 RNA. An example of results 
from cell culture assay is shown in Figure 8. Referring to Figure 8, NCH ribozymes and 
a HH ribozyme targeted against HER2 RNA, are shown to cause significant inhibition of 
proliferation of cells. This shows that ribozymes, for instance the NCH ribozymes are 

5 capable of inhibiting HER2 gene expression in mammalian cells. 

Example 8: Activity of Class II (Zinzyme) nucleic acid catalysts to inhibit HER2 gene 
expression 

Applicant has designed, synthesized and tested several class II (zinzyme) ribozymes 
!0 targeted against HER2 RNA (see, for example, Tables 58, 59, and 62) in cell proliferation 
RNA reduction assays. 

Proliferation assay: 

The model proliferation assay used in the study requires a cell-plating density of 
!5 2000-10000 cells/well in 96-well plates and at least 2 cell doublings over a 5-day treatment 
period. Cells used in proliferation studies were either human breast or ovarian cancer cells 
(SKBR-3 and SKOV-3 cells respectively). To calculate cell density for proliferation 
assays, the FIPS (fluoro-imaging processing system) method well known in the art was 
used. This method allows for cell density measurements after nucleic acids are stained 
0 with CyQuant® dye, and has the advantage of accurately measuring cell densities over a 
very wide range 1,000-100,000 cells/well in 96-well format. 
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Ribozymes (50-200 nM) were delivered in the presence of cationic lipid at 2.0-5.0 
Hg/rnL and inhibition of proliferation was determined on day 5 post-treatment. Two full 
ribozyme screens were completed resulting in the selection of 14 ribozymes. Class II 
(zinzyme) ribozymes against sites, 314 (RPI No. 18653), 443 (RPI No. 18680), 597 (RPI 

5 No. 18697), 659 (RPI No. 18682), 878 (RPINos. 18683 and 18654), 881 (RPINos. 18684 
and 18685) 934 (RPI No. 18651), 972 (RPI No. 18656, 19292, 19727, 19728, and 19293), 
1292 (RPI No. 18726), 1541 (RPI No. 18687), 2116 (RPI No. 18729), 2932 (RPI No. 
18678), 2540 (RPI No. 18715), and 3504 (RPI No. 18710) caused inhibition of 
proliferation ranging from 25-80% as compared to a scrambled control ribozyme. An 

0 example of results from a cell culture assay is shown in Figure 20. Referring to Figure 
20, Class II ribozymes targeted against HER2 RNA are shown to cause significant 
inhibition of proliferation of cells. This shows that ribozymes, for instance the Class II 
(zinzyme) ribozymes are capable of inhibiting HER2 gene expression in mammalian cells. 

5 RNA assay: 

RNA was harvested 24 hours post-treatment using the Qiagen RNeasy® 96 
procedure. Real time RT-PCR (TaqMan® assay) was performed on purified RNA 
samples using separate primer/probe sets specific for either target HER2 RNA or control 
actin RNA (to normalize for differences due to cell plating or sample recovery). Results 
!0 are shown as the average of triplicate determinations of HER2 to actin RNA levels post- 
treatment. Figure 30 shows class II ribozyme (zinzyme) mediated reduction in HER2 
RNA targeting site 972 vs a scrambled attenuated control. 

Dose response assays: 

5 Active ribozyme was mixed with binding arm-attenuated control (BAC) ribozyme to 

a final oligonucleotide concentration of either 100, 200 or 400 nM and delivered to cells in 
the presence of cationic lipid at 5.0 pg/mL. Mixing active and BAC in this manner 
maintains the lipid to ribozyme charge ratio throughout the dose response curve. HER2 
RNA reduction was measured 24 hours post-treatment and inhibition of proliferation was 

0 determined on day 5 post-treatment. The dose response antiproliferation results are 
summarized in Figure 31 and the dose-dependent reduction of HER2 RNA results are 
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summarized in Figure 32. Figure 33 shows a combined dose response plot of both anti- 
proliferation and RNA reduction data for a class II ribozyme targeting site 972 of HER2 
RNA (RPI 19293). 

5 Example 9: Compositions having RNA cleaving activity 

Hammerhead ribozymes are an example of catalytic RNA molecules which are able 
to recognize and cleave a given specific RNA substrate (Hutchins et al., 1986, Nucleic 
Acids Res. 14:3627; Keese and Symons, in Viroids and viroid-like pathogens (JJ. 
Semanchik, publ., CRC-Press, Boca Raton, Florida, 1987, pages 1-47). The catalytic 

0 center of hammerhead ribozymes is flanked by three stems and can be formed by adjacent 
sequence regions of the RNA or also by regions, which are separated from one another by 
many nucleotides. Figure 6 shows a diagram of such a catalytically active hammerhead 
structure. The stems have been denoted I, n and HI. The nucleotides are numbered 
according to the standard nomenclature for hammerhead ribozymes (Hertel et al., 1992, 

5 Nucleic Acids Res. 20:3252). In this nomenclature, bases are denoted by a number, which 
relates their position relative to the 5' side of the cleavage site. Furthermore, each base that 
is involved in a stem or loop region has an additional designation (which is denoted by a 
decimal point and then another number) that defines the position of that base within the 
stem or loop. A designation of A 15 ' 1 would indicate that this base is involved in a paired 

0 region and that it is the first nucleotide in that stem going away from the core region. This 
accepted convention for describing hammerhead-derived ribozymes allows for the 
nucleotides involved in the core of the enzyme to always have the same number relative to 
all of the other nucleotides. The size of the stems involved in substrate binding or core 
formation can be any size and of any sequence, and the position of A 9 , for example, will 

5 remain the same relative to all of the other core nucleotides. Nucleotides designated, for 
example, N A ' 2 or N 9a represent an inserted nucleotide where the position of the caret ( A ) 
relative to the number denotes whether the insertion is before or after the indicated 
nucleotide. Thus, N A ' 2 represents a nucleotide inserted before nucleotide position 12, and 
N 9a represents a nucleotide inserted after nucleotide position 9. 

0 The consensus sequence of the catalytic core structure is described by Ruffher and 

Uhlenbeck, \990,Nucleic Acids Res. 18:6025-6029. Perriman a/, 1992, Gene 1 13:157- 
163, have meanwhile shown that this structure can also contain variations, for example, 
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naturally occurring nucleotide insertions such as N 9a and N a12 . Thus, the positive strand 
of the satellite RNA of the tobacco ring-spot virus does not contain any of the two 
nucleotide insertions while the +RNA strand of the virusoid of the lucerne transient streak 
virus (vLTSV) contains a N 9a = U insertion which can be mutated to C or G without loss 

5 of activity (Sheldon and Symons, 1989, Nucleic Acids Res. 17:5679-5685). Furthermore, 
in this special case, N 7 = A and R 15 1 = A. On the other hand, the minus strand of the 
carnation stunt associated viroid (-CarSV) is quite unusual since it contains both 
nucleotide insertions, that is N a12 = A and N 9a = C (Hernandez et a/.,1992, Nucleic Acids 
Res. 20:6323-6329). In this viroid N 7 = A and R 15 1 = A. In addition, this special 

0 hammerhead structure exhibits a very effective self-catalytic cleavage despite the more 
open central stem. 

Possible uses of hammerhead ribozymes include, for example, generation of RNA 
restriction enzymes and the specific inactivation of the expression of genes in, for 
example, animal, human or plant cells and prokaryotes, yeasts and plasmodia. A particular 

5 biomedical interest is based on the fact that many diseases, including many forms of 
tumors, are related to the overexpression of specific genes. Inactivating such genes by 
cleaving the associated mRNA represents a possible way to control and eventually treat 
such diseases. Moreover there is a great need to develop antiviral, antibacterial, and 
antifungal pharmaceutical agents. Ribozymes have potential as such anti-infective agents 

!0 since RNA molecules vital to the survival of the organism can be selectively destroyed. 
In addition to needing the correct hybridizing sequences for substrate binding, 
substrates for hammerhead ribozymes have been shown to strongly prefer the triplet 
N 16 2 U 161 H 17 (NUH) where N can be any nucleotide, U is uridine, and H is either 
adenosine, cytidine, or uridine (Koizumi et al, 1988, FEBS Lett. 228, 228-230; Ruffner et 

:5 al, 1990, Biochemistry 29, 10695-10702 ; Perriman et al, 1992, Gene 113, 157-163). 
NUH is sometimes designated as NUX. The fact that changes to this general rule for 
substrate specificity result in non- functional substrates implies that there are "non core 
compatible" structures which are formed when substrates are provided which deviate from 
the stated requirements. Evidence along these lines was recently reported by Uhlenbeck 

0 and co-workers (Uhlenbeck et al, 1997, Biochemistry 36: 1 108-1 1 14) when they 

demonstrated that the substitution of a G at position 17 caused a functionally catastrophic 
base pair between G 17 and C 3 to form, both preventing the correct orientation of the 



WO 01/16312 PCT/US00/23998 
113 

scissile bond for cleavage and the needed tertiary interactions of C 3 (Murray et al, 1995, 
Biochem. J. 3 1 1 :487-494). The strong preference for a U at position 16.1 may exist for 
similar reasons. Many experiments have been done in an attempt to isolate ribozymes 
which are able to efficiently relieve the requirement of a U at position 16.1, however, 
5 attempts to find hammerhead type ribozymes which can cleave substrates having a base 
other than a U at position 16. 1 have proven impossible (Perriman et al, 1992, Gene 113, 
157-163). 

Efficient catalytic molecules with reduced or altered requirements in the cleavage 
region are highly desirable because their isolation would greatly increase the number of 
0 available target sequences that molecules of this type could cleave. For example, it would 
be desirable to have a ribozyme variant that could efficiently cleave substrates containing 
triplets other than N 16 2 U 161 H 17 since this would increase the number of potential target 
cleavage sites. 

Chemically modified oligonucleotides which contain a block of 

5 deoxyribonucleotides in the middle region of the molecule have potential as 

pharmaceutical agents for the specific inactivation of the expression of genes (Giles et al, 
1992, Nucleic Acids Res. 20:763-770). These oligonucleotides can form a hybrid DNA- 
RNA duplex in which the DNA bound RNA strand is degraded by RNase H. Such 
oligonucleotides are considered to promote cleavage of the RNA and so cannot be 

!0 characterized as having an RNA-c leaving activity nor as cleaving an RNA molecule (the 
RNase H is cleaving). A significant disadvantage of these oligonucleotides for in vivo 
applications is their low specificity, since hybrid formation, and thus cleavage, can also 
take place at undesired positions on the RNA molecules. 

Since, unmodified ribozymes are sensitive to degradation by RNases, chemically 

!5 modified active substances have to be used in order to administer hammerhead ribozymes 
exogenously (discussed, for example, by Heidenreich et al., 1994, J. Biol. Chem. 
269:2131-2138; Kiehntopf et al., \994,EMBOJ. 13:4645-4652; Paolella et al., 1992, 
EMBOJ. 11:1913-1919; and Usmmet al, 1994, Nucleic Acids Symp. Ser. 31:163-164). 
Sproat et al, U.S. Pat. No. 5,334,71 1, describe such chemically modified active 

10 substances based on synthetic catalytic oligonucleotide structures with a length of 35 to 40 
nucleotides which are suitable for cleaving a nucleic acid target sequence and contain 
modified nucleotides that contain an optionally substituted alkyl, alkenyl or alkynyl group 
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with 1 - 10 carbon atoms at the 2'-0 atom of the ribose. These oligonucleotides contain 
modified nucleotide building blocks and form a structure resembling a hammerhead 
structure. These oligonucleotides are able to cleave specific RNA substrates. 

Usman et al, U.S. Patent No. 5,891,684, describe enzymatic nucleic acid molecules 

5 with one or more nucleotide base modification(s) in a substrate binding arm. 

Thompson et al, US Patent No. 5,599,704 describe enzymatic RNA molecules 
targeted against ErbB2/new/Her2 RNA. 

Sullivan et al, US Patent No. 5,616,490 describe enzymatic RNA molecules 
targeted against protein kinase C (PKC) RNA. 

0 Sioud, International PCT publication No. WO 99/63066 describe hammerhead 

ribozymes targeted against specific sites within protein kinase C alpha (PKC alpha), 
VEGF, and TNF alpha RNA. 

Jarvis et al., International PCT publication No. WO 98/505030, describe the 
synthesis of xylo-ribonucleosides and oligonucleotides comprising xylo modifications. 

5 This invention relates to novel enzymatic nucleic acid molecules having an RNA- 

cleavage activity, as well as their use for cleaving RNA substrates in vitro and in vivo. The 
compositions contain an active center, the subunits of which are selected from nucleotides 
and/or nucleotide analogues, as well as flanking regions contributing to the formation of a 
specific hybridization with an RNA substrate. Preferred compositions form, in 

0 combination with an RNA substrate, a structure resembling a hammerhead structure. The 
active center of the disclosed compositions is characterized by the presence of I 15 1 which 
allows cleavage of RNA substrates having C 16 ' 1 . It is therefore an object of the present 
invention to provide compositions that cleave RNA, and in particular to provide RNA- 
cleaving oligomers which at the same time have a high stability, activity, and specificity. 

5 This invention relates to novel nucleic acid molecules with catalytic activity, which are 
particularly useful for cleavage of RNA or DNA or combination thereof. The nucleic acid 
catalysts of the instant invention are distinct from other nucleic acid catalysts known in the 
art. Specifically, nucleic acid catalysts of the instant invention are capable of catalyzing an 
intermolecular or intramolecular endonuclease reaction. 

0 It is another object of the present invention to provide compositions that cleave RNA 

substrates having a cleavage site triplet other than N 162 U 16I H 17 (NUH; Figure 6), where N 
is a nucleotide, U is uridine and H is adenosine, uridine or cytidine. H is used 
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interchangably with X. Specifically, the enzymatic nucleic acid molecule of the instant 
invention has an endonuclease activity to cleave RNA substrates having a cleavage triplet 
N ,6 2 C 16 'H 17 (NCR; Figure 6), where N is a nucleotide, C is cytidine and H is adenosine, 
uridine or cytidine. H is used interchangeably with X. In another aspect the invention 
features an enzymatic nucleic acid molecule of the instant invention has an endonuclease 
activity to cleave RNA substrates having a cleavage triplet N 16 ' 2 C 161 N 17 (NCN; Figure 6), 
where N is a nucleotide, C is cytidine. 

In a preferred embodiment, the invention features an enzymatic nucleic acid 
molecule having formula 1 : 

^ — G— A — A — I D — y 

L " 
^ (N) n — (N)— A— G— N — A — G — U — C— E— 5< 

where N represents independently a nucleotide or a non-nucleotide linker, which 
may be same or different; D and E are independently oligonucleotides of length sufficient 
to stably interact (e.g., by forming hydrogen bonds with complementary nucleotides in the 
target) with a target nucleic acid molecule (the target can be an RNA, DNA or mixed 
polymers), preferably, the length of D and E are independently between 3-20 nucleotides 
long, specifically, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 17, and 20; o andn are integers 
independently greater than or equal to 1 and preferably less than about 100, specifically 2, 
3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 50, wherein if (N) 0 and (N) n are nucleotides, (N)o 
and (N)n are optionally able to interact by hydrogen bond interaction, in particular if n =1 
and o=l then (N)n is preferably a purine (e.g., G, and A) and (N)o is preferably a 
pyrimidine (e.g., C and U) and (N)n preferably forms; • indicates base-paired interaction; 
L is a linker which may be present or absent (i.e., the molecule may be assembled from 
two separate oligonucleotides), but when present, is a nucleotide and/or a non-nucleotide 
linker, which may be a single-stranded and/or double-stranded region; p is an integer 0 or 

1, when p=l, (N)p is preferably A or U; and represents a chemical linkage (e.g. a 

phosphate ester linkage, amide linkage, phosphorothioate linkage or others known in the 
art). A, U, I, C and G represent adenosine, uridine, inosine, cytidine and guanosine 
nucleotides, respectively. The N in S'-CUGANGA-S' region of formula 1 is preferably U. 
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The nucleotides in the formula 1 are unmodified or modified at the sugar, base, and/or 
phosphate as known in the art. 

In a preferred embodiment, the invention features an enzymatic nucleic acid 
molecule having formula 2: 

^ C — G-A- A- I D- 3- 

L 

^ (N) n G-(N) P -A- G- N- A- G~ U- C~ E - 5' 



where N represents independently a nucleotide or a non-nucleotide linker, which 
may be same or different; D and E are independently oligonucleotides of length sufficient 
to stably interact (e.g., by forming hydrogen bonds with complementary nucleotides in the 

0 target) with a target nucleic acid molecule (the target can be an RNA, DNA or mixed 
polymers), preferably, the length of D and E are independently between 3-20 nucleotides 
long, specifically, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 17, and 20; o and n are integers 
independently greater than or equal to 0 and preferably less than about 1 00, specifically 2, 
3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 50, wherein if (N) 0 and (N) n are nucleotides, (N)o 

5 and (N)n are optionally able to interact by hydrogen bond interaction; • indicates base- 
paired interaction; L is a linker which may be present or absent (i.e., the molecule may be 
assembled from two separate oligonucleotides), but when present, is a nucleotide and/or a 
non-nucleotide linker, which may be a single-stranded and/or double-stranded region; p is 
an integer 0 or 1, when p=l, (N)p is preferably A, C or U; and represents a 

0 chemical linkage (e.g. a phosphate ester linkage, amide linkage, phosphorothioate linkage 
or others known in the art). A, U, I, C and G represent adenosine, uridine, inosine, cytidine 
and guanosine nucleotides, respectively. The N in 5'-CUGANGA-3' region of formula 2 
is preferably U. The nucleotides in the formula 2 are unmodified or modified at the sugar, 
base, and/or phosphate as known in the art. 

5 In a preferred embodiment, the I (inosine) in formula 1 and 2 is preferably a ribo- 

inosine or a xylo-inosine. 
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In yet another embodiment, the nucleotide linker (L) is a nucleic acid aptamer, such 
as an ATP aptamer, HIV Rev aptamer (RRE), HIV Tat aptamer (TAR) and others (for a 
review see Gold et al, 1995, Annu. Rev. Biochem., 64, 763; and Szostak & Ellington, 
1993, in The RNA World, ed. Gesteland and Atkins, pp 51 1, CSH Laboratory Press). A 

5 "nucleic acid aptamer" as used herein is meant to indicate nucleic acid sequence capable of 
interacting with a ligand. The ligand can be any natural or a synthetic molecule, including 
but not limited to a resin, metabolites, nucleosides, nucleotides, drugs, toxins, transition 
state analogs, peptides, lipids, proteins, amino acids, nucleic acid molecules, hormones, 
carbohydrates, receptors, cells, viruses, bacteria and others. In a preferred embodiment L 

0 has the sequence 5 '-GAAA-3 ' or 5 ' -GUUA-3 ' . 

In yet another embodiment, the non-nucleotide linker (L) is as defined herein. 
The term "non-nucleotide", as used herein, includes either abasic nucleotide, 
polyether, polyamine, polyamide, peptide, carbohydrate, lipid, or polyhydrocarbon 
compounds. Specific examples include those described by Seela and Kaiser, Nucleic 

5 Acids Res. 1990, 75:6353 and Nucleic Acids Res. 1987, 75:31 13; Cload and Schepartz, J. 
Am. Chem. Soc. 1991, 7/3:6324; Richardson and Schepartz, J. Am. Chem. Soc. 1991, 
7/3:5109; Ma et al., Nucleic Acids Res. 1993, 2/:2585 and Biochemistry 1993, 32:1751; 
Durand et al, Nucleic Acids Res. 1990, /£:6353; McCurdy et al., Nucleosides & 
Nucleotides 1991, /0:287; Jschke et al, Tetrahedron Lett. 1993, 34:301; Ono et al., 

0 Biochemistry 1991, 30:9914; Arnold et al, International Publication No. WO 89/02439; 
Usman et al, International Publication No. WO 95/06731 ; Dudycz et al. International 
Publication No. WO 95/1 1910 and Ferentz and Verdine, J. Am. Chem. Soc. 1991, 
//3:4000, all hereby incorporated by reference herein. Non-nucleotide linkers can be any 
molecule, which is not an oligomeric sequence, that can be covalently coupled to an 

5 oligomeric sequence. Preferred non-nucleotide linkers are oligomeric molecules formed of 
non-nucleotide subunits. Examples of such non-nucleotide linkers are described by 
Letsingerand Wu, (J. Am. Chem. Soc. 117:7323-7328 (1995)), Benseler et al, {J. Am. 
Chem. Soc. 115:8483-8484(1993)) mdFuetal , (J. Am. Chem. Soc. 116:4591-4598 
(1994)). Preferred non-nucleotide linkers, or subunits for non-nucleotide linkers, include 

0 substituted or unsubstituted Ci-Cio straight chain or branched alkyl, substituted or 

unsubstituted C 2 -Ci 0 straight chain or branched alkenyl, substituted or unsubstituted C 2 - 
Cio straight chain or branched alkynyl, substituted or unsubstituted Ci-Cio straight chain or 
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branched alkoxy, substituted or unsubstituted C 2 -Ci 0 straight chain or branched 
alkenyloxy, and substituted or unsubstituted C2-C10 straight chain or branched alkynyloxy. 
The substituents for these preferred non-nucleotide linkers (or subunits) can be halogen, 
cyano, amino, carboxy, ester, ether, carboxamide, hydroxy, or mercapto. Thus, in a 

5 preferred embodiment, the invention features an enzymatic nucleic acid molecule having 
one or more non-nucleotide moieties, and having enzymatic activity to cleave an RNA or 
DNA molecule. By the term "non-nucleotide" is meant any group or compound which can 
be incorporated into a nucleic acid chain in the place of one or more nucleotide units, 
including either sugar and/or phosphate substitutions, and allows the remaining bases to 

0 exhibit their enzymatic activity. The group or compound is abasic in that it does not 
contain a commonly recognized nucleotide base, such as adenosine, guanine, cytosine, 
uracil or thymine. The terms "abasic" or "abasic nucleotide" as used herein encompass 
sugar moieties lacking a base or having other chemical groups in place of nucleotide base 
at the 1' position. 

5 In a preferred embodiment, the invention features modified ribozymes with 

phosphate backbone modifications comprising one or more phosphorothioate, 
phosphorodithioate, methylphosphonate, morpholino, amidate carbamate, carboxymethyl, 
acetamidate, polyamide, sulfonate, sulfonamide, sulfamate, formacetal, thioformacetal, 
and/or alkylsilyl, substitutions. For a review of oligonucleotide backbone modifications 

!0 see Hunziker and Leumann, 1995, Nucleic Acid Analogues: Synthesis and Properties, in 
Modern Synthetic Methods, VCH, 331-417, and Mesmaeker et al, 1994, Novel Backbone 
Replacements for Oligonucleotides, in Carbohydrate Modifications in Antisense Research, 
ACS, 24-39. 

In a further preferred embodiment of the instant invention, an inverted deoxy abasic 
!5 moiety is utilized at the 3' end of the enzymatic nucleic acid molecule. 

By "pyrimidines" is meant nucleotides comprising modified or unmodified 
derivatives of a six membered pyrimidine ring. An example of a pyrimidine is modified or 
unmodified uridine. 

In a preferred embodiment, the nucleosides of the instant invention include, T-O- 
0 methyl-2,6-diaminopurine riboside; 2'-deoxy-2'amino-2,6-diaminopurine riboside; 2'-(iV- 
alanyl) amino-2'-deoxy-uridine; 2'-(A f -phenylalanyl)amino-2'-deoxy-uridine; 2'-deoxy -2'- 
(iV-beta-alanyl) amino ; 2'-deoxy-2'-(lysiyl) amino uridine; 2'-C-allyl uridine; 2'-0-amino- 
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uridine; 2'-0-methylthiomethyl adenosine; 2'-0-methylthiomethyl cytidine ; 2'-0- 
methylthiomethyl guanosine; 2'-0-methylthiomethyl-uridine; 2'-Deoxy-2'-(A'-histidyl) 
amino uridine; 2'-deoxy-2'-amino-5-methyl cytidine; 2'-(N-p-carboxamidine-beta- 
alanyl)amino-2'-deoxy-uridine; 2'-deoxy-2'-(N-beta-alanyl)-guanosine; 2'-0-amino- 

5 adenosine; 2'-(N-lysyl)amino -2'-deoxy-cytidine; 2'-Deoxy -2'-(L-histidine) amino 
Cytidine; and 5-Imidazoleacetic acid 2'-deoxy-5'-triphosphate uridine. 

By "oligonucleotide" as used herein is meant a molecule having two or more 
nucleotides. The polynucleotide can be single, double or multiple stranded and may have 
modified or unmodified nucleotides or non-nucleotides or various mixtures and 

0 combinations thereof. 

In a preferred embodiment, the enzymatic nucleic acid molecule of formula 1 or 2 
include at least three ribonucleotide residues, preferably 4, 5, 6, 7, 8, 9, and 10 
ribonucleotide residues. 

In preferred embodiments, the enzymatic nucleic acid of the instant invention 

5 includes one or more stretches of RNA, which provide the enzymatic activity of the 
molecule, linked to the non-nucleotide moiety. The necessary RNA components are 
known in the art (see for e.g., Usman et al, supra). 

Thus, in one preferred embodiment, the invention features enzymatic nucleic acid 
molecules that inhibit gene expression and/or cell proliferation in vitro or in vivo {e.g. in 

0 patients). These chemically or enzymatically synthesized nucleic acid molecules contain 
substrate binding domains that bind to accessible regions of specific target nucleic acid 
molecules. The nucleic acid molecules also contain domains that catalyze the cleavage of 
target. Upon binding, the enzymatic nucleic acid molecules cleave the target molecules, 
preventing for example, translation and protein accumulation. In the absence of the 

5 expression of the target gene, cell proliferation, for example, is inhibited. 

In another preferred embodiment, catalytic activity of the molecules described in the 
instant invention can be optimized as described by Draper et al., supra. The details will 
not be repeated here, but include altering the length of the ribozyme binding arms, or 
chemically synthesizing ribozymes with modifications (base, sugar and/or phosphate) that 

0 prevent their degradation by serum ribonucleases and/or enhance their enzymatic activity 
(see e.g., Eckstein et al, International Publication No. WO 92/07065; Perrault et al, 1990 
Nature 344, 565; Pieken et al., 1991 Science 253, 314; Usman and Cedergren, 1992 
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Trends in Biochem. Sci. 17, 334; Usman et al, International Publication No. 
WO 93/15187; and Rossi et al, International Publication No. WO 91/03162; Sproat, US 
Patent No. 5,334,71 1; and Burgin et al, supra; all of these describe various chemical 
modifications that can be made to the base, phosphate and/or sugar moieties of enzymatic 
5 RNA molecules). Modifications which enhance their efficacy in cells, and removal of 
bases from stem loop structures to shorten RNA synthesis times and reduce chemical 
requirements are desired. (All these publications are hereby incorporated by reference 
herein.). 

By "nucleic acid catalyst" as used herein is meant a nucleic acid molecule {e.g., the 

1 0 molecule of formulae 1 and 2) capable of catalyzing (altering the velocity and/or rate of) a 
variety of reactions including the ability to repeatedly cleave other separate nucleic acid 
molecules (endonuclease activity) in a nucleotide base sequence-specific manner. Such a 
molecule with endonuclease activity may have complementarity in a substrate binding 
region to a specified gene target, and also has an enzymatic activity that specifically 

1 5 cleaves RNA or DNA in that target. That is, the nucleic acid molecule with endonuclease 
activity is able to intramolecularly or intermolecularly cleave RNA or DNA and thereby 
inactivate a target RNA or DNA molecule. This complementarity functions to allow 
sufficient hybridization of the enzymatic RNA molecule to the target RNA or DNA to 
allow the cleavage to occur. 1 00% complementarity is preferred, but complementarity as 

20 low as 50-75% may also be useful in this invention. The nucleic acids may be modified at 
the base, sugar, and/or phosphate groups. The term enzymatic nucleic acid as used herein 
is used interchangeably with phrases such as ribozymes, catalytic RNA, enzymatic RNA, 
catalytic oligonucleotides, nucleozyme, RNA enzyme, endoribonuclease, endonuclease, 
minizyme, oligozyme, finderon or nucleic acid catalyst. All of these terminologies 

25 describe nucleic acid molecules of the instant invention with enzymatic activity. The 

specific examples of enzymatic nucleic acid molecules described in the instant application 
are not limiting in the invention and those skilled in the art will recognize that all that is 
important in an enzymatic nucleic acid molecule of this invention is that it has a specific 
substrate binding site which is complementary to one or more of the target nucleic acid 

30 regions, and that it have nucleotide sequences within or surrounding that substrate binding 
site which impart a nucleic acid cleaving activity to the molecule (Cech et al., U.S. Patent 
No. 4,987,071 ; Cech et al, 1988, 260 JAMA 3030). 
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The enzymatic nucleic acid molecule of Formula 1 or 2 may independently comprise 
a cap structure which may independently be present or absent. 

By "chimeric nucleic acid molecule" or "mixed polymer" is meant that, the molecule 
may be comprised of both modified or unmodified nucleotides. 
5 hi yet another preferred embodiment, the 3 '-cap is selected from a group comprising, 

4',5-methylene nucleotide; l-(beta-D-erythrofuranosyl) nucleotide; 4'-thio nucleotide, 
carbocyclic nucleotide; 5'-amino-alkyl phosphate; l,3-diamino-2 -propyl phosphate, 3- 
aminopropyl phosphate; 6-aminohexyl phosphate; 1,2-aminododecyl phosphate; 
hydroxypropyl phosphate; 1,5-anhydrohexitol nucleotide; L-nucleotide; alpha-nucleotide; 

1 0 modified base nucleotide; phosphorodithioate; ^reo-pentofuranosyl nucleotide; acyclic 
3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide; 3,5-dihydroxypentyl nucleotide, 5-5'- 
inverted nucleotide moiety; 5'-5'-inverted abasic moiety; 5'-phosphoramidate; 5'- 
phosphorothioate; 1,4-butanediol phosphate; 5'-amino; bridging and/or non-bridging 5'- 
phosphoramidate, phosphorothioate and/or phosphorodithioate, bridging or non bridging 

1 5 methylphosphonate and 5'-mercapto moieties (for more details, see Beaucage and Iyer, 
1993, Tetrahedron 49, 1925; incorporated by reference herein). By the term "non- 
nucleotide" is meant any group or compound which can be incorporated into a nucleic acid 
chain in the place of one or more nucleotide units, including either sugar and/or phosphate 
substitutions, and allows the remaining bases to exhibit their enzymatic activity. The 

20 group or compound is abasic in that it does not contain a commonly recognized nucleotide 
base, such as adenosine, guanine, cytosine, uracil or thymine. The terms "abasic" or 
"abasic nucleotide" as used herein encompass sugar moieties lacking a base or having 
other chemical groups in place of a base at the 1' position. 

In a preferred embodiment, the invention features l-(beta-D-xylofuranosyl)- 

25 xypoxanthine phosphoramidite and a process for the synthesis thereof and incorporation 
into oligonucleotides, such as enzymatic nucleic acid molecule. 

In yet another preferred embodiment, the invention features enzymatic nucleic acid 
molecules targeted against HER2 RNA, specifically, ribozymes in the hammerhead and 
NCH motifs. 

30 In a preferred embodiment, the invention features enzymatic nucleic acid molecules 

targeted against PKC alpha RNA, specifically, ribozymes in the hammerhead and NCH 
motifs. 
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Targets, for example PKC alpha RNA, for useful ribozymes and antisense nucleic 
acids can be determined, for example, as described in Draper et al, WO 95/04818; 
McSwiggen et al, U.S. Patent Nos. 5,525,468 and 5,646,042, all are hereby incorporated 
by reference herein in their totality. Other examples include the following PCT 

5 applications, which concern inactivation of expression of disease-related genes: WO 
95/23225, WO 95/13380, WO 94/02595, all incorporated by reference herein. 

The specific enzymatic nucleic acid molecules described in the instant application 
are not limiting in the invention and those skilled in the art will recognize that all that is 
important in an enzymatic nucleic acid molecule of this invention is that it has a specific 

0 substrate binding site (e.g., D and E of Formula 1 above) which is complementary to one 
or more of the target nucleic acid regions, and that it have nucleotide sequences within or 
surrounding that substrate binding site which impart a nucleic acid cleaving activity to the 
molecule. 

All naturally occurring hammerhead ribozymes have an A 151 -U 16 1 base pair. In 

5 addition, it is known that substrates for ribozymes based on the consensus hammerhead 
sequence strongly prefer a substrate that contains an N 16 ' 2 U 161 H 17 triplet in which H 17 is 
not a guanosine (Koizumi et al.,FEBS Lett. 228, 228-230 (1988); Ruffher et al, 
Biochemistry 29, 10695-10702 (1990); Perriman et al, Gene 1 13, 157-163 (1992)). Many 
experiments have been done in an attempt to isolate ribozymes which are able to 

0 efficiently relieve the requirement of a U at position 16.1, however, attempts to find 

ribozymes which can cleave substrates having a base other than a U at position 16.1 have 
proven largely unsuccessful (Perriman et al, Gene 113, 157-163 1992, Singh et al, 
Antisense and Nucleic Acid Drug Development 6:165-168 (1996)). 

However, examination of the recently published X-ray crystal structures (Pley el al, 

5 Nature 372:68-74 (1994), Scott et al, Cell 81 :991-1002 (1995), and Scott et al, Science 
274:2065-2069 (1996)) led to the realization that the A 151 -U 16 1 interaction is a non- 
standard base pair with a single hydrogen bond between the exocyclic amine (N6) of the 
adenosine and the 4-oxo group of the uridine. Modeling studies (based on the crystal 
structure) then led to the discovery that the interaction of the wild-type A 15 '-U 16 1 base pair 

0 can be spatially mimicked by replacement with an I 15 - l -C 16 -' base pair that adopts an 
isostructural orientation and which preserves the required contact of the 2-keto group of 
C 16 1 with A 6 of the uridine turn. In the model, the polarity of the stabilizing hydrogen 
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bond between positions 15.1 and 16.1 is reversed in the I 15 '-C 16 1 interaction, but the 
correct orientation of the bases around this bond is maintained. 

It has been discovered that hammerhead ribozyme analogues containing an inosine at 
position 15.1 readily cleave RNA substrates containing an n 162 c ,61 H 17 triplet. Based on 

5 this, disclosed are compositions, preferably synthetic oligomers, which cleave a nucleic 
acid target sequence containing the triplet N l6,2 C 161 H 17 . It is preferred that H 17 is not 
guanosine, however, under certain circumstances, NCG triplet containing RNA can be 
cleaved by the ribozymes of the instant invention. The ability to cleave substrates having 
N 16 2 C 16 ! X 17 triplets effectively doubles the number of targets available for cleavage by 

0 compositions of the type disclosed. 

Example 10: Synthesis of l-(beta-D-xvlofuranosyl)-xvpoxanthine phosphoramidite 
Referring to Figure 9, Inosine (1) was 5'-0-monomethoxytritylated and T-O- 
silylated under standard conditions to afford 2 (Charubala, R; Pfleiderer, W. Heterocycles 
5 1990, 30, 1 141). Oxidation/reduction procedure afforded 3 in moderate yield (Matulic- 
Adamic, J.; Daniher, A.T.; Gonzalez, C; Beigelman, L. Bioorg. Med. Chem. Lett.. 1999, 
9, 157): 1H NMR (CDC! 3 ) 8 12.80 (br s, 1H, NH), 8.1 1 (s, 1H, H-8), 8.08 (s, 1H, H-2), 
7.45-6.80 (m, 14H, trityl), 5.85 (d, J r)2 >= 1.6, 1H, H-l'), 4.83 (d, J 2 ', 3 '=7.2, 1H, H-2'), 
4.46 (br s, 1H, 3'-OH), 4.34 (m, 1H, H-4'), 4.06 (m, 1H, H-3'), 3.77 (s, 6H, 2 x OMe), 
0 3.60 (app d, 2H, H-5', H-5"), 0.89 (s, 9H, *-Bu), 0.07 (s, 3H, Me), 0.06 (s, 3H, Me). 
Standard phosphitylation of 3 afforded the desired phosphoramidite 4. 
More acid stable 5'-<9-MMT group is used in this particular case because applicant 
found that 5'-0-DMT protection is more labile in xylo nucleoside series than in ribo 
nucleoside series. 

5 The xylo-inosine was incorporated into oligonucleotides using the standard 

procedures known in the art and as described herein. 



Example 1 1 : Activity of the xvlo-rnosine-modified NCH Ribozyme 

Several NCH ribozymes with xylo-inosine at position 15.1 were designed (Figure 7) 
0 to cleave RNA containing GCA, ACA, UCA or the CCA triplet. These ribozymes were 
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synthesized and purified as described herein and tested using standard RNA cleavage 
reaction conditions (see Table 31, for example, and see below). 

The ribozymes were chemically synthesized. The method of synthesis used followed 
the procedure for normal RNA synthesis as described above and in Usman et al., (1987 J. 
5 Am. Chem. Soc, 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and 
Wincott et al., supra, and made use of common nucleic acid protecting and coupling 
groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. The 
average stepwise coupling yields were >98%. 

Ribozymes were purified by gel electrophoresis using general methods or were 
10 purified by high pressure liquid chromatography (HPLC; See Wincott et al., supra; the 
totality of which is hereby incorporated herein by reference) and were resuspended in 
water. The sequences of the chemically synthesized ribozymes used in this study are 
shown below in Table 33. 

Cleavage Reactions: Full-length or partially full-length, internally-labeled target 
5 RNA for ribozyme cleavage assay is prepared by in vitro transcription in the presence of 
[alpha- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and used 

as substrate RNA without further purification. Alternately, substrates were 5'-32p-end 
labeled using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 
2X concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 

;0 7.5 at 37°C, 10 mM MgC^) and the cleavage reaction was initiated by adding the 2X 
ribozyme mix to an equal volume of substrate RNA (maximum of 1-5 nM) that was also 
pre-warmed in cleavage buffer. As an initial screen, assays are carried out for 1 hour at 
37°C using a final concentration of 40 nM or 1 mM ribozyme, i.e., ribozyme excess. The 
reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 

!5 EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is 
heated to 95 C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide 
gel. Substrate RNA and the specific RNA cleavage products generated by ribozyme 
cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage is 
determined by Phosphor Imager® quantitation of bands representing the intact substrate 

(0 and the cleavage products. 
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The results of the experiments are summarized in Table 32, which shows that NCH- 
xylo ribozymes are catalytically active to cleave target RNA. 

Example 12: Activity of NCH Ribozyme variants 

5 The nucleic acid molecules of the instant invention allow for the ability to cleave a 

new set of 12 NCH triplets. Determination of single turnover rate constants at pH 6 of 
these ribozymes in the all ribo form show that with NCA type triplets, the cleavage rate is 
higher than at NUA sites. NCC and NUC site rates are similar, and NCU sites are slightly 
lower than NUU sites. Additional measurements of multiple turnover parameters of the all 

0 ribo ribozymes performed under non-saturating conditions using 5nM ribozyme and 
changing the substrate concentration from 50 to 500 nM at pH 7.4 with 10 mM Mg ++ at 
37 °C gave Km= 100 nM and kcat=6.5 min - 1 for GCA vs Km =30 nM and kcat =2.0 min 
_1 for GUA cleaving all ribo ribozymes. These data verify that the ribozymes with an I»C 
base pair are efficient catalysts in multiple turnover reactions and the relative order of 

5 activity between NCH and NUH cleavers established at pH 6 (Ludwig et al, 1 998, Nucleic 
Acids Res., 26, 2279-2285) remains unchanged. 

To gain more insight into the structural requirements of the 15.1- 16.1 base pair of 
the ribozymes of the instant invention, applicant synthesized several variants of the active 
1-15.1 »C-16.1 structure and tested these ribozyme analogues with their corresponding 

0 substrates. The influence of several core stabilization strategies on the activity of the NCH 
cleaving ribozymes was also investigated. 

Various nucleoside analogs were incorporated at position 15.1 of the ribozyme. 
Cleavage activity was tested with the complementary Fl* labeled substrates at pH 7.4 in 
the presence of 10 mM Mg ++ under conditions of ribozyme excess (i.e. single turnover 

5 conditions). The modified oligonucleotides were synthesized by standard oligonucleotide 
synthesis procedures. Xanthosine was protected using 0-2 ,0-4pivaloyloxymethyl groups; 
N,N-dimethylguanosine with 6-0-( 2-nitrophenyl-)ethyl and 6-thio-inosine with S- 
cyanoethyl protecting groups. The cleavage activity of the ribozymes containing the 15.1 
analogs is summarized in Figure 36. For comparison Figure 37 summarizes reported 

0 functional group modification studies performed at the A 1 5 . 1 residue in the A- 1 5 . 1 •U- 
16.1 context of NUH cleaving ribozymes. 
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Modifications at the purine 15.1 Nl and/or C6 positions (Figure 36 A, B, C) 

In the 6-thio-inosine (A) (si) 15.1 substituted ribozyme, the original (1-15.1) position 
6 OH-N (C-16.1) bonds are replaced by weaker (sl-15.1) position 6 S»H-N (C-16.1) 
hydrogen bonds while all other functional groups remain unchanged. Ribozymes with an 
5 adenosine (B) at position 15.1 (A- 1 5 . 1 ) are inactive with C-16.1 substrates since the 

ribozyme geometry requires the [A-15.1] position 6 amino group and the [C-16.1] position 
4 amino group hydrogen-bond donor functional groups to be in close proximity. Similarly, 
low activity is observed with 1-15.1 ribozymes and U-16.1 substrates, where the [1-15.1] 
position 6 keto and [C-16.1] position 4 keto hydrogen-bond acceptor groups are opposed 
1 0 (Figure 37, B). Although inosine can form stable mismatch pairs with uridine in RNA 
duplexes or in tRNA anticodon-mRNA interactions, these results suggest that the 
geometry in the MJ mismatches differ from that of the A«U (or I»C) base pair in the active 
NUH ribozyme. Substitution of Nl-Methyl-inosine (C) in place of inosine at position 15.1 
leads to complete loss of cleavage activity. 

15 

Modifications at the purine 15.1 C2 and/or N3 position (Figure 36 D, E, F) 

The extremely low activity observed with the G- 15.1 (D) substituted analog may be 
explained by the formation of a G-C Watson-Crick base pair. The replacement of the I«C 
pair with a G»C pair can significantly distort the geometry at the 15.1-16.1 position. G- 

20 15.1 N2-alkylation (E) gives only minimal recovery of catalytic activity compared to G- 
15.1, suggesting that the steric problems introduced by the bulky N-methyl groups may 
interfere with stacking interactions. The activity of this construct is significantly less than 
that of iso-G-15.1 (Figure 37, E) containing ribozymes in the standard A-U context. 
Xanthosine 15.1 (F) contains the same functional groups as inosine at the Nl and C6 sites 

25 but contains an additional hydrogen-bond donor site at position N3 along with a C2 
carbonyl group. The complete lack of activity seen with this construct reinforces the 
importance of the purine N3 acceptor functionality in transition state formation. Similarly, 
3-deaza-adenosine (Figure 37, F) containing ribozymes were also inactive. The C2 
carbonyl of the 15.1 purine shows no significant negative interference in iso-guanosine 

30 containing 15.1 ribozymes . 
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Activity of modified core variants 

To complete the characterization of the I«C pair containing ribozymes, the 
acceptance of various core substitution patterns was tested. Short substrates containing 
GCH and GUH (H= non G) triplets were compared using 3 different modified ribozymes. 
5 The acceptance of the U-4 2 '-O-alkyl substituent is the greatest with GCA triplets while U- 
4= 2'-deoxy-2'-amino uridine and U-4 = ribo uridine substituted ribozymes show a similar 
level of activity with NCH and NUH triplets. The results of this comparison are 
summarized in Table 64. In addition, a ribozyme construct in which ribo inosine replaces 
adenosine at positions 14 and 15.1 was tested which demonstrated cleavage activity. 

10 Apart from the A-15.1 'U-16.1 to 1-15 .1 »C-16.1 change that reverses the polarity 

of an important H-bond in the ribozyme structure, no other functional group changes at 
the 15.1 purine residue seem to be compatible with the requirements of efficient 
catalysis. The 1-15.1 and A-15.1 ribozymes are equally suitable for practical 
applications because there are only minor differences in the acceptance of stabilizing 

1 5 residues. 



Example 13: Activity of NCH Ribozyme to inhibit HER2 gene expression 

Applicant has designed, synthesized and tested several NCH ribozymes and HH 

ribozymes targeted against HER2 RNA (see, for example, Tables 31 and 34) in cell 
20 proliferation assays. 

Proliferation assay: The model proliferation assay used in the study can require a 

cell plating density of 2000 cells/well in 96-well plates and at least 2 cell doublings over a 

5-day treatment period. To calculate cell density for proliferation assays, the FIPS (fluoro- 

imaging processing system) method well in the art was used. This method allows for cell 
25 density measurements after nucleic acids are stained with CyQuant® dye, and has the 

advantage of accurately measuring cell densities over a very wide range 1,000-100,000 

cells/well in 96-well format. 

Ribozymes (50-200 nM) were delivered in the presence of cationic lipid at 2.0 

Hg/mL and inhibition of proliferation was determined on day 5 post-treatment. Two full 
30 ribozyme screens were completed and 4 lead HH and 1 1 lead NCH ribozymes were 

chosen for further testing. Of the 15 lead Rzs chosen from primary screens, 4 NCH and 1 
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#7/ Rzs continued to inhibit cell proliferation in subsequent experiments. NCH Rzs 
against sites, 2001 (RPI No. 17236), 2783 (RPI No. 17249), 2939 (RPI No. 17251) or 
3998 (RPI No. 17262) caused inhibition of proliferation ranging from 25-60% as 
compared to a scrambled control Rz (IA; RPI No. 1 7263). Of the five lead Rzs, the most 
5 efficacious is the NCH Rz (RPI No. 1 725 1) against site 2939 of HER2 RNA. An 
example of results from cell culture assay is shown in Figure 3. Referring to Figure 3, 
NCH ribozymes and a HH ribozyme targeted against HER2 RNA are shown to cause 
significant inhibition of proliferation of cells. This shows that ribozymes, for instance, the 
NCH ribozymes are capable of inhibiting HER2 gene expression in mammalian cells. 

0 

Example 14: Activity of NCH Ribozyme to inhibit PKC alpha gene expression 

The Protein Kinase C family contains twelve currently known isozymes divided into 
three classes: the classic, Ca 4-4 " dependent (PKCa, (31, pn, y), the novel, non-Ca ++ 
dependent (PKC8, e, u, r|, 0) and the atypical (PKC irk); all of which are 

5 serine/threonine kinases. These isozymes show distinct and overlapping tissue, cellular, 
and subcellular distribution. They aid in the regulation of cell growth and differentiation 
through their response to second messenger products of lipid metabolism (Blobe, et al, 
1996, Cancer Surveys, 27, 213-248). These second messengers include diacylglyceral 
(DAG), inositol-triphosphate (IP3), lysophospholipids, free fatty acids, and phosphatidate 

!0 which act directly or in addition to changes in the Ca^ concentration. A simple model for 
PKCa activation follows a two step mechanism. First, membrane association of PKCa is 
through Ca^ and phospholipid interactions and second, the kinase is activated by 
interaction with DAG. An example of a signal cascade subsequent to PKC activation is 
PKC's phosphorylation of c-Raf, which phosphorylates MEK, which phosphorylates 

5 MAP, which phosphorylates transcription factors such as Jun and thereby activates a 
mitogenic program in the nucleus. There are numerous substrates for the various PKC's, 
one which for PKCa ultimately stimulates transcription factors that activate P- 
glycoprotein (P-gp) causing the multi-drug resistant phenotype (MDR) (Blobe, et al., 
1994, Cancer and Metastasis Reviews, 13, 41 1-431). 
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Cell Culture Review 

PKC's have been implicated in tumor promotion since the discovery that these 
molecules can serve as receptors for tumor-promoting phorbol esters. An increase in PKC 
overexpression in numerous tumor cell lines and tumor tissues has also been 

5 demonstrated. PKC overexpression has been shown to be associated with increased 

invasion and metastasis in mouse Lewis lung carcinoma, mouse B16 melanoma (Lee et al, 
1997, Molecular Carcinogenesis, 18, 44-53), mouse mammary adenocarcinoma, mouse 
fibrosarcoma, human lung carcinoma (Wang and Liu, 1998, Acta Pharmacologica Sinica, 
19, 265-268), human bladder carcinoma, human pancreatic cancer (Denham et al., 1998, 

0 Surgery, 124, 218-223), and human gastric cancer (Dean et al, 1996, Cancer Research, 
56, 3499-3507). Mounting evidence suggests PKCct can stimulate adhesion molecule 
expression and can directly act on these membrane bound species as substrates, thereby 
modulating cellular adhesion to the extracellular matrix and increasing metastic potential. 
Furthermore, human surgical specimens have demonstrated elevated PKC in breast 

5 tumors, thyroid carcinomas and melanomas (Becker et al, 1990, Oncogene, 5, 1 133- 
1139). 

Utz et al, 1994, Int. J. Cancer, 57, 104-1 10, describe a cell proliferation assay in 
which small molecule inhibitors of PKC demonstrate anti-proliferative activity in CCRF- 
VCR 1000 and KB-85 1 1 cells with the multidrug resistant (MDR) phenotype. PKCcc is 

0 overexpressed in tumor tissues that express the MDR phenotype. This phenotype is 
associated with the expression of a 170 kDa broad specificity drug efflux pump, P-gp. 
PKCa phosphorylation of P-gp has been shown in vitro. In addition, PKC expression 
correlates with resistance to doxorubicin and high P-gp levels in human renal carcinoma 
and non-small cell lung carcinoma. Inhibitors of PKC partially reverse the MDR 

5 phenotype and decrease phosphorylation of P-gp (Caponigro et al, 1997, Anti-Cancer 
Drugs, 8, 26-33). 

Dean et al, 1994, Journal of Biological Chemistry, 269, 16416-24, describe cell 
culture studies in which antisense targeting of PKC a resulted in the potent inhibition of 
mRNA and protein expression in human lung carcinoma (A549) cells. In this study, PKC 
0 a inhibition resulted in the reduced induction of intercellular adhesion molecule 1 (ICAM- 
1) mRNA by phorbol esters. 
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Yano et al, 1999, Endocrinology, 140, 4622-4632, describe a cell proliferation study 
in which down regulation of different PKC isoforms, including PKCa, results in the 
inhibition of insulin like growth factor I induced vascular smooth muscle cell proliferation, 
migration, and gene expression. 

5 Wang et al, 1999, Experimental Cell Research, 250, 253-263, describe cell culture 

studies in which antisense inhibition of PKCa results in the reversal of the transformed 
phenotype in human lung carcinoma (LTEPa-2) cells. In this study, the amounts of PKCa 
protein and total PKC activity were decreased when compared to control cells. 
Sioud and Sorensen, 1998, 'Nature Biotechnology, 16, 556-561, describe 

0 hammerhead ribozyme inhibition of PKCa in rat glioma cell lines (BT4C and BT4Cn). 
This study demonstrated inhibition of malignant glioma cell proliferation along with the 
inhibition of regulatory Bcl-x L protein expression. Bcl-x L is overexpressed in glioma cells 
and is an apoptosis inhibitor. The ribozyme mediated inhibition of cell proliferation 
presumably results from apoptosis induction of transformed glioma cells through 

5 suppression of PKCa and Bcl-x L (Leirdal and Sioud, 1 999, British J. of Cancer, 80, 1558- 
1564). 

Animal Models 

Evaluating the efficacy of anti-PKCa agents in animal models is an important 
prerequisite to human clinical trials. A variety of mouse xenograft models using human 

0 tumor cell lines have been developed using cell lines which express high levels of PKCa 
protein. McGrawe/a/, \991 , Anti-Cancer Drug Design, 12, 315-326, describe mouse 
xenograft models using human breast (MDA MB-321), prostate (Du-145), colon (Colo 
205, WiDr), lung (NCI H69, H209, J460, H520, A549), bladder (T-24), and melanoma 
(SK-mel 1) carcinoma cells. Antisense oligonucleotides targeting PKCa administered 

5 intravenously following s.c. transplanted tumor cells resulted in dose dependant decreases 
in tumor size when compared to controls in most cases. Similar studies using T-24 
bladder carcinoma, non-small cell lung carcinoma (A549), and Colo 205 colon carcinoma 
mouse xenografts are described in Dean et al, 1996, Biochemical Society Transactions, 24, 
623. Sioud and Sorensen, 1998, Nature Biotechnology, 16, 556-561, describe a rat model 

0 in which inbred syngeneic BDEX rats were inoculated subcutaneously with BT4Cn glioma 
cells. After approximately three weeks, rats were treated with a single injection of 
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ribozyme targeting PKCa resulting in inhibition of tumor growth as determined by tumor 
size and/or weight when compared to controls. The above studies provide proof that 
inhibition of PKCa expression by anti-PKCa agents causes inhibition of tumor growth in 
animals. Lead anti-PKCa ribozymes chosen from in vitro assays can be further tested in 
5 mouse xenograft models. Ribozymes can be first tested alone and then in combination 
with standard chemotherapies. 
Animal Model Development 

Human lung (A549, NCI H520) tumor and breast (MDA-MB 231) cell lines can be 
characterized to establish their growth curves in mice. These cell lines are been implanted 

1 0 into both nude and SCID mice and primary tumor volumes are measured 3 times per week. 
Growth characteristics of these tumor lines using a Matrigel implantation format can also 
be established. In addition, the use of other cell lines that have been engineered to express 
high levels of PKCa can also be used. The tumor cell line(s) and implantation method 
that supports the most consistent and reliable tumor growth can be used in animal studies 

15 to test promising PKCa ribozyme(s). Ribozymes can be administered by daily 

subcutaneous injection or by continuous subcutaneous infusion from Alzet mini osmotic 
pumps beginning 3 days after tumor implantation and continuing for the duration of the 
study. Group sizes of at least 10 animals are employed. Efficacy is determined by 
statistical comparison of tumor volume of ribozyme-treated animals to a control group of 

20 animals treated with saline alone. Because the growth of these tumors is generally slow 
(45-60 days), an initial endpoint will be the time in days it takes to establish an easily 
measurable primary tumor (i.e. 50-100 mm 3 ) in the presence or absence of ribozyme 
treatment. 
Clinical Summary 

25 Overview 

Ribozymes targeting PKCa have strong potential to develop into useful therapeutics 
directed towards numerous cancer types. Lung cancer is the leading cause of cancer deaths 
for both men and women in the USA. The incidence of lung cancer in the United States is 
~1 72,000 cases per year, accounting for 14% of cancer diagnoses. Approximately 158,000 
30 die each year of lung cancer, accounting for 28% of all cancer deaths. Numerous other 
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indications exist including cancers of the bladder, colon, breast, prostate, and ovary in 
addition to melanoma and glioblastoma. 

McGraw et al, 1997, Anti-Cancer Drug Design, 12, 315-326, describe a Phase I trial 
for ISIS 3521/CGP 64128A, a PKC alpha antisense construct. In this trial, ISIS 3521/CGP 
5 64128A was administered as either a two-hour i.v. infusion three times per week for three 
consecutive weeks, or as a continuous i.v. infusion for twenty-one consecutive days. The 
authors report that patients demonstrated excellent tolerance to the antisense compound 
when administered at doses of up to 2.5 mg/kg by the two-hour i.v. infusion and at 1 .5 
mg/kg/day by continuous i.v. infusion. In patients receiving the two-hour i.v. infusion 

1 0 schedule, the post-infusion plasma concentration of the compound increased proportional 
to the dose, and metabolites were determined to have been cleared rapidly from plasma 
with a half-life of thirty to forty- five minutes. These metabolites were composed of chain- 
shortened oligonucleotides, consistent with exonuclease-mediated degradation. No 
evidence of accumulation, induction, or inhibition of metabolism was found after the 

1 5 administration of repetitive doses. 
Therapy 

Treatment options for lung cancer are determined by the type and stage of the cancer 
and include surgery, radiation therapy, and chemotherapy. For many localized cancers, 
surgery is usually the treatment of choice. Because the disease has usually spread by the 

20 time it is discovered, radiation therapy and chemotherapy are often needed in combination 
with surgery. Chemotherapy alone or combined with radiation has replaced surgery as the 
treatment of choice for small cell lung cancer; on this regimen, a large percentage of 
patients experience remission, which in some cases is long-lasting. The I -year relative 
survival rates for lung cancer have increased from 32% in 1973 to 41% in 1994, largely 

25 due to improvements in surgical techniques. The 5-year relative survival rate for all stages 
combined is only 14%. The survival rate is 50% for cases detected when the disease is still 
localized, but only 15% of lung cancers are discovered that early. 

Common chemotherapies include various combinations of cytotoxic drugs to kill the 
cancer cells. These drugs include paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 

30 cyclophosphamide, doxorubin, fluorouracil etc. Significant toxicities are associated with 
these cytotoxic therapies. Well-characterized toxicities include nausea and vomiting, 
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myelosuppression, alopecia and mucosity. Serious cardiac problems are also associated 
with certain of the combinations, e.g. doxorubin and paclitaxel, but are less common. 

Applicant has designed several NCH ribozymes targeted against PKCa RNA 
(Genebank accession No NM 002737) (see, for example, Table 63). These ribozymes are 

5 used first in a proliferation assay that is used to select ribozyme leads. 

Proliferation assay: The model proliferation assay useful in the study can require a 
cell plating density of 2000 cells/well in 96-well plates and at least 2 cell doublings over a 
5-day treatment period. To calculate cell density for proliferation assays, the FTPS (fluoro- 
imaging processing system) method well known in the art can be used. This method allows 

0 for cell density measurements after nucleic acids are stained with CyQuant® dye, and has 
the advantage of accurately measuring cell densities over a very wide range 1,000-100,000 
cells/well in 96-well format. 

Ribozymes (50-200 nM) are delivered in the presence of cationic lipid at 2.0 ug/mL 
and inhibition of proliferation is determined on day 5 post-treatment. Two full ribozyme 

5 screens are usually completed and lead ribozymes are chosen for further testing. Of the 
lead ribozymes chosen from primary screens, ribozymes which continue to inhibit cell 
proliferation in subsequent experiments are selected for PKCa RNA and protein inhibition 
studies. 

!0 Example 15: Nucleoside Triphosphates and their incorporation into oligonucleotides 

The synthesis of nucleotide triphosphates and their incorporation into nucleic acids 
using polymerase enzymes has greatly assisted in the advancement of nucleic acid 
research. The polymerase enzyme utilizes nucleotide triphosphates as precursor molecules 
to assemble oligonucleotides. Each nucleotide is attached by a phosphodiester bond 

!5 formed through nucleophilic attack by the 3 ' hydroxyl group of the oligonucleotide's last 
nucleotide onto the 5' triphosphate of the next nucleotide. Nucleotides are incorporated 
one at a time into the oligonucleotide in a 5' to 3' direction. This process allows RNA to 
be produced and amplified from virtually any DNA or RNA templates. 

Most natural polymerase enzymes incorporate standard nucleotide triphosphates into 

10 nucleic acid. For example, a DNA polymerase incorporates dATP, dTTP, dCTP, and 
dGTP into DNA and an RNA polymerase generally incorporates ATP, CTP, UTP, and 
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GTP into RNA. There are however, certain polymerases that are capable of incorporating 
non-standard nucleotide triphosphates into nucleic acids (Joyce, 1997, PNAS 94, 1619- 
1622, Huang et al., Biochemistry 36, 8231-8242). 

Before nucleosides can be incorporated into RNA transcripts using polymerase 
5 enzymes they must first be converted into nucleotide triphosphates which can be 

recognized by these enzymes. Phosphorylation of unblocked nucleosides by treatment 
with POCh and trialkyl phosphates was shown to yield nucleoside 5'- 
phosphorodichloridates (Yoshikawa et al, 1969, Bull. Chem. Soc.(Japan) 42, 3505). 
Adenosine or 2'-deoxyadenosine 5 '-triphosphate was synthesized by adding an additional 
1 0 step consisting of treatment with excess tri-n-butylammonium pyrophosphate in DMF 
followed by hydrolysis (Ludwig, 1981, Acta Biochim. et Biophys. Acad. Sci. Hung. 16, 
131-133). 

Non-standard nucleotide triphosphates are not readily incorporated into RNA 
transcripts by traditional RNA polymerases. Mutations have been introduced into RNA 
1 5 polymerase to facilitate incorporation of deoxyribonucleotides into RNA (Sousa & Padilla, 
\995,EMBOJ. 14,4609-4621, Bonner et al., \992,EMBOJ. 11, 3767-3775, Bonner etal., 
1994, J. Biol. Chem. 42, 25120-25128, Aurup et al., 1992, Biochemistry 31, 9636-9641). 

McGee et al., International PCT Publication No. WO 95/35102, describes the 
incorporation of 2'-NH 2 -NTP's, 2'-F-NTP's, and 2'-deoxy-2'-benzyloxyamino UTP into 
20 RNA using bacteriophage T7 polymerase. 

Wieczorek et al, 1994, Bioorganic & Medicinal Chemistry Letters 4, 987-994, 
describes the incorporation of 7-deaza-adenosine triphosphate into an RNA transcript 
using bacteriophage T7 RNA polymerase. 

Lin et al, 1994, Nucleic Acids Research 22, 5229-5234, reports the incorporation of 
25 2'-NH 2 -CTP and 2'-NH 2 -UTP into RNA using bacteriophage T7 RNA polymerase and 
polyethylene glycol containing buffer. The article describes the use of the polymerase 
synthesized RNA for in vitro selection of aptamers to human neutrophil elastase (HNE). 

This invention relates to novel nucleotide triphosphate (NTP) molecules, and their 
incorporation into nucleic acid molecules, including nucleic acid catalysts. The NTPs of 
30 the instant invention are distinct from other NTPs known in the art. The invention further 
relates to incorporation of these nucleotide triphosphates into oligonucleotides using an 
RNA polymerase; the invention further relates to novel transcription conditions for the 
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incorporation of modified (non-standard) and unmodified NTP's, into nucleic acid 
molecules. Further, the invention relates to methods for synthesis of novel NTP's 

In a first aspect, the invention features NTP's having the formula triphosphate-OR, 
for example the following formula 3: 




where R is any nucleoside; specifically the nucleosides 2'-0-methyl-2,6- 
diaminopurine riboside; 2'-deoxy-2'amino-2,6-diaminopurine riboside; 2'-(A^-alanyl) 
amino-2'-deoxy-uridine; 2'-(N-phenylalanyl)amino-2'-deoxy-uridine; 2'-deoxy -2'-(A r -P- 
alanyl) amino ; 2'-deoxy-2'-(lysiyl) amino uridine; 2'-C-allyl uridine; 2'-0-amino-uridine; 

0 2'-0-methylthiometbyl adenosine; 2'-<9-methylthiomethyl cytidine ; V-O- 

methylthiomethyl guanosine; 2'-0-methylthiomethyl-uridine; 2'-deoxy-2'-(jV-histidyl) 
amino uridine; 2'-deoxy-2'-amino-5-methyl cytidine; 2'-(A^-p-carboxamidine-P- 
alanyl)amino-2' -deoxy-uridine; 2 '-deoxy-2 '-(N-P-aIanyl)-guanosine; 2 '-O-amino- 
adenosine; 2'-(A r -lysyl)amino-2'-deoxy-cytidine; 2'-Deoxy -2'-(L-histidine) amino 

5 Cytidine; 5-Imidazoleacetic acid 2-deoxy uridine, 5-[3-(N-4- 

imidazoleacetyl)aminopropynyl] -2 ' -O-methyl uridine, 5 -(3 -aminopropynyl)-2 ' -O-methyl 
uridine, 5-(3-aminopropyl)-2'-0-methyl uridine, 5-[3-(N-4-imidazoleacetyl)aminopropyl]- 
2'-0-methyl uridine, 5-(3-aminopropyl)-2'-deoxy-2-fluoro uridine, 2'-Deoxy-2'-(P-alanyl- 
L-histidyl)amino uridine, 2'-deoxy-2 , -P-alaninamido-uridine, 3-(2'-deoxy-2'-fiuoro-P-D- 

0 ribofuranosyl)piperazino[2,3-D]pyrimidine-2-one, 5-[3-(N-4- 

imidazoleacetyl)aminopropyl]-2'-deoxy-2'-fluoro uridine, 5-[3-(N-4- 
imidazoleacetyl)aminopropynyl] -2 ' -deoxy-2 ' -fluoro uridine, 5 -E-(2-carboxyvinyl-2 '- 
deoxy-2'-fluoro uridine, 5-[3-(N-4-aspartyl)aminopropynyl-2'-fluoro uridine, 5-(3- 
aminopropyl)-2'-deoxy-2-fluoro cytidine, and 5-[3-(N-4-succynyl)aminopropyl-2'-deoxy- 

5 2-fluoro cytidine. 

In a second aspect, the invention features inorganic and organic salts of the 
nucleoside triphosphates of the instant invention. 
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In a third aspect, the invention features a process for the synthesis of pyrimidine 
nucleotide triphosphate (such as UTP, 2'-0-MTM-UTP, dUTP and the like) including the 
steps of monophosphorylation where the pyrimidine nucleoside is contacted with a 
mixture having a phosphorylating agent (such as phosphorus oxychloride, phospho-tris- 

5 triazolides, phospho-tris-triimidazolides and the like), trialkyl phosphate (such as 
triethylphosphate or trimethylphosphate or the like) and a hindered base (such as 
dimethylaminopyridine, DMAP and the like) under conditions suitable for the formation of 
pyrimidine monophosphate; and pyrophosphorylation where the pyrimidine 
monophosphate is contacted with a pyrophosphorylating reagent (such as 

1 0 tributylammonium pyrophosphate) under conditions suitable for the formation of 
pyrimidine triphosphates. 

By "nucleotide triphosphate" or "NTP" is meant a nucleoside bound to three 
inorganic phosphate groups at the 5' hydroxyl group of the modified or unmodified ribose 
or deoxyribose sugar where the 1 ' position of the sugar may comprise a nucleic acid base 

1 5 or hydrogen. The triphosphate portion may be modified to include chemical moieties 

which do not destroy the functionality of the group (i.e., allow incorporation into an RNA 
molecule). 

In another preferred embodiment, nucleotide triphosphates (NTPs) of the instant 
invention are incorporated into an oligonucleotide using an RNA polymerase enzyme. 

!0 RNA polymerases include but are not limited to mutated and wild type versions of 

bacteriophage T7, SP6, or T3 RNA polymerases. Applicant has also found that the NTPs 
of the present invention can be incorporated into oligonucleotides using certain DNA 
polymerases, such as Taq polymerase. 

In yet another preferred embodiment, the invention features a process for 

15 incorporating modified NTP's into an oligonucleotide including the step of incubating a 
mixture having a DNA template, RNA polymerase, NTP, and an enhancer of modified 
NTP incorporation under conditions suitable for the incorporation of the modified NTP 
into the oligonucleotide. 

By "enhancer of modified NTP incorporation" is meant a reagent which facilitates 

10 the incorporation of modified nucleotides into a nucleic acid transcript by an RNA 

polymerase. Such reagents include, but are not limited to, methanol, LiCl, polyethylene 
glycol (PEG), diethyl ether, propanol, methyl amine, ethanol, and the like. 
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In another preferred embodiment, the modified nucleotide triphosphates can be 
incorporated by transcription into a nucleic acid molecules including enzymatic nucleic 
acid, antisense, 2-5A antisense chimera, oligonucleotides, triplex forming oligonucleotide 
(TFO), aptamers and the like (Stull et al, 1995 Pharmaceutical Res. 12, 465). 

5 By "triplex forming oligonucleotides (TFO)" it is meant an oligonucleotide that can 

bind to a double-stranded DNA in a sequence-specific manner to form a triple-strand 
helix. Formation of such triple helix structure has been shown to inhibit transcription of 
the targeted gene (Duval-Valentin et al, 1992 Proc. Natl. Acad. Set. USA 89, 504). 

In yet another preferred embodiment, the modified nucleotide triphosphates of the 

0 instant invention can be used for combinatorial chemistry or in vitro selection of nucleic 
acid molecules with novel function. Modified oligonucleotides can be enzymatically 
synthesized to generate libraries for screening. 

In another preferred embodiment, the invention features nucleic acid based 
techniques (e.g., enzymatic nucleic acid molecules), antisense nucleic acids, 2-5 A 

5 antisense chimeras, triplex DNA, antisense nucleic acids containing RNA cleaving 

chemical groups) isolated using the methods described in this invention and methods for 
their use to diagnose, down regulate or inhibit gene expression. 

In yet another preferred embodiment, the invention features enzymatic nucleic acid 
molecules targeted against HER2 RNA, specifically including ribozymes in the class II 

!0 (zinzyme) motif. 

Targets, for example HER2 RNA, for useful ribozymes and antisense nucleic acids 
can be determined, for example, as described in Draper et al, WO 93/23569; Sullivan et 
al., WO 93/23057; Thompson et al, WO 94/02595; Draper et al, WO 95/04818; 
McSwiggen et al, US Patent Nos. 5,525,468 and 5,646,042, all are hereby incorporated by 

5 reference herein in their totalities. Other examples include the following PCT 

applications, which concern inactivation of expression of disease-related genes: WO 
95/23225, and WO 95/13380; all of which are incorporated by reference herein. 

In yet another preferred embodiment, the invention features a process for 
incorporating a plurality of compounds of formula 3. 

0 In yet another embodiment, the invention features a nucleic acid molecule with 

catalytic activity having formula 4: 
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In the formula shown above X, Y, and Z represent independently a nucleotide or a 
non-nucleotide linker, which may be same or different; • indicates hydrogen bond 
5 formation between two adjacent nucleotides which may or may not be present; Y' is a 
nucleotide complementary to Y; Z' is a nucleotide complementary to Z; 1 is an integer 
greater than or equal to 3 and preferably less than 20, more specifically 4, 5, 6, 7, 8, 9, 10, 
1 1, 12, or 15; m is an integer greater than 1 and preferably less than 10, more specifically 
2, 3, 4, 5, 6, or 7; n is an integer greater than 1 and preferably less than 10, more 

1 0 specifically 3, 4, 5, 6, or 7; o is an integer greater than or equal to 3 and preferably less 
than 20, more specifically 3 , 4, 5, 6, 7, 8, 9, 1 0, 1 1 , 1 2, or 1 5 ; 1 and o may be the same 
length (1 = o) or different lengths (1 * o); each X(l) and X(o) are oligonucleotides which are 
of sufficient length to stably interact independently with a target nucleic acid sequence (the 
target can be an RNA, DNA or RNA/DNA mixed polymers); W is a linker of > 2 

1 5 nucleotides in length or may be a non-nucleotide linker; A, U, C, and G represent the 

nucleotides; G is a nucleotide, preferably 2'-0-methyl or ribo; A is a nucleotide, preferably 
2'-0-methyl or ribo; U is a nucleotide, preferably 2'-arnino (e.g., 2'-NH 2 or 2'-0- NH 2 ), 
2'-0-methyl or ribo; C represents a nucleotide, preferably 2'-amino (e.g., 2'-NH 2 or 2'-0- 
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NH2), and represents a chemical linkage (e.g. a phosphate ester linkage, amide 

linkage, phosphorothioate, phosphorodithioate or others known in the art). 

In yet another embodiment, the invention features a nucleic acid molecule with 
catalytic activity having formula 5: 



5' (X)o 




5 In the formula shown above X, Y, and Z represent independently a nucleotide or a 

non-nucleotide linker, which may be same or different; • indicates hydrogen bond 
formation between two adjacent nucleotides which may or may not be present; Z' is a 
nucleotide complementary to Z; 1 is an integer greater than or equal to 3 and preferably 
less than 20, more specifically 4, 5, 6, 7, 8, 9, 10, 11, 12, or 15; n is an integer greater than 

1 0 1 and preferably less than 10, more specifically 3, 4, 5, 6, or 7; 0 is an integer greater than 
or equal to 3 and preferably less than 20, more specifically 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, or 
15; 1 and 0 may be the same length (1 = o) or different lengths (1 * o); each X(i) and X( o) 
are oligonucleotides which are of sufficient length to stably interact independently with a 
target nucleic acid sequence (the target can be an RNA, DNA or RNA/DNA mixed 

1 5 polymers); X( 0 ) preferably has a G at the 3'-end, Xq) preferably has a G at the 5'-end; W 
is a linker of > 2 nucleotides in length or may be a non-nucleotide linker; Y is a linker of S 
1 nucleotides in length, preferably G, 5'-CA-3', or 5'-CAA-3', or may be a non-nucleotide 
linker; A, U, C, and G represent nucleotides; G is a nucleotide, preferably 2'-0-methyl, 
2'-deozy-2'-fluoro, or 2'-OH; A is a nucleotide, preferably 2'-0-methyl, 2'-deozy-2'- 
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fluoro, or 2'-OH; U is a nucleotide, preferably 2'-0-methyl, 2'-deozy-2'-fluoro, or 2'-OH; 

C represents a nucleotide, preferably 2'-amino {e.g., 2'-NH2 or 2'-0- NH 2 , and 

represents a chemical linkage (e.g. a phosphate ester linkage, amide linkage, 

phosphorothioate, phosphorodithioate or others known in the art). 
5 The enzymatic nucleic acid molecules of Formula 4 and Formula 5 may 

independently comprise a cap structure which may independently be present or absent. 

In yet another preferred embodiment, the 3 '-cap is selected from a group comprising, 

4',5'-methylene nucleotide; l-(beta-D-erythrofuranosyl) nucleotide; 4'-thio nucleotide; 

carbocyclic nucleotide; 5'-amino-alkyl phosphate; l,3-diamino-2-propyl phosphate; 3- 
1 0 aminopropyl phosphate; 6-aminohexyl phosphate; 1 ,2-aminododecyl phosphate; 

hydroxypropyl phosphate; 1,5-anhydrohexitol nucleotide; L-nucleotide; alpha-nucleotide; 

modified base nucleotide; phosphorodithioate; /Areo-pentofuranosyl nucleotide; acyclic 

3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide; 3,5-dihydroxypentyl nucleotide; 5'-5'- 

inverted nucleotide moiety; 5'-5'-inverted abasic moiety; 5'-phosphoramidate; 5'- 
1 5 phosphorothioate; 1 ,4-butanediol phosphate 5'-amino; bridging and/or non-bridging 5'- 

phosphoramidate, phosphorothioate and/or hosphorodithioate; bridging or non bridging 

methylphosphonate and 5'-mercapto moieties (for more details, see Beaucage and Iyer, 

1993, Tetrahedron 49, 1925; incorporated by reference herein). 

In another aspect, the invention provides mammalian cells containing one or more 
20 nucleic acid molecules and/or expression vectors of this invention. The one or more 

nucleic acid molecules may independently be targeted to the same or different sites. 

Nucleotide Synthesis 

Addition of dimethylaminopyridine (DMAP) to the phosphorylation protocols 

25 known in the art can greatly increase the yield of nucleotide monophosphates while 
decreasing the reaction time. Synthesis of the nucleosides of the invention have been 
described in several publications and Applicants previous applications (Beigelman et al, 
International PCT publication No. WO 96/18736; Dudzcy et al, Int. PCT Pub. No. WO 
95/1 1910; Usman et al, Int. PCT Pub. No. WO 95/13378; Matulic-Adamic et al, 1997, 

30 Tetrahedron Lett. 38, 203; Matulic-Adamic et al, 1997, Tetrahedron Lett. 38, 1669; all of 
which are incorporated herein by reference). These nucleosides are dissolved in triethyl 
phosphate and chilled in an ice bath. Phosphorus oxychloride (POCI3) is then added 
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followed by the introduction of DMAP. The reaction is then warmed to room temperature 
and allowed to proceed for 5 hours. This reaction allows the formation of nucleotide 
monophosphates which can then be used in the formation of nucleotide triphosphates. 
Tributylamine is added followed by the addition of anhydrous acetonitrile and 
5 tributylammonium pyrophosphate. The reaction is then quenched with TEAB and stirred 
overnight at room temperature (about 20°C). The triphosphate is purified using 
Sephadex® column purification or equivalent and/or HPLC and the chemical structure is 
confirmed using NMR analysis. Those skilled in the art will recognize that the reagents, 
temperatures of the reaction, and purification methods can easily be alternated with 
1 0 substitutes and equivalents and still obtain the desired product. 



Nucleotide Triphosphates 

The invention provides nucleotide triphosphates which can be used for a number of 
different functions. The nucleotide triphosphates formed from nucleosides found in Table 

15 45 are unique and distinct from other nucleotide triphosphates known in the art. 

Incorporation of modified nucleotides into DNA or RNA oligonucleotides can alter the 
properties of the molecule. For example, modified nucleotides can hinder binding of 
nucleases, thus increasing the chemical half-life of the molecule. This is especially 
important if the molecule is to be used for cell culture or in vivo. It is known in the art that 

20 the introduction of modified nucleotides into these molecules can greatly increase the 

stability and thereby the effectiveness of the molecules (Burgin et al, 1996, Biochemistry 
35, 14090-14097; Usman et al, 1996, Curr. Opin. Struct. Biol. 6, 527-533). 

Modified nucleotides are incorporated using either wild type or mutant polymerases. 
For example, mutant T7 polymerase is used in the presence of modified nucleotide 

25 triphosphate(s), DNA template and suitable buffers. Those skilled in the art will recognize 
that other polymerases and their respective mutant versions can also be utilized for the 
incorporation of NTP's of the invention. Nucleic acid transcripts were detected by 
incorporating radiolabeled nucleotides (ct- 32 P NTP). The radiolabeled NTP contained the 
same base as the modified triphosphate being tested. The effects of methanol, PEG and 

30 LiCl were tested by adding these compounds independently or in combination. Detection 
and quantitation of the nucleic acid transcripts was performed using a Molecular Dynamics 
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Phosphorlmager. Efficiency of transcription was assessed by comparing modified 
nucleotide triphosphate incorporation with all-ribonucleotide incorporation control. Wild- 
type polymerase was used to incorporate NTP ? s using the manufacturer's buffers and 
instructions (Boehringer Mannheim). 

5 

Transcription Conditions 

Incorporation rates of modified nucleotide triphosphates into oligonucleotides can be 
increased by adding to traditional buffer conditions, several different enhancers of 
modified NTP incorporation. Applicant has utilized methanol and LiCl in an attempt to 

1 0 increase incorporation rates of dNTP using RNA polymerase. These enhancers of 

modified NTP incorporation can be used in different combinations and ratios to optimize 
transcription. Optimal reaction conditions differ between nucleotide triphosphates and can 
readily be determined by standard experimentation. Overall, however, Applicant has 
found that inclusion of enhancers of modified NTP incorporation such as methanol or 

1 5 inorganic compound such as lithium chloride increase the mean transcription rates. 

Applicant synthesized pyrimidine nucleotide triphosphates using DMAP in the 
reaction. For purines, applicant utilized standard protocols previously described in the art 
(Yoshikawa et al supra;. Ludwig, supra). Described below is one example of a pyrimdine 
nucleotide triphosphate and one purine nucleotide triphosphate synthesis. 

20 

Synthesis of purine nucleotide triphosphates: 2 '-O-methvl-guanosine-5' -triphosphate 

2'-0-methyl guanosine nucleoside (0.25 grams, 0.84 mmol) was dissolved in triethyl 
phosphate (5.0) ml by heating to 100°C for 5 minutes. The resulting clear, colorless 
solution was cooled to 0°C using an ice bath under an argon atmosphere. Phosphorous 

25 oxychloride (1.8 eq., 0.141 ml) was then added to the reaction mixture with vigorous 
stirring. The reaction was monitored by HPLC, using a sodium perchlorate gradient. 
After 5 hours at 0°C, tributylamine (0.65 ml) was added followed by the addition of 
anhydrous acetonitrile (10.0 ml), and after 5 minutes (reequilibration to 0°C) 
tributylammonium pyrophosphate (4.0 eq., 1.53 g) was added. The reaction mixture was 

30 quenched with 20 ml of 2M TEAB after 1 5 minutes at 0°C (HPLC analysis with above 

conditions showed consumption of monophosphate at 10 minutes) then stirred overnight at 
room temperature, the mixture was evaporated in vacuo with methanol co-evaporation 
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(4x) then diluted in 50 ml 0.05M TEAR DEAE sephadex purification was used with a 
gradient of 0.05 to 0.6 M TEAB to obtain pure triphosphate (0.52 g, 66.0% yield) (elutes 
around 0.3M TEAB); the purity was confirmed by HPLC and NMR analysis. 

5 Synthesis of Pvrimidine nucleotide triphosphates: 2'-0-methvlthiomethvl-uridine-5'- 
triphosphate 

2'-0-methylthiomethyl uridine nucleoside (0.27 grams, 1.0 mmol) was dissolved in 
triethyl phosphate (5.0 ml). The resulting clear, colorless solution was cooled to 0°C with 
an ice bath under an argon atmosphere. Phosphorus oxychloride (2.0 eq., 0. 190 ml) was 

1 0 then added to the reaction mixture with vigorous stirring. Dimethylaminopyridine 
(DMAP, 0.2eq., 25 mg) was added, the solution warmed to room temperature and the 
reaction was monitored by HPLC, using a sodium perchlorate gradient. After 5 hours at 
20°C, tributylamine (1 .0 ml) was added followed by anhydrous acetonitrile (10.0 ml), and 
after 5 minutes tributylammonium pyrophosphate (4.0 eq., 1.8 g) was added. The reaction 

1 5 mixture was quenched with 20 ml of 2M TEAB after 1 5 minutes at 20°C (HPLC analysis 
with above conditions showed consumption of monophosphate at 10 minutes) then stirred 
overnight at room temperature. The mixture was evaporated in vacuo with methanol co- 
evaporation (4x) then diluted in 50 ml 0.05M TEAB. DEAE fast flow Sepharose 
purification with a gradient of 0.05 to 1 .0 M TEAB was used to obtain pure triphosphate 

20 (0.40 g, 44% yield) (elutes around 0.3M TEAB) as determined by HPLC and NMR 
analysis. 

Utilization of DMAP in Uridine 5 '-Triphosphate Synthesis 

The reactions were performed on 20 mg aliquots of nucleoside dissolved in 1 ml of 

25 triethyl phosphate and 19 ul of phosphorus oxychloride. The reactions were monitored at 
40 minute intervals automatically by HPLC to generate yield-of-product curves at times up 
to 1 8 hours. A reverse phase column and ammonium acetate/ sodium acetate buffer 
system (50mM & lOOmM respectively at pH 4.2) was used to separate the 5', 3', 2' 
monophosphates (the monophosphates elute in that order) from the 5 '-triphosphate and the 

30 starting nucleoside. The data is shown in Table 46. These conditions doubled the product 
yield and resulted in a 10-fold improvement in the reaction time to maximum yield (1200 
minutes down to 120 minutes for a 90% yield). Selectivity for 5'-monophosphorylation 
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was observed for all reactions. Subsequent triphosphorylation occurred in nearly 
quantitative yield. 

Materials Used in Bacteriophage T7 RNA Polymerase Reactions 
5 Buffer 1: Reagents are mixed together to form a 10X stock solution of buffer 1 

(400 mM Tris-Cl [pH 8.1], 200 raM MgCl 2 , 100 raM DTT, 50 mM spermidine, and 0.1% 
triton® X-100). Prior to initiation of the polymerase reaction methanol, LiCl is added and 
the buffer is diluted such that the final reaction conditions for condition 1 consisted of : 
40mM tris (pH 8.1), 20mM MgCl 2 , 10 mM DTT, 5 mM spermidine, 0.01% triton® X- 

10 1 00, 1 0% methanol, and 1 mM LiCl. 

BUFFER 2: Reagents are mixed together to form a 10X stock solution of buffer 2 
(400 mM Tris-Cl [pH 8.1], 200 mM MgCl 2 , 100 mM DTT, 50 mM spermidine, and 0.1% 
triton® X-100). Prior to initiation of the polymerase reaction PEG, LiCl is added and the 
buffer is diluted such that the final reaction conditions for buffer 2 consisted of : 40mM 

1 5 tris (pH 8.1), 20mM MgCl 2 , 10 mM DTT, 5 mM spermidine, 0.01% triton® X-100, 4% 
PEG, and 1 mM LiCl. 

BUFFER 3: Reagents are mixed together to form a 10X stock solution of buffer 3 
(400 mM Tris-Cl [pH 8.0], 120 mM MgCl 2 , 50 mM DTT, 10 mM spermidine and 0.02% 
triton® X-100). Prior to initiation of the polymerase reaction PEG is added and the buffer 

20 is diluted such that the final reaction conditions for buffer 3 consisted of : 40mM tris (pH 
8.0), 12 mM MgCl 2 , 5 mM DTT, 1 mM spermidine, 0.002% triton® X-100, and 4% PEG. 

BUFFER 4: Reagents are mixed together to form a 10X stock solution of buffer 4 
(400 mM Tris-Cl [pH 8.0], 120 mM MgCl 2 , 50 mM DTT, 10 mM spermidine and 0.02% 
triton® X-100). Prior to initiation of the polymerase reaction PEG, methanol is added and 

25 the buffer is diluted such that the final reaction conditions for buffer 4 consisted of : 

40mM tris (pH 8.0), 12 mM MgCl 2 , 5 mM DTT, 1 mM spermidine, 0.002% triton® X- 
100, 10% methanol, and 4% PEG. 

BUFFER 5: Reagents are mixed together to form a 10X stock solution of buffer 5 
(400 mM Tris-Cl [pH 8.0], 120 mM MgCl 2 , 50 mM DTT, 10 mM spermidine and 0.02% 

30 triton® X-100). Prior to initiation of the polymerase reaction PEG, LiCl is added and the 
buffer is diluted such that the final reaction conditions for buffer 5 consisted of : 40mM 
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tris (pH 8.0), 12 mM MgCl 2 , 5 mM DTT, 1 mM spermidine, 0.002% triton® X-100, 1 
mM LiCl and 4% PEG. 

BUFFER 6: Reagents are mixed together to form a 10X stock solution of buffer 6 
(400 mM Tris-Cl [pH 8.0], 120 mM MgCl 2 , 50 mM DTT, 10 mM spermidine and 0.02% 
5 triton® X-l 00). Prior to initiation of the polymerase reaction PEG, methanol is added and 
the buffer is diluted such that the final reaction conditions for buffer 6 consisted of : 
40mM tris (pH 8.0), 12 mM MgCl 2 , 5 mM DTT, 1 mM spermidine, 0.002% triton® X- 
100, 10% methanol, and 4% PEG. 

BUFFER 7: Reagents are mixed together to form a 10X stock solution of buffer 6 
1 0 (400 mM Tris-Cl [pH 8.0], 120 mM MgCl 2 , 50 mM DTT, 10 mM spermidine and 0.02% 
triton® X-100). Prior to initiation of the polymerase reaction PEG, methanol and LiCl is 
added and the buffer is diluted such that the final reaction conditions for buffer 6 consisted 
of : 40mM tris (pH 8.0), 12 mM MgCl 2 , 5 mM DTT, 1 mM spermidine, 0.002% triton® 
X-100, 10% methanol, 4% PEG, and 1 mM LiCl. 

15 

Screening of Modified nucleotide triphosphates with Mutant T7 RNA Polymerase 

Modified nucleotide triphosphates were tested in buffers 1 through 6 at two different 
temperatures (25 and 37°C). Buffers 1-6 tested at 25°C were designated conditions 1-6 
and buffers 1-6 tested at 37°C were designated conditions 7-12 (Table 47). In each 

20 condition, Y639F mutant T7 polymerase (Sousa and Padilla, supra) (0.3-2 mg/20 ml 
reaction), NTP's (2 mM each), DNA template (10 pmol), inorganic pyrophosphatase 
(5U/ml) and o> 32 P NTP (0.8 mCi/pmol template) were combined and heated at the 
designated temperatures for 1-2 hours. The radiolabeled NTP used was different from the 
modified triphosphate being testing. The samples were resolved by polyacrylamide gel 

25 electrophoresis. Using a Phosphorlmager (Molecular Dynamics, Sunnyvale, CA), the 
amount of full-length transcript was quantified and compared with an all-RNA control 
reaction. The data is presented in Table 48; results in each reaction are expressed as a 
percent compared to the all-ribonucleotide triphosphate (rNTP) control. The control was 
run with the mutant T7 polymerase using commercially available polymerase buffer 

30 (Boehringer Mannheim, Indianapolis, IN). 
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Incorporation of Modified NTP's using Wild-type T7 RNA polymerase 

Bacteriophage T7 RNA polymerase was purchased from Boehringer Mannheim at 
0.4 U/uL concentration. Applicant used the commercial buffer supplied with the enzyme 
and 0.2 uCi alpha- 32 P NTP in a 50 uL reaction with nucleotides triphosphates at 2 mM 
5 each. The template was a double-stranded PCR fragment, which was used in previous 
screens. Reactions were carried out at 37°C for 1 hour. Ten uL of the sample was run on 
a 7.5% analytical PAGE and bands were quantitated using a Phosphorlmager. Results are 
calculated as a comparison to an "all ribo" control (non-modified nucleotide triphosphates) 
and the results are in Table 49. 

10 Incorporation of Multiple Modified nucleotide triphosphates Into Oligonucleotides 
Combinations of modified nucleotide triphosphates were tested with the 
transcription protocol described above, to determine the rates of incorporation of two or 
more of these triphosphates. Incorporation of 2'-Deoxy-2'-(L-histidine) amino uridine (2'- 
his-NH 2 -UTP) was tested with unmodified cytidine nucleotide triphosphates, rATP and 

1 5 rGTP in reaction condition number 9. The data is presented as a percentage of 

incorporation of modified NTP's compared to the all rNTP control and is shown in Table 
50a. 

Two modified cytidines (2'-NH 2 -CTP or 2'dCTP) were incorporated along with 2'- 
his-NH 2 -UTP with identical efficiencies. 2'-his-NH 2 -UTP and 2'-NH 2 -CTP were then 
20 tested with various unmodified and modified adenosine triphosphates in the same buffer 
(Table 50b). The best modified adenosine triphosphate for incorporation with both 2'-his- 
NH 2 -UTP and 2'-NH 2 -CTP was 2'-NH 2 -DAPTP. 



Optimization of Reaction conditions for Incorporation of Modified Nucleotide 
25 Triphosphate 

The combination of 2'-his-NH 2 -UTP, 2'-NH 2 -CTP, 2'-NH 2 -DAP, and rGTP was 
tested in several reaction conditions (Table 51) using the incorporation protocol described 
above. The results demonstrate that of the buffer conditions tested, incorporation of these 
modified nucleotide triphosphates occur in the presence of both methanol and LiCl. 
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Selection of Novel Enzymatic nucleic acid molecule Motifs using 2'-deoxv-2 , amino 
Modified GTP and CTP 

For selection of new enzymatic nucleic acid molecule motifs, pools of enzymatic 
nucleic acid molecules were designed to have two substrate binding arms (5 and 16 
5 nucleotides long) and a random region in the middle. The substrate has a biotin on the 5' 
end, 5 nucleotides complementary to the short binding arm of the pool, an unpaired G (the 
desired cleavage site), and 16 nucleotides complementary to the long binding arm of the 
pool. The substrate was bound to column resin through an avidin-biotin complex. The 
general process for selection is shown in Figure 11. The protocols described below 
1 0 represent one possible method that may be utilized for selection of enzymatic nucleic acid 
molecules and are given as a non-limiting example of enzymatic nucleic acid molecule 
selection with combinatorial libraries. 

Construction of Libraries: 
1 5 The oligonucleotides listed below were synthesized by Operon Technologies 

(Alameda, CA). Templates were gel purified and then run through a Sep-Pak™ cartridge 
(Waters, Millford, MA) using the manufacturers protocol. Primers (MST3, MST7c, 
MST3del) were used without purification. 
Primers: 

20 MST3 (30 mer): 5 '- CAC TTA GCA TTA ACC CTC ACT AAA GGC CGT-3 ' 

MST7c (33 mer): 5'-TAA TAC GAC TCA CTA TAG GAA AGG TGT GCA ACC-3' 
MST3del (18 mer): 5'-ACC CTC ACT AAA GGC CGT-3' 

Templates: 

MSN60c (93 mer): 5'-ACC CTC ACT AAA GGC CGT (N)eo GGT TGC ACA CCT 
25 TTG-3' 

MSN40c (73 mer): 5'-ACC CTC ACT AAA GGC CGT (N) 40 GGT TGC ACA CCT 
TTG-3' 

MSN20c (53 mer): 5'-ACC CTC ACT AAA GGC CGT (N) 20 GGT TGC ACA CCT 
TTG-3' 

30 N60 library was constructed using MSN60c as a template and MST3/MST7c as 

primers. N40 and N20 libraries were constructed using MSN40c (or MSN20c) as template 
and MST3del/MST7c as primers. 
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Single-stranded templates were converted into double-stranded DNA by the 
following protocol: 5 nmol template, 10 nmol each primer, in 10 ml reaction volume 
using standard PCR buffer, dNTP's, and taq DNA polymerase (all reagents from 
Boerhinger Mannheim). Synthesis cycle conditions were 94°C, 4 minutes; (94°C, 1 
5 minute; 42°C, 1 minute; 72°C, 2 minutes) x 4; 72°C, 10 minutes. Products were 

checked on agarose gel to confirm the length of each fragment (N60=123 bp, N40=91 bp, 
N20=71 bp) and then were phenol/chloroform extracted and ethanol precipitated. The 
concentration of the double-stranded product was 25 uM. 

Transcription of the initial pools was performed in a 1 ml volume comprising: 500 

1 0 pmol double-stranded template (3 x 10 14 molecules), 40 mM tris-HCl (pH 8.0), 12 mM 
MgCl 2 , 1 mM spermidine, 5 mM DTT, 0.002% triton X-100, 1 mM LiCl, 4% PEG 8000, 
10% methanol, 2 mM ATP (Pharmacia), 2 mM GTP (Pharmacia), 2 mM 2'-deoxy-2'- 
amino-CTP (USB), 2 mM 2'-deoxy-2'-amino-UTP (USB), 5 U/ml inorganic 
pyrophosphatase (Sigma), 5 U/ul T7 RNA polymerase (USB; Y639F mutant was used in 

1 5 some cases at 0.1 mg/ml (Sousa and Padilla, supra)), 37°C, 2 hours. Transcribed libraries 
were purified by denaturing PAGE (N60=106 ntds, N40=74, N20=54) and the resulting 
product was desalted using Sep-Pak™ columns and then ethanol precipitated. 



Initial column-Selection : 

20 The following biotinylated substrate was synthesized using standard protocols 

(Usman et al, 1987 J. Am. Chem. Soc, 109, 7845; Scaringe et al, 1990 Nucleic Acids 
Res., 18, 5433; and Wincott et al, 1995 Nucleic Acids Res. 23, 2677-2684): 

5'-biotin-C18 spacer-GCC GUG GGU UGC ACA CCU UUC C-C18 spacer-thiol- 
modifier C6 S-S-inverted abasic-3 ' 

25 Substrate was purified by denaturing PAGE and ethanol precipitated. 1 0 nmol of 

substrate was linked to a NeutrAvidin™ column using the following protocol: 400 ul 
UltraLink Immobilized NeutrAvidin™ slurry (200 ul beads, Pierce, Rockford, IL) were 
loaded into a polystyrene column (Pierce). The column was washed twice with 1 ml of 
binding buffer (20 mM NaP0 4 (pH 7.5), 1 50 mM NaCl) and then capped off (i.e., a cap 

30 was put on the bottom of the column to stop the flow). 200 ul of the substrate suspended 
in binding buffer was applied and allowed to incubate at room temperature for 30 minutes 
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with occasional vortexing to ensure even linking and distribution of the solution to the 
resin. After the incubation, the cap was removed and the column was washed with 1 ml 
binding buffer followed by 1 ml column buffer (50 mM tris-HCL (pH 8.5), 100 mM NaCl, 
50 mM KC1). The column was then ready for use and capped off. 1 nmol of the initial 
5 pool RNA was loaded on the column in a volume of 200 ul column buffer. It was allowed 
to bind the substrate by incubating for 30 minutes at room temperature with occasional 
vortexing. After the incubation, the cap was removed and the column was washed twice 
with 1 ml column buffer and capped off. 200 ul of elution buffer (50 mM tris-HCl (pH 
8.5), 100 mM NaCl, 50 mM KC1, 25 mM MgCl 2 ) was applied to the column followed by 
10 30 minute incubation at room temperature with occasional vortexing. The cap was 
removed and four 200 ul fractions were collected using elution buffer. 

Second column (counter selection) : 

A diagram for events in the second column is generally shown in Figure 12 and 
1 5 substrate oligonucleotide used is shown below: 

5'-GGU UGC ACA CCU UUC C-C18 spacer-biotin-inverted abasic-3' 
This column substrate was linked to UltraLink NeutrAvidin™ resin as previously 
described (40 pmol) which was washed twice with elution buffer. The eluent from the 
first column purification was then run on the second column. The use of this column 
20 allowed for binding of RNA that non-specifically diluted from the first column, while 
RNA that performed a catalytic event and had product bound to it, flowed through the 
second column. The fractions were ethanol precipitated using glycogen as carrier and 
rehydrated in sterile water for amplification. 

25 Amplification: 

RNA and primer MST3 (10-100 pmol) were denatured at 90°C for 3 minutes in 
water and then snap-cooled on ice for one minute. The following reagents were added to 
the tube (final concentrations given): IX PCR buffer (Boerhinger Mannheim), 1 mM 
dNTP's (for PCR, Boerhinger Mannheim), 2 U/ul RNase-Inhibitor (Boerhinger 

30 Mannheim), 10 U/pl Superscript™ II Reverse Transcriptase (BRL). The reaction was 
incubated for 1 hour at 42°C, then at 95 C C for 5 minutes in order to destroy the 
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Superscript™. The following reagents were then added to the tube to increase the volume 
five-fold for the PCR step (final concentrations/amounts given): MST7c primer (10-100 
pmol, same amount as in RT step), IX PCR buffer, taq DNA polymerase (0.025-0.05 
U/ul, Boerhinger Mannheim). The reaction was cycled as follows: 94°C, 4minutes; 
5 (94°C, 30s; 42-54°C, 30s; 72°C, lminute) x 4-30 cycles; 72°C, 5minutes; 30°C, 30 
minutes. Cycle number and annealing temperature were decided on a round by round 
basis. In cases where heteroduplex was observed, the reaction was diluted five-fold with 
fresh reagents and allowed to progress through 2 more amplification cycles. Resulting 
products were analyzed for size on an agarose gel (N60=123 bp, N40=103 bp, N20=83 bp) 
1 0 and then ethanol precipitated. 

Transcriptions: 

Transcription of amplified products was done using the conditions described above 
with the following modifications: 10-20% of the amplification reaction was used as 
1 5 template, reaction volume was 100-500 ul, and the products sizes varied slightly 
(N60-106 ntds, N40=86, N20=66). A small amount of 32 P-GTP was added to the 
reactions for quantitation purposes. 

Subsequent rounds: 

20 Subsequent rounds of selection used 20 pmols of input RNA and 40 pmol of the 22 

nucleotide substrate on the column. 

Activity of pools: 

Pools were assayed for activity under single turnover conditions every three to four 
25 rounds. Activity assay conditions were as follows: 50 mM tris-HCl (pH 8.5), 25 mM 
MgCl 2 , 100 mM NaCl, 50 mM KC1, trace 32 P-labeled substrate, 10 nM RNA pool. 2X 
pool in buffer and, separately, 2X substrate in buffer were incubated at 90°C for 3 minutes, 
then at 37°C for 3 minutes. Equal volume 2X substrate was then added the 2X pool tube 
(t=0). Initial assay time points were taken at 4 and 24 hours: 5 ul was removed and 
30 quenched in 8 pi cold Stop buffer (96% formamide, 20 mM EDTA, 0.05% bromphenyl 
blue/xylene cyanol). Samples were heated 90°C, 3 minutes, and loaded on a 20% 
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sequencing gel. Quantitation was performed using a Molecular Dynamics Phosphorimager 
and ImageQuaNT™ software. The data is shown in Table 52. 

Samples from the pools of oligonucleotide were cloned into vectors and sequenced 
using standard protocols (Sambrook et al, Molecular Cloning: A Laboratory Manual, 
5 Cold Spring Harbor Laboratory Press). The enzymatic nucleic acid molecules were 

transcribed from a representative number of these clones using methods described in this 
application. Individuals from each pool were tested for RNA cleavage from N60 and N40 
by incubating the enzymatic nucleic acid molecules from the clones with 5/16 substrate in 
2mM MgC12, pH 7.5, lOmM KC1 at 37°C. The data in Table 54 shows that the enzymatic 
1 0 nucleic acid molecules isolated from the pool are individually active. 



Kinetic Activity : 

Kinetic activity of the enzymatic nucleic acid molecule shown in Table 54, was 
determined by incubating enzymatic nucleic acid molecule (10 nM) with substrate in a 
1 5 cleavage buffer (pH 8.5, 25 mM MgCl 2 , 1 00 mM NaCl, 50 mM KC1) at 37°C. 



Magnesium Dependence: 

Magnesium dependence of round 15 of N20 was tested by varying MgCl 2 while 
other conditions were held constant (50 mM tris [pH 8.0], 100 mM NaCl, 50 mM KC1, 
20 single turnover, 10 nM pool). The data is shown in Table 55, which demonstrates 
increased activity with increased magnesium concentrations. 

Selection of Novel Enzymatic nucleic acid molecule Motifs using 2'-Deoxv-2'-(N- 
histidvl) amino UTP. 2'-Fluoro-ATP. and 2'-deoxv-2'-amino CTP and GTP 

25 The method used for selection of novel enzymatic nucleic acid molecule motifs 

using 2'-deoxy-2'amino modified GTP and CTP was repeated using 2'-Deoxy-2'-(A r - 
histidyl) amino UTP, 2'-Fluoro-ATP, and 2'-deoxy-2'-amino CTP and GTP. However, 
rather than causing cleavage on the initial column with MgCl 2 , the initial random 
modified-RNA pool was loaded onto substrate-resin in the following buffer; 5 mM 

30 NaOAc pH 5.2, 1 M NaCl at 4° C. After ample washing, the resin was moved to 22 0 C 
and the buffer switch 20 mM HEPES pH 7.4, 140mMKCl, 10 mM NaCl, lmMCaCl 2 , 
1 mM MgCl 2 . In one selection of N60 oligonucleotides, no divalent cations (MgCl 2 , 
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CaCl 2 ) was used. The resin was incubated for 10 minutes to allow reaction and the eluant 
collected. 

The enzymatic nucleic acid molecule pools were capable of cleaving 1-3% of the 
present substrate even in the absence of divalent cations, the background (in the absence of 
5 modified pools) was 0.2 - 0.4 %. 

Synthesis of 5-substituted 2 , -modified nucleosides 

When designing monomeric nucleoside triphosphates for selection of therapeutic 
catalytic RNAs, one has to take into account nuclease stability of such molecules in 

1 0 biological sera. A common approach to increase RNA stability is to replace the sugar 2'- 
OH group with other groups like 2'-fluoro, 2'-C>-methyl or 2'-amino. Fortunately such 2'- 
modified pyrimidine 5 'triphosphates are shown to be substrates for RNA polymerases. 
(Aurup, H.; Williams, D.M.; Eckstein, F. Biochemistry 1992, 31, 9637; and Padilla, R.; 
Sousa, R. Nucleic Acids Res. 1999, 27, 1561.) On the other hand it was shown that 

1 5 variety of substituents at pyrimidine 5-position is well tolerated by T7 RNA polymerase 
(Tarasow, T.M.; Eaton, B.E. Biopolymers 1998, 48, 29), most likely because the natural 
hydrogen-bonding pattern of these nucleotides is preserved. We have chosen 2'-fluoro and 
2'-0-methyl pyrimidine nucleosides as starting materials for attachment of different 
functionalities to the 5-position of the base. Both rigid (alkynyl) and flexible (alkyl) 

20 spacers are used. The choice of imidazole, amino and carboxylate pendant groups is based 
on their ability to act as general acids, general bases, nucleophiles and metal ligands, all of 
which can improve the catalytic effectiveness of selected nucleic acids. Figures 21-24 
relate to the synthesis of these compounds. 

2'-0-methyluridine was 3',5'-bis-acetylated using acetic anhydride in pyridine and 

25 then converted to its 5-iodo derivative la using tyceric ammonium nitrate reagent 

(Asakura, J.; Robins, M.J. J. Org. Chem. 1990, 55, 4928) (Scheme 1). Both reactions 
proceeded in a quantitative yield and no chromatographic purifications were needed. 
Coupling between 1 and N-trifiuoroacetyl propargylamine using copper(I) iodide and 
tetrakis(triphenylphosphine)palladium(0) catalyst as described by Hobbs (Hobbs, F.W., Jr. 

30 J. Org. Chem. 1989, 54, 3420) yielded 2a in 89% yield. Selective O-deacylation with 
aqueous NaOH afforded 3a which was phosphorylated with POCl 3 /triethylphosphate 
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(TEP) in the presence of l,8-bis(dimethylamino)naphthalene (Proton-Sponge) (Method A) 
(Kovacz, T; Otvos, L. Tetrahedron Lett. 1988, 29, 4525). The intermediate nucleoside 
phosphorodichloridate was condensed in situ with tri-w-butylammonium pyrophosphate. 
At the end, the Af-TFA group was removed with concentrated ammonia. 5 '- Triphosphate 
5 was purified on Sephadex® DEAE A-25 ion exchange column using a linear gradient of 
0.1-0.8M triethylammonium bicarbonate (TEAB) for elution. Traces of contaminating 
inorganic pyrophosphate are removed using C-18 RP HPLC to afford analytically pure 
material. Conversion into Na-salt was achieved by passing the aqueous solution of 
triphosphate through Dowex 50WX8 ion exchange resin in Na + form to afford 4a in 45% 

1 0 yield. When Proton-Sponge was omitted in the first phosphorylation step, yields were 
reduced to 10-20%. Catalytic hydrogenation of 3a yielded 5-aminopropyl derivative 5a 
which was phosphorylated under conditions identical to those described for propynyl 
derivative 3a to afford triphosphate 6a in 50% yield. 

For the preparation of imidazole derivatized triphosphates 9a and 11a, we 

1 5 developed an efficient synthesis of iV-diphenylcarbamoyl 4-imidazoleacetic acid 
(ImAA DPC ): Transient protection of carboxyl group as TMS-ester using TMS- 
Cl/pyridine followed by DPC-C1 allowed for a clean and quantitative conversion of 4- 
imidazoleacetic acid (ImAA) to its Af-DPC protected derivative. 

Complete deacylation of 2a afforded 5-(3-aminopropynyl) derivative 8a which 

20 was condensed with 4-imidazoleacetic acid in the presence of 1 -(3- 

dimethylaminopropyl)-3-ethylcarbodiimide (EDC) to afford 9a in 68% yield. Catalytic 
hydrogenation of 8a yielded 5-(3-aminopropyl) derivative 10a which was condensed 
with ImAA DPC to yield conjugate 11a in 32% yield. Yields in these couplings were 
greatly improved when 5' -OH was protected with DMT group (not shown) thus 

25 efficiently preventing undesired 5 '-Oesterification. Both 9a and 11a failed to yield 
triphosphate products in reaction with POCl 3 /TEP/Proton-Sponge. 

On the contrary, phosphorylation of 3'-0-acetylated derivatives 12a and 13a using 
2-chloro-4//-l,3,2-benzodioxaphosphorin-4-one followed by pyrophosphate addition 
and oxidation (Method B, Scheme 2; Ludwig, J., Eckstein, F., J. Org. Chem. 1989, 54, 

30 631) afforded the desired triphosphates 14a and 15a in 57% yield, respectively. 
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2'-Deoxy-2'-fluoro nucleoside 5 '-triphosphates containing amino- (4b, 6b) and 
imidazole- (14b, 15b) linked groups were synthesized in a manner analogous to that 
described for the preparation of 2'-<9-methyl nucleoside 5 '-triphosphates (Schemes 1 and 
2). Again, only Ludwig-Eckstein's phosphorylation worked for the preparation of 4- 
5 imidazoleacetyl derivatized triphosphates. 

It is worth noting that when "one-pot-two-steps" phosphorylation reaction 
(Kovacz, T; Otvos, L. Tetrahedron Lett. 1988, 29, 4525) of 5b was quenched with 40% 
aqueous methylamine instead of TEAB or H2O, the y-amidate 7b was generated as the 
only detectable product. Similar reaction was reported recently for the preparation of the 

10 y-amidate of pppA2'p5'A2'p5'A. 12 

Carboxylate group was introduced into 5-position of uridine both on the nucleoside 
level and post-synthetically (Method C) (Scheme 3). 5-Iodo-2'-deoxy-2'-fluorouridine (16) 
was coupled with methyl acrylate using modified Heck reaction 13 to yield 17 in 85% yield. 
5'-0-Dimethoxytritylation, followed by in situ 3'-Oacetylation and subsequent 

15 detritylation afforded 3 '-protected derivative 18. Phosphorylation using 2-chloro-4#- 
l,3,2-benzodioxa-phosphorin-4-one followed by pyrophosphate addition and oxidation 
(Ludwig, J.; Eckstein, F. J. Org. Chem. 1989, 54, 631) afforded the desired triphosphate 
in 54% yield. On the other hand, 5-(3-aminopropyl)uridine 5 '-triphosphate 6b was coupled 
with A'-hydroxysuccinimide ester of Fmoc-Asp-OFm to afford, after removal of Fmoc and 

20 Fm groups with diethylamine, the desired aminoacyl conjugate 20 in 50% yield. 

Cytidine derivatives comprising 3-aminopropyl and 3(JV-succinyl)aminopropyl 
groups were synthesized according to Scheme 4. Peracylated 5-(3-aminopropynyl)uracil 
derivative 2b is reduced using catalytic hydrogenation and then converted in seven steps 
and 5% overall yield into 3'-acetylated cytidine derivative 25. This synthesis was plagued 

25 by poor solubility of intermediates and formation of the N 4 -cyclized byproduct during 
ammonia treatment of the 4-triazolyl intermediate. Phosphorylation of 25 as described in 
reference 1 1 yielded triphosphate 26 and N 4 -cyclized product 27 in 1 : 1 ratio. They were 
easily separated on Sephadex DEAE A-25 ion exchange column using 0.1-0.8M TEAB 
gradient. It appears that under basic conditions the free primary amine can displace any 

30 remaining intact 4-NHBz group leading to the cyclized product. This is similar to 
displacement of 4-triazolyl group by primary amine as mentioned above. 
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We reasoned that utilization of A^-unprotected cytidine will solve this problem. 
This lead to an improved synthesis of 26: Iodination of 2'-deoxy-2'-fluorocytidine (28) 
provided the 5-iodo derivative 29 in 58% yield. This compound was then smoothly 
converted into 5-(3-aminopropynyl) derivative 30. Hydrogenation afforded 5-(3- 
5 aminopropyl) derivative 31 which was phosphorylated directly with POCl 3 / PPi to afford 
26 in 37% yield. Coupling of the 5 '-triphosphate 26 with succinic anhydride yielded 
succinylated derivative 32 in 36% yield. 

Synthesis of 5-Imidazoleacetic acid 2'-deoxv-5'-triphosphate uridine 

5-dintrophenylimidazoleacetic acid 2'-deoxy uridine nucleoside (80 mg) was 

1 0 dissolved in 5 ml of triethylphosphate while stirring under argon, and the reaction mixture 
was cooled to 0°C. Phosphorous oxychloride (1.8 eq, 22 ml) was added to the reaction 
mixture at 0°C, three more aliquots were added over the course of 48 hours at room 
temperature. The reaction mixture was then diluted with anhydrous MeCN (5 ml) and 
cooled to 0°C, followed by the addition of tributylamine (0.65 ml) and tributylarnmonium 

1 5 pyrophosphate (4.0 eq, 0.24 g). After 45 minutes, the reaction was quenched with 10 ml 
aq. methyl amine for four hours. After co-evaporation with MeOH (3x), purified material 
on DEAE Sephadex followed by RP chromatography to afford 15 mg of triphosphate. 
Synthesis of 2'-(iV-lvsyl)-amino-2'-deoxv-cvtidine Triphosphate 

2'-(N-lysyl)-amino-2'-deoxy cytidine (0.180 g, 0.22 mmol) was dissolved in triethyl 

20 phosphate (2.00 ml) under Ar. The solution was cooled to 0 °C in an ice bath. 

Phosphorus oxychloride (99.999%, 3 eq., 0.0672 mL) was added to the solution and the 
reaction was stirred for two hours at 0 °C. Tributylarnmonium pyrophosphate (4 eq., 0.400 
g) was dissolved in 3.42 mL of acetonitrile and tribuytylamine (0.165 mL). Acetonitrile (1 
mL) was added to the monophosphate solution followed by the pyrophosphate solution 

25 which was added dropwise. The resulting solution was clear. The reaction was allowed to 
warm up to room temperature. After stirring for 45 minutes, methylamine (5 mL) was 
added and the reaction and stirred at room temperature for 2 hours. A biphasic mixture 
appeared (little beads at the bottom of the flask). TLC (7:1:2 iPrOH:NH40H:H20) 
showed the appearance of triphosphate material. The solution was concentrated, dissolved 

30 in water and loaded on a newly prepared DEAE Sephadex A-25 column. The column was 
washed with a gradient up to 0.6 M TEAB buffer and the product eluted off in fractions 
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90-95. The fractions were analyzed by ion exchange HPLC. Each fraction showed one 
triphosphate peak that eluted at -4.000 minutes. The fractions were combined and 
pumped down from methanol to remove buffer salt to yield 15.7 mg of product. 
Synthesis of 2'-deoxy-2'-(L-histidine)amino Cytidine Triphosphate 
5 2'-[//-Fmoc, Ntmid -dinitrophenyl-histidyl]amino-2' -cytidine (0.3 10 g, 4.04 mmol) 

was dissolved in triethyl phosphate (3 ml) under Ar. The solution was cooled to 0 °C. 
Phosphorus oxychloride (1 .8 eq., 0.068 mL) was added to the solution and stored 
overnight in the freezer. The next moming TLC (10% MeOH in CH 2 C1 2 ) showed 
significant starting material, one more equivalent of POCI3 was added. After two hours, 

1 0 TLC still showed starting material. Tributylamine (0.303 mL) and Tributylammonium 
pyrophosphate (4 eq., 0.734 g) dissolved in 6.3 mL of acetonitrile (added dropwise) were 
added to the monophosphate solution. The reaction was allowed to warm up to room 
temperature. After stirring for 15 min, methylamine (10 mL) was added at room 
temperature and stirring continued for 2 hours. TLC (7:1:2 iPrOH:NH40H:H20) showed 

1 5 the appearance of triphosphate material. The solution was concentrated, dissolved in water 
and loaded on a DEAE Sephadex A-25 column. The column was washed with a gradient 
up to 0.6 M TEAB buffer and the product eluted off in fractions 170-179. The fractions 
were analyzed by ion exchange HPLC. Each fraction showed one triphosphate peak that 
eluted at ~6.77 minutes. The fractions were combined and pumped down from methanol 

20 to remove buffer salt to afford 1 7 mg of product. 

Screening for Novel Enzymatic nucleic acid molecule Motifs Using Modified NTPs (Class 
I Motif) 

Our initial pool contained 3 x 10 14 individual sequences of 2'-amino-dCTP/2'- 
amino-dUTP RNA. We optimized transcription conditions in order to increase the amount 

25 of RNA product by inclusion of methanol and lithium chloride. 2'-amino-2'- 

deoxynucleotides do not interfere with the reverse transcription and amplification steps of 
selection and confer nuclease resistance. We designed the pool to have two binding arms 
complementary to the substrate, separated by the random 40 nucleotide region. The 16- 
mer substrate had two domains, 5 and 10 nucleotides long, that bind the pool, separated by 

30 an unpaired guanosine. On the 5 ' end of the substrate was a biotin attached by a C 1 8 

linker. This enabled us to link the substrate to a NeutrAvidin™ resin in a column format. 
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The desired reaction would be cleavage at the unpaired G upon addition of magnesium 
cofactor followed by dissociation from the column due to instability of the 5 base pair 
helix. A detailed protocol follows: 

Enzymatic nucleic acid molecule Pool Prep: The initial pool DNA was prepared by 
5 converting the following template oligonucleotides into double-stranded DNA by filling in 
with taq polymerase. (template=5'-ACC CTC ACT AAA GGC CGT (N) 40 GGT TGC 
ACA CCT TTC-3'; primer 1=5'- CAC TTA GCA TTA ACC CTC ACT AAA GGC 
CGT-3'; primer 2=5'-TAA TAC GAC TCA CTA TAG GAA AGG TGT GCA ACC-3'.) 
All DNA oligonucleotides were synthesized by Operon technologies. Template oligos 

1 0 were purified by denaturing PAGE and Sep-pak chromatography columns (Waters). RNA 
substrate oligos were using standard solid phase chemistry and purified by denaturing 
PAGE followed by ethanol precipitation. Substrates for in vitro cleavage assays were 5'- 
end labeled with gamma- 32 P-ATP and T4 polynucleotide kinase followed by denaturing 
PAGE purification and ethanol precipitation. 

15 5 nmole of template, 10 nmole of each primer and 250 U taq polymerase were 

incubated in a 10 ml volume with IX PCR buffer (10 mM tris-HCl (pH 8.3), 1.5 mM 
MgCl 2 , 50 mM KC1) and 0.2 mM each dNTP as follows: 94°C, 4 minutes; (94°C, 1 min; 
42°C, 1 min; 72°C, 2 min) through four cycles; and then 72°C, for 10 minutes. The 
product was analyzed on 2% Separide™ agarose gel for size and then was extracted twice 

20 with buffered phenol, then chloroform-isoamyl alcohol, and ethanol precipitated. The 
initial RNA pool was made by transcription of 500 pmole (3 x 10 14 molecules) of this 
DNA as follows. Template DNA was added to 40 mM tris-HCl (pH 8.0), 12 mM MgCl 2 , 
5 mM dithiothreitol (DTT), 1 mM spermidine, 0.002% triton X-100, 1 mM LiCl, 4% 
PEG-8000, 10% methanol, 2 mM ATP, 2 mM GTP, 2 mM 2'-amino-dCTP, 2 mM 2'- 

25 amino-dUTP, 5 U/ml inorganic pyrophosphatase, and 5 U/ul T7 RNA polymerase at room 
temperature for a total volume of 1 ml. A separate reaction contained a trace amount of 
alpha- 32 P-GTP for detection. Transcriptions were incubated at 37°C for 2 hours followed 
by addition of equal volume STOP buffer (94% formamide, 20 mM EDTA, 0.05% 
bromophenol blue). The resulting RNA was purified by 6% denaturing PAGE gel, Sep- 

30 pak™ chromatography, and ethanol precipitated. 

INITIAL SELECTION: 2 nmole of 16 mer 5'-biotinylated substrate (5'-biotin-C18 
linker-GCC GUG GGU UGC ACA C-3') was linked to 200 ul UltraLink Immobilized 
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NeutrAvidin™ resin (400 pi slurry, Pierce) in binding buffer (20 mM NaP0 4 (pH 7.5), 150 
mM NaCl) for 30 minutes at room temperature. The resulting substrate column was 
washed with 2 ml binding buffer followed by 2 ml column buffer (50 mM tris-HCl (pH 
8.5), 100 mM NaCl, 50 mM KC1). The flow was capped off and 1000 pmole of initial 
5 pool RNA in 200 pi column buffer was added to the column and incubated 30 minutes at 
room temperature. The column was uncapped and washed with 2 ml column buffer, then 
capped off. 200 pi elution buffer (=column buffer + 25 mM MgCh) was added to the 
column and allowed to incubate 30 minutes at room temperature. The column was 
uncapped and eluent collected followed by three 200 ul elution buffer washes. The 

1 0 eluent/washes were ethanol precipitated using glycogen as carrier and rehydrated in 50 ul 
sterile H 2 0. The eluted RNA was amplified by standard reverse transcription/PCR 
amplification techniques. 5-31 pi RNA was incubated with 20 pmol of primer 1 in 14 ul 
volume 90° for 3 min then placed on ice for 1 minute. The following reagent were added 
(final concentrations noted): IX PCR buffer, 1 mM each dNTP, 2 U/ul RNase Inhibitor, 

15 10 U/ul Superscript™ II reverse transcriptase. The reaction was incubated 42° for 1 hour 
followed by 95° for 5 min in order to inactivate the reverse transcriptase. The volume was 
then increased to 100 pi by adding water and reagents for PCR: IX PCR buffer, 20 pmol 
primer 2, and 2.5 U taq DNA polymerase. The reaction was cycled in a Hybaid 
thermocycler: 94°, 4 min; (94°C, 30 sec; 54°C, 30 sec; 72°C, 1 min) X 25; 72°C, 5 min. 

20 Products were analyzed on agarose gel for size and ethanol precipitated. One-third to one- 
fifth of the PCR DNA was used to transcribe the next generation, in 1 00 pi volume, as 
described above. Subsequent rounds used 20 pmol RNA for the column with 40 pmol 
substrate. 

TWO COLUMN SELECTION: At generation 8 (G8), the column selection was 
25 changed to the two column format. 200 pmoles of 22 mer 5'-biotinylated substrate (5'- 
biotin-C18 linker-GCC GUG GGU UGC ACA CCU UUC C-C18 linker-thiol modifier C6 
S-S-inverted abasic-3') was used in the selection column as described above. Elution was 
in 200 pi elution buffer followed by a 1 ml elution buffer wash. The 1200 pi eluent was 
passed through a product trap column by gravity. The product trap column was prepared 
30 as follows: 200 pmol 16 mer 5'-biotinylated "product" (5'-GGU UGC ACA CCU UUC 
C-C18 linker-biotin-3') was linked to the column as described above and the column was 
equilibrated in elution buffer. Eluent from the product column was precipitated as 
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previously described. The products were amplified as above only with 2.5-fold more 
volume and 100 pmol each primer. 100 pi of the PCR reaction was used to do a cycle 
course; the remaining fraction was amplified the minimal number of cycles needed for 
product. After 3 rounds (Gil), there was visible activity in a single turnover cleavage 
5 assay. By generation 13, 45% of the substrate was cleaved at 4 hours; kobs of the pool was 
0.037 min' 1 in 25 mM MgCl 2 . We subcloned and sequenced generation 13; the pool was 
still very diverse. Since our goal was a enzymatic nucleic acid molecule that would work 
in a physiological environment, we decided to change selection pressure rather than 
exhaustively catalog G13. 

1 0 Reselection of the N40 pool was started from G 1 2 DNA. Part of the G 1 2 DNA was 

subjected to hypermutagenic PCR (Vartanian et al., 1996, Nucleic Acids Research 24, 
2627-2631) to introduce a 10% per position mutation frequency and was designated 
N40H. At round 19, part of the DNA was hypermutagenized again, giving N40M and 
N40HM (a total of 4 parallel pools). The column substrates remained the same; buffers 

1 5 were changed and temperature of binding and elution was raised to 37°C. Column buffer 
was replaced by physiological buffer (50 mM tris-HCl (pH 7.5), 140 mM KC1, 10 mM 
NaCl) and elution buffer was replaced by 1 mM Mg buffer (physiological buffer + 1 mM 
MgCh). Amount of time allowed for the pool to bind the column was eventually reduced 
to 10 min and elution time was gradually reduced from 30 min to 20 sec. Between rounds 

20 18 and 23, ko bs for the N40 pool stayed relatively constant at 0.035-0.04 min" 1 . Generation 
22 from each of the 4 pools was cloned and sequenced. 

CLONING AND SEQUENCING: Generations 13 and 22 were cloned using 
Novagen's Perfectly Blunt™ Cloning kit (pT7Blue-3 vector) following the kit protocol. 
Clones were screened for insert by PCR amplification using vector-specific primers. 

25 Positive clones were sequenced using ABI Prism 7700 sequence detection system and 
vector-specific primer. Sequences were aligned using Mac Vector software; two- 
dimensional folding was performed using Mulfold software ( Zuker,, 1989, Science 244, 
48-52; Jaeger et al, 1989, Biochemistry 86, 7706-7710; Jaeger et al, 1989, R. F. Doolittle 
ed., Methods in Enzymology, 183, 281-306). Individual clone transcription units were 

30 constructed by PCR amplification with 50 pmol each primer 1 and primer 2 in IX PCR 
buffer, 0.2 mM each dNTP, and 2.5 U of taq polymerase in 100 pi volume cycled as 
follows: 94°C, 4 min; (94°C, 30 sec; 54°C, 30 sec; 72°C, 1 min) X 20; 72°C, 5 min. 
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Transcription units were ethanol precipitated, rehydrated in 30 ul H20, and 10 ul was 
transcribed in 100 ul volume and purified as previously described. 

Thirty-six clones from each pool were sequenced and were found to be variations of 
the same consensus motif. Unique clones were assayed for activity in 1 mM MgC^ and 
5 physiological conditions; nine clones represented the consensus sequence and were used in 
subsequent experiments. There were no mutations that significantly increased activity; 
most of the mutations were in regions believed to be duplex, based on the proposed 
secondary structure. In order to make the motif shorter, we deleted the 3 '-terminal 25 
nucleotides necessary to bind the primer for amplification. The measured rates of the full 

1 0 length and truncated molecules were both 0.04 min 1 ; thus we were able reduce the size of 
the motif from 86 to 61 nucleotides. The molecule was shortened even further by 
truncating base pairs in the stem loop structures as well as the substrate recognition arms 
to yield a 48 nucleotide molecule. In addition, many of the ribonucleotides were replaced 
with 2-0-methyl modified nucleotides to stabilize the molecule. An example of the new 

1 5 motif is given in Figure 13. Those of ordinary skill in the art will recognize that the 
molecule is not limited to the chemical modifications shown in the figure and that it 
represents only one possible chemically modified molecule. 

Kinetic Analysis: 

20 Single turnover kinetics were performed with trace amounts of 5'- 32 P-labeled 

substrate and 10-1000 nM pool of enzymatic nucleic acid molecule. 2X substrate in IX 
buffer and 2X pool/enzymatic nucleic acid molecule in IX buffer were incubated 
separately 90° for 3 min followed by equilibration to 37° for 3 min. Equal volume of 2X 
substrate was added to pool/enzymatic nucleic acid molecule at to and the reaction was 

25 incubated at 37°C. Time points were quenched in 1.2 vol STOP buffer on ice. Samples 
were heated to 90°C for 3 min prior to separation on 15% sequencing gels. Gels were 
imaged using a Phosphorlmager and quantitated using ImageQuant™ software (Molecular 
Dynamics). Curves were fit to double-exponential decay in most cases, although some of 
the curves required linear fits. 

30 STABILITY: Serum stability assays were performed as previously described 

(Beigelman et al., 1995,./. Biol. Chem. 270, 25702-25708). 1 jig of 5'- 32 P-labeled 
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synthetic enzymatic nucleic acid molecule was added to 13 ul cold and assayed for decay 
in human serum. Gels and quantitation were as described in kinetics section. 

SUBSTRATE REQUIREMENTS: Table 60 outlines the substrate requirements for 
Class I motif. Substrates maintained Watson-Crick or wobble base pairing with mutant 
5 Class I constructs. Activity in single turnover kinetic assay is shown relative to wild type 
Class I and 22 mer substrate (50 mM Tris-HCL (pH 7.5), 140 mM KC1, 10 mM NaCl, 1 
mM MgCl 2 , 100 nM ribozyme, 5 nM substrate, 37°C). 

RANDOM REGION MUTATION ALIGNMENT: Table 61 outlines the random 
region alignment of 134 clones from generation 22 (1.x = N40, 2.x = N40M, 3.x = N40H, 

10 4.x = N40HM). The number of copies of each mutant is in parenthesis in the table, 
deviations from consensus are shown. Mutations that maintain base pair U19:A34 are 
shown in italic. Activity in single turnover kinetic assay is shown relative to the G22 pool 
rate (50 mM Tris-HCL pH 7.5, 140 mM KC1, 10 mM NaCl, 1 mM MgCl 2 , 100 nM 
ribozyme, trace substrate, 37°C). 

1 5 STEM TRUNCATION AND LOOP REPLACEMENT ANALYSIS: Figure 25 shows 

a representation of Class I ribozyme stem truncation and loop replacement analysis. The 
K rel is compared to a 61 mer Class I ribozyme measured as described above. Figure 26 
shows examples of Class I ribozymes with truncated stem(s) and/or non-nucleotide linker 
replaced loop structures. 

20 

Inhibition of HCV Using Class I (Amberzyme) Motif 

During HCV infection, viral RNA is present as a potential target for enzymatic 
nucleic acid molecule cleavage at several processes: uncoating, translation, RNA 
replication and packaging. Target RNA may be more or less accessible to enzymatic 

25 nucleic acid molecule cleavage at any one of these steps. Although the association 
between the HCV initial ribosome entry site (IRES) and the translation apparatus is 
mimicked in the HCV 5'UTR/luciferase reporter system, these other viral processes are 
not represented in the OST7 system. The resulting RNA/protein complexes associated 
with the target viral RNA are also absent. Moreover, these processes may be coupled in an 

30 HCV-infected cell which could further impact target RNA accessibility. Therefore, we 
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tested whether enzymatic nucleic acid molecules designed to cleave the HCV 5'UTR 
could effect a replicating viral system. 

Recently, Lu and Wimmer characterized an HCV-poliovirus chimera in which the 
poliovirus IRES was replaced by the IRES from HCV (Lu & Wimmer, 1996, Proc. Nail. 
5 Acad. Sci. USA. 93, 1412-1417). Poliovirus (PV) is a positive strand RNA virus like 
HCV, but unlike HCV is non-enveloped and replicates efficiently in cell culture. The 
HCV-PV chimera expresses a stable, small plaque phenotype relative to wild type PV. 

The capability of the new enzymatic nucleic acid molecule motifs to inhibit HCV 
RNA intracellularly was tested using a dual reporter system that utilizes both firefly and 

1 0 Renilla luciferase (Figure 14). A number of enzymatic nucleic acid molecules having the 
new class I motif (Amberzyme) were designed and tested (Table 56). The Amberzyme 
ribozymes were targeted to the 5' HCV UTR region, which when cleaved, would prevent 
the translation of the transcript into luciferase. OST-7 cells were plated at 12,500 cells per 
well in black walled 96-well plates (Packard) in medium DMEM containing 10% fetal 

1 5 bovine serum, 1% pen/strep, and 1% L-glutamine and incubated at 37°C overnight. A 

plasmid containing T7 promoter expressing 5' HCV UTR and firefly luciferase (T7C1-341 
(Wang et ah, 1993, J. of Virol. 67, 3338-3344)) was mixed with a pRLSV40 Renilla 
control plasmid (Promega Corporation) followed by enzymatic nucleic acid molecule, and 
cationic lipid to make a 5X concentration of the reagents (T7C 1-341 (4 ug/ml), pRLSV40 

20 renilla luciferase control (6 pg/ml), enzymatic nucleic acid molecule (250 nM), 
transfection reagent (28.5 pg/ml). 

The complex mixture was incubated at 37 C for 20 minutes. The media was 
removed from the cells and 120 pi of Opti-mem media was added to the well followed by 
30 pi of the 5X complex mixture. 150 pi of Opti-mem was added to the wells holding the 

25 untreated cells. The complex mixture was incubated on OST-7 cells for 4 hours, lysed 
with passive lysis buffer (Promega Corporation) and luminescent signals were quantified 
using the Dual Luciferase Assay Kit using the manufacturer's protocol (Promega 
Corporation). The data shown in Figure 15 is a dose curve of enzymatic nucleic acid 
molecule targeting site 146 of the HCV RNA and is presented as a ratio between the firefly 

30 and Renilla luciferase fluorescence. The enzymatic nucleic acid molecule was able to 
reduce the quantity of HCV RNA at all enzymatic nucleic acid molecule concentrations 
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yielding an IC 50 of approximately 5 nM. Other sites were also efficacious (Figure 16), in 
particular enzymatic nucleic acid molecules targeting sites 133, 209, and 273 were also 
able to reduce HCV RNA compared to the irrelevant (IRR) controls. 

5 Cleavage of Substrates Using Completely Modified class I (Amberzyme) enzymatic 
nucleic acid molecule 

The ability of an enzymatic nucleic acid, which is modified at every 2' position to 
cleave a target RNA was tested to determine if any ribonucleotide positions are necessary 
in the Amberzyme motif. Enzymatic nucleic acid molecules were constructed with T-O- 

1 0 methyl, and 2'-amino (NH 2 ) nucleotides and included no ribonucleotides (Table 56; gene 
name: no ribo) and kinetic analysis was performed as described in example 13. 100 nM 
enzymatic nucleic acid was mixed with trace amounts of substrate in the presence of 1 mM 
MgCl 2 at physiological conditions (37°C). The Amberzyme with no ribonucleotide 
present in it has a K re i of 0. 1 3 compared to the enzymatic nucleic acid with a few 

1 5 ribonucleotides present in the molecule shown in Table 56 (ribo). This shows that 
Amberzyme enzymatic nucleic acid molecule may not require the presence of 2' -OH 
groups within the molecule for activity. 
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Substrate Recognition Rules for Class II (zinzvme) enzymatic nucleic acid molecules 
Class II (zinzyme) ribozymes were tested for their ability to cleave base-paired 
substrates with all sixteen possible combinations of bases immediately 5' and 3' proximal 
to the bulged cleavage site G. Ribozymes were identical in all remaining positions of their 
5 7 base pair binding arms. Activity was assessed at two and twenty-four hour time points 
under standard reaction conditions [20 mM HEPES pH 7.4, 140 mM KC1, 10 mM NaCl, 1 
mM MgCl 2 , 1 mM CaCl 2 - 37° C]. Figure 19 shows the results of this study. Base paired 
substrate UGG (not shown in the figure) cleaved as poorly as CGG shown in the figure. 
The figure shows the cleavage site substrate triplet in the 5'- 3' direction and 2 and 24 

1 0 hour time points are shown top to bottom respectively. The results indicate the cleavage 
site triplet is most active with a 5 '- Y-G-H -3' (where Y is C or U and H is A, C or U with 
cleavage between G and H); however, activity is detected particularly with the 24 hour 
time point for most paired substrates. All positions outside of the cleavage triplet were 
found to tolerate any base pairings (data not shown). 

1 5 All possible mispairs immediately 5 * and 3 ' proximal to the bulged cleavage site G 

were tested to a class U ribozyme designed to cleave a 5'-C-G-C -3'. It was observed the 
5' and 3' proximal sites are as active with G:U wobble pairs, in addition, the 5' proximal 
site will tolerate a mismatch with only a slight reduction in activity [data not shown]. 



20 Screening for Novel Enzymatic nucleic acid molecule Motifs (Class II Motifs) 

The selections were initiated with pools of > 10 14 modified RNA's of the following 
sequence: 5'-GGGAGGAGGAAGUQCCU(N)3 5 UGCCGCGCUCGCUCCCAGUCC-3'. The RNA 
was enzymatically generated using the mutant T7 Y639F RNA polymerase prepared by 
Rui Souza. The following modified NTP's were incorporated: 2'-deoxy-2'-fluoro- 

25 adenine triphosphate, 2'-deoxy-2'-fluoro-uridine triphosphate or 2'-deoxy-2'-fluoro-5-[(N- 
imidazole-4acetyl)propyl amine] uridine triphosphate, and 2'-deoxy-2'-amino-cytidine 
triphosphate; natural guanidine triphosphate was used in all selections so that alpha - 32 P- 
GTP could be used to label pool RNA's. RNA pools were purified by denaturing gel 
electrophoresus 8% polyacrilamide 7 M Urea. 

30 The following target RNA (resin A) was synthesized and coupled to Iodoacetyl 

Ultralink™ resin (Pierce) by the supplier's proceedure:5' -b-L- 
GGACUGGGAGCGAGCGCGGCGCAGGCACU GAAG-L-S-B-3'; where b is biotin (Glenn 
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Research cat# 10-1953-nn), L is polyethylene glycol spacer (Glenn Research cat# 10-1918- 
nn), S is thiol-modifier C6 S-S (Glenn Research cat# 10-1936-nn), B is a standard inverted 
deoxy abasic. 

RNA pools were added to 100 ul of 5 uM Resin A in the buffer A (20 mM HEPES 
5 pH 7.4, 140 mM KCL, 10 mM NaCl) and incubated at 22°C for 5 minutes. The 

temperature was then raised to 37°C for 10 minutes. The resin was washed with 5 ml 
buffer A. Reaction was triggered by the addition of buffer B(20 mM HEPES pH 7.4, 140 
mM KCL, 10 mM NaCl, 1 mM MgCl 2 , 1 mM CaCl 2 ). Incubation proceeded for 20 
minutes in the first generation and was reduced progressively to 1 minute in the final 

1 0 generations; with 13 total generations. The reaction eluent was collected in 5 M NaCl to 
give a final concentration of 2 M NaCl. To this was added 100 p.1 of 50% slurry Ultralink 
NeutraAvidin™ (Pierce). Binding of cleaved bio tin product to the avidin resin was 
allowed by 20 minute incubation at 22° C. The resin was subsequently washed with 5 ml 
of 20 mM HEPES pH 7.4, 2 M NaCl. Desired RNA's were removed by a 1 .2 ml 

1 5 denaturing wash 1M NaCl, 10 M Urea at 94° C over 10 minutes. RNA's were double 

precipitated in 0.3 M sodium acetate to remove CI" ions inhibitory to reverse transcription. 
Standard protocols of reverse transcription and PCR amplification were performed. 
RNA's were again transcribed with the modified NTP's described above. After 13 
generations cloning and sequencing provided 14 sequences which were able to cleave the 

20 target substrate. Six sequences were characterized to determine secondary structure and 
kinetic cleavage rates. The structures and kinetic data are given in Figure 17. The 
sequences of eight other enzymatic nucleic acid molecule sequences are given in Table 57. 
The size, sequence, and chemical compositions of these molecules can be modified as 
described below or using other techniques well known in the art. 

25 

Nucleic Acid Catalyst Engineering 

Sequence, chemical and structural variants of Class I and Class II enzymatic nucleic 
acid molecule can be engineered and re-engineered using the techniques shown in this 
application and known in the art. For example, the size of class I and class II enzymatic 
30 nucleic acid molecules can, be reduced or increased using the techniques known in the art 
(Zaug et al, 1986, Nature, 324, 429; Ruffher et al, 1990, Biochem., 29, 10695; Beaudry et 
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al, 1990, Biochem., 29, 6534; McCall et al, 1992, Proc. Natl. Acad. Sci., USA., 89, 5710; 
Longer a/., 1994, supra; Hendry et al, 1994, BBA 1219, 405; Benseler etal, 1993, JACS, 
115, 8483; Thompson et al, 1996, Nucl. Acids Res., 24, 4401; Michels et al, 1995, 
Biochem., 34, 2965; Been a/., 1992, Biochem., 31, 1 1843; Guo et al, 1995, J., 
5 14, 368; Pan et al, 1994, Biochem., 33, 9561; Cech, 1992, Curr. Op. Struc. Bio., 2, 605; 
Sugiyama etal, 1996, FEBSLett., 392, 215; Beigelman etal, 1994, Bioorg. Med. Chem., 
A, 1715; Santoro et al, 1997, PiMS 94, 4262; all are incorporated in their totality by 
reference herein), to the extent that the overall catalytic activity of the ribozyme is not 
significantly decreased. 

1 0 Further rounds of in vitro selection strategies described herein and variations thereof 

can be readily used by a person skilled in the art to evolve additional nucleic acid catalysts 
and such new catalysts are within the scope of the instant invention. 



Example 16: Activity of Class n (zinzyme) nucleic acid catalysts to inhibit HER2 gene 
15 expression 

Applicant has designed, synthesized and tested several class II (zinzyme) ribozymes 
targeted against HER2 RNA (see, for example, Tables 58, 59, and 62) in cell proliferation 
RNA reduction assays. 
Proliferation assay: 

20 The model proliferation assay used in the study can require a cell-plating density of 

2000-10000 cells/well in 96-well plates and at least 2 cell doublings over a 5-day treatment 
period. Cells used in proliferation studies were either human breast or ovarian cancer cells 
(SKBR-3 and SKOV-3 cells respectively). To calculate cell density for proliferation 
assays, the FIPS (fluoro-imaging processing system) method well known in the art was 

25 used. This method allows for cell density measurements after nucleic acids are stained 
with CyQuant® dye, and has the advantage of accurately measuring cell densities over a 
very wide range 1,000-100,000 cells/well in 96-well format. 

Ribozymes (50-200 nM) were delivered in the presence of cationic lipid at 2.0-5.0 
u-g/mL and inhibition of proliferation was determined on day 5 post-treatment. Two full 

30 ribozyme screens were completed resulting in the selection of 14 ribozymes. Class II 

(zinzyme) ribozymes against sites, 314 (RPINo. 18653), 443 (RPINo. 18680), 597 (RPI 
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No. 18697), 659 (RPINo. 18682), 878 (RPINos. 18683 and 18654), 881 (RPINos. 18684 
and 18685) 934 (RPINo. 18651), 972 (RPINo. 18656, 19292, 19727, 19728, and 19293), 
1292 (RPINo. 18726), 1541 (RPINo. 18687), 2116 (RPINo. 18729), 2932 (RPINo. 
18678), 2540 (RPI No. 18715), and 3504 (RPI No. 18710) caused inhibition of 
proliferation ranging from 25-80% as compared to a scrambled control ribozyme. An 
example of results from a cell culture assay is shown in Figure 20. Referring to Figure 
20, Class II ribozymes targeted against HER2 RNA are shown to cause significant 
inhibition of proliferation of cells. This shows that ribozymes, for instance the Class II 
(zinzyme) ribozymes are capable of inhibiting HER2 gene expression in mammalian cells. 

RNA assay: 

RNA was harvested 24 hours post-treatment using the Qiagen RNeasy® 96 
procedure. Real time RT-PCR (TaqMan® assay) was performed on purified RNA 
samples using separate primer/probe sets specific for either target HER2 RNA or control 
actin RNA (to normalize for differences due to cell plating or sample recovery). Results 
are shown as the average of triplicate determinations of HER2 to actin RNA levels post- 
treatment. Figure 30 shows class II ribozyme (zinzyme) mediated reduction in HER2 
RNA targeting site 972 vs a scrambled attenuated control. 



20 Dose response assays: 

Active ribozyme was mixed with binding arm-attenuated control (BAC) ribozyme to 
a final oligonucleotide concentration of either 100, 200 or 400 nM and delivered to cells in 
the presence of cationic lipid at 5.0 (j.g/mL. Mixing active and BAC in this manner 
maintains the lipid to ribozyme charge ratio throughout the dose response curve. HER2 

25 RNA reduction was measured 24 hours post-treatment and inhibition of proliferation was 
determined on day 5 post-treatment. The dose response antiproliferation results are 
summarized in Figure 31 and the dose-dependent reduction of HER2 RNA results are 
summarized in Figure 32. Figure 33 shows a combined dose response plot of both anti- 
proliferation and RNA reduction data for a class II ribozyme targeting site 972 of HER2 

30 RNA (RPI 19293). 
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Example 17: Reduction of ribose residues in Class II (zinzyme) nucleic acid catalysts 

Class II (zinzyme) nucleic acid catalysts were tested for their activity as a function 
ribonucleotide content. A Zinzyme having no ribonucleotide residue (i'e., no 2'-OH group 
at the 2' position of the nucleotide sugar) against the K-Ras site 521 was designed. This 
5 molecules were tested utilizing the chemistry shown in Figure 27a. The in vitro catalytic 
activity zinzyme construct was not significantly effected (the cleavage rate reduced only 
10 fold). 

The Kras zinzyme shown in Figure 27a was tested in physiological buffer with the 
divalent concentrations as indicated in the legend (high NaCl is an altered monovalent 

1 0 condition shown) of Figure 28. The 1 mM Ca^ condition yielded a rate of 0.005 min" 1 
while the 1 mM Mg^ condition yielded a rate of 0.002 min" 1 . The ribose containing wild 
type yields a rate of 0.05 min" 1 while substrate in the absence of zinzyme demonstrates less 
than 2% degradation at the longest time point under reaction conditions shown. This 
illustrates a well-behaved cleavage reaction done by a non-ribose containing catalyst with 

1 5 only a 10-fold reduced cleavage as compared to ribonucleotide-containing zinzyme and 
vastly above non-catalyzed degradation. 

A more detailed investigation into the role of ribose positions in the Class n 
(zinzyme) motif was carried out in the context of the HER2 site 972 (Applicant has further 
designed a fully modified Zinzyme as shown in Figure 27b targeting the HER2 RNA site 

20 972). Figure 29 is a diagram of the alternate formats tested and their relative rates of 
catalysis. The effect of substitution of ribose G for the 2'-0-methyl C-2'-0-methyl A in 
the loop of Zinzyme (see Figure 34) was insignificant when assayed with the Kras target 
but showed a modest rate enhancement in the HER2 assays. The activity of all Zinzyme 
motifs, including the fully stabilized "0 ribose" (RPI 19727) are well above background 

25 noise level degradation. Zinzyme with only two ribose positions (RPI 19293) are 

sufficient to restore "wild-type" activity. Motifs containing 3 (RPI 19729), 4 (RPI 19730) 
or 5 ribose (RPI 19731) positions demonstrated a greater extent of cleavage and profiles 
almost identical to the 2 ribose motif. Applicant has thus demonstrated that a Zinzyme 
with no ribonucleotides present at any position can catalyze efficient RNA cleavage 

30 activity. Thus, Zinzyme enzymatic nucleic acid molecules do not require the presence of 
2' -OH group within the molecule for catalytic activity. 
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Example 18: Activity of reduced ribose containing Class II (zinzyme) nucleic acid 
catalysts to inhibit HER2 gene expression 

A cell proliferation assay for testing reduced ribo class II (zinzyme) nucleic acid 
catalysts (50-400 nM) targeting HER2 site 972 was performed as described in example 19. 
5 The results of this study are summarized in Figure 35. These results indicate significant 
inhibition of HER2 gene expression using stabilized Class II (zinzyme) motifs, including 
two ribo (RPI 19293), one ribo (RPI 19728), and non-ribo (RPI 19727) containing nucleic 
acid catalysts. 

10 Applications 

The use of NTP's described in this invention have several research and commercial 
applications. These modified nucleotide triphosphates can be used for in vitro selection 
(evolution) of oligonucleotides with novel functions. Examples of in vitro selection 
protocols are incorporated herein by reference (Joyce, 1989, Gene, 82, 83-87; Beaudry et 
15 al, 1992, Science 257, 635-641; Joyce, 1992, Scientific American 267, 90-97; Breaker et 
al, 1994, TIBTECH 12, 268; Bartel et a/.,1993, Science 261:141 1-1418; Szostak, 1993, 
TIBS 17, 89-93; Kumar et al, 1995, FASEB J., 9, 1183; Breaker, 1996, Curr. Op. Biotech., 
7, 442). 

Additionally, these modified nucleotide triphosphates can be employed to generate 
20 modified oligonucleotide combinatorial chemistry libraries. Several references for this 
technology exist (Brenner et al., 1992, PNAS 89, 5381-5383, Eaton, 1997, Curr. Opin. 
Chem. Biol. 1, 10-16), which are all incorporated herein by reference. 

Diagnostic uses 

25 Enzymatic nucleic acid molecules of this invention may be used as diagnostic tools 

to examine genetic drift and mutations within diseased cells or to detect the presence of 
specific RNA in a cell. The close relationship between enzymatic nucleic acid molecule 
activity and the structure of the target RNA allows the detection of mutations in any region 
of the molecule which alters the base-pairing and three-dimensional structure of the target 

30 RNA. By using multiple enzymatic nucleic acid molecules described in this invention, 
one may map nucleotide changes which are important to RNA structure and function in 
vitro, as well as in cells and tissues. Cleavage of target RNAs with enzymatic nucleic acid 
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molecules may be used to inhibit gene expression and define the role (essentially) of 
specified gene products in the progression of disease. In this manner, other genetic targets 
may be defined as important mediators of the disease. These experiments will lead to 
better treatment of the disease progression by affording the possibility of combinational 
5 therapies (e.g., multiple enzymatic nucleic acid molecules targeted to different genes, 

enzymatic nucleic acid molecules coupled with known small molecule inhibitors, radiation 
or intermittent treatment with combinations of enzymatic nucleic acid molecules and/or 
other chemical or biological molecules). Other in vitro uses of enzymatic nucleic acid 
molecules of this invention are well known in the art, and include detection of the presence 
1 0 ofmRNAs associated with related conditions. Such RNA is detected by determining the 
presence of a cleavage product after treatment with a enzymatic nucleic acid molecule 
using standard methodology. 

In a specific example, enzymatic nucleic acid molecules which can cleave only wild- 
type or mutant forms of the target RNA are used for the assay. The first enzymatic nucleic 
1 5 acid molecule is used to identify wild-type RNA present in the sample and the second 
enzymatic nucleic acid molecule will be used to identify mutant RNA in the sample. As 
reaction controls, synthetic substrates of both wild-type and mutant RNA will be cleaved 
by both enzymatic nucleic acid molecules to demonstrate the relative enzymatic nucleic 
acid molecule efficiencies in the reactions and the absence of cleavage of the "non- 
20 targeted" RNA species. The cleavage products from the synthetic substrates will also 
serve to generate size markers for the analysis of wild type and mutant RNAs in the 
sample population. Thus each analysis can involve two enzymatic nucleic acid molecules, 
two substrates and one unknown sample which can be combined into six reactions. The 
presence of cleavage products can be determined using an RNAse protection assay so that 
25 full-length and cleavage fragments of each RNA can be analyzed in one lane of a 

polyacrylamide gel. It is not absolutely required to quantify the results to gain insight into 
the expression of mutant RNAs and putative risk of the desired phenotypic changes in 
target cells. The expression of mRNA whose protein product is implicated in the 
development of the phenotype is adequate to establish risk. If probes of comparable 
30 specific activity are used for both transcripts, then a qualitative comparison of RNA levels 
will be adequate and will decrease the cost of the initial diagnosis. Higher mutant form to 
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wild-type ratios will be correlated with higher risk whether RNA levels are compared 
qualitatively or quantitatively. 



Additional Uses 

5 Potential usefulness of sequence-specific enzymatic nucleic acid molecules of the 

instant invention can have many of the same applications for the study of RNA that DNA 
restriction endonucleases have for the study of DNA (Nathans et al, 1975 Ann. Rev. 
Biochem. 44:273). For example, the pattern of restriction fragments can be used to 
establish sequence relationships between two related RNAs, and large RNAs could be 

1 0 specifically cleaved to fragments of a size more useful for study. The ability to engineer 
sequence specificity of the enzymatic nucleic acid molecule is ideal for cleavage of RNAs 
of unknown sequence. Applicant describes the use of nucleic acid molecules to down- 
regulate gene expression of target genes in bacterial, microbial, fungal, viral, and 
eukaryotic systems including plant, or mammalian cells. 

1 5 All patents and publications mentioned in the specification are indicative of the 

levels of skill of those skilled in the art to which the invention pertains. All references 
cited in this disclosure are incorporated by reference to the same extent as if each reference 
had been incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well 

20 adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
those inherent therein. The methods and compositions described herein as presently 
representative of preferred embodiments are exemplary and are not intended as limitations 
on the scope of the invention. Changes therein and other uses will occur to those skilled in 
the art, which are encompassed within the spirit of the invention, are defined by the scope 

25 of the claims. 

It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. Thus, such additional embodiments are within the scope 
of the present invention and the following claims. 

30 The invention illustratively described herein suitably may be practiced in the absence 

of any element or elements, limitation or limitations which is not specifically disclosed 
herein. Thus, for example, in each instance herein any of the terms "comprising", 
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"consisting essentially of and "consisting of may be replaced with either of the other two 
terms. The terms and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention that in the use of such terms and 
expressions of excluding any equivalents of the features shown and described or portions 
5 thereof, but it is recognized that various modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that although the present invention has 
been specifically disclosed by preferred embodiments, optional features, modification and 
variation of the concepts herein disclosed may be resorted to by those skilled in the art, and 
that such modifications and variations are considered to be within the scope of this 

1 0 invention as defined by the description and the appended claims. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize 
that the invention is also thereby described in terms of any individual member or subgroup 
of members of the Markush group or other group. 

1 5 Thus, additional embodiments are within the scope of the invention and within the 

following claims 
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Characteristics of naturally occurring ribozymes 
Group I Introns 

• Size: -150 to >1000 nucleotides. 

• Requires a U in the target sequence immediately 5' of the cleavage site. 

• Binds 4-6 nucleotides at the 5'-side of the cleavage site. 

• Reaction mechanism: attack by the 3' -OH of guanosine to generate cleavage products 
with 3'-OH and 5'-guanosine. 

• Additional protein cofactors required in some cases to help folding and maintainance of 
the active structure. 

• Over 300 known members of this class. Found as an intervening sequence in Tetrahymena 
thermophila rRNA, fungal mitochondria, chloroplasts, phage T4, blue-green algae; and 
others. 

• Major structural features largely established through phylogenetic comparisons, 
mutagenesis, and biochemical studies [',"]. 

• Complete kinetic framework established for one ribozyme ["VV/'l- 

• Studies of ribozyme folding and substrate docking underway [ vii , viii , ix ]. 

• Chemical modification investigation of important residues well established [ x , xi ]. 

• The small (4-6 nt) binding site may make this ribozyme too non-specific for targeted RNA 
cleavage, however, the Tetrahymena group I intron has been used to repair a "defective" 
P-galactosidase message by the ligation of new p-galactosidase sequences onto the 
defective message [*"]. 

RNAse P RNA (M1 RNA) 

Size: -290 to 400 nucleotides. 

• RNA portion of a ubiquitous ribonucleoprotein enzyme. 

• Cleaves tRNA precursors to form mature tRNA [ xiii ]. 

• Reaction mechanism: possible attack by M 2+ -OH to generate cleavage products with 3'- 
OH and 5' -phosphate. 

• RNAse P is found throughout the prokaryotes and eukaryotes. The RNA subunit has 
been sequenced from bacteria, yeast, rodents, and primates. 

Recruitment of endogenous RNAse P for therapeutic applications is possible through 
hybridization of an External Guide Sequence (EGS) to the target RNA [ xiv , xv ] 

• Important phosphate and 2' OH contacts recently identified [*vi/vii] 

Group II Introns 

Size: >1000 nucleotides. 

• Trans cleavage of target RNAs recently demonstrated [xviii xix] 
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• Sequence requirements not fully determined. 

• Reaction mechanism: 2'-OH of an internal adenosine generates cleavage products with 3'- 
OH and a "lariat" RNA containing a 3'-5' and a 2' -5' branch point. 

• Only natural ribozyme with demonstrated participation in DNA cleavage [ xx / xi ] in 
addition to RNA cleavage and ligation. 

• Major structural features largely established through phylogenetic comparisons [ xxii ]. 
Important 2' OH contacts beginning to be identified [ xxiii ] 

Kinetic framework under development [ xxiv ] 

Neurospora VS RNA 

• Size: -144 nucleotides. 

Trans cleavage of hairpin target RN As recently demonstrated [ xxv ] . 

• Sequence requirements not fully determined. 

• Reaction mechanism: attack by 2' -OH 5' to the scissile bond to generate cleavage products 
with 2',3'-cyclic phosphate and 5' -OH ends. 

Binding sites and structural requirements not fully determined. 

• Only 1 known member of this class. Found in Neurospora VS RNA. 

Hammerhead Ribozyme 

(see text for references) 

• Size: -13 to 40 nucleotides. 

Requires the target sequence UH immediately 5' of the cleavage site. 

• Binds a variable number nucleotides on both sides of the cleavage site. 

• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage products 
with 2',3'-cyclic phosphate and 5'-OH ends. 

• 14 known members of this class. Found in a number of plant pathogens (virusoids) that 
use RNA as the infectious agent. 

• Essential structural features largely defined, including 2 crystal structures [xxvi/xvfl] 

• Minimal ligation activity demonstrated (for engineering through in vitro selection) [ xxv i"] 

• Complete kinetic framework established for two or more ribozymes [ xxix ]. 

• Chemical modification investigation of important residues well established [ xxx ]. 

Hairpin Ribozyme 

• Size: -50 nucleotides. 

• Requires the target sequence GUC immediately 3' of the cleavage site. 

• Binds 4-6 nucleotides at the 5'-side of the cleavage site and a variable number to the 3'- 
side of the cleavage site. 

• Reaction mechanism: attack by 2' -OH 5' to the scissile bond to generate cleavage products 
with 2' / 3'-cyclic phosphate and 5'-OH ends. 

• 3 known members of this class. Found in three plant pathogen (satellite RNAs of the 
tobacco ringspot virus, arabis mosaic virus and chicory yellow mottle virus) which uses 
RNA as the infectious agent. 

• Essential structural features largely defined [xxx^xxxi^xxxii^xxxiv] 
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• Ligation activity (in addition to cleavage activity) makes ribozyme amenable to 
engineering through in vitro selection [ xxxv ] 

• Complete kinetic framework established for one ribozyme [ x * xv «]. 

• Chemical modification investigation of important residues begun [xxxvii xxxviii] 

Hepatitis Delta Virus (HDV) Ribozyme 

• Size: ~60 nucleotides. 

• " Trans cleavage of target RN As demonstrated [ xxxix ]. 

• Binding sites and structural requirements not fully determined, although no sequences 5' 
of cleavage site are required. Folded ribozyme contains a pseudoknot structure [ x1 ]. 

• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage products 
with 2',3'-cyclic phosphate and 5'-OH ends. 

• Only 2 known members of this class. Found in human HDV. 

• Circular form of HDV is active and shows increased nuclease stability [ xli ] 
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Table 3 

Table 3; Human PTP-1B Hammerhead Ribozyme and Target Sequence 



Nt. 


Ribozyme Sequence 


Seq. ID 


Substrate Sequence 


Seq. ID 


P °*'s" >n 


UGCCGCUC CUGAUGAG X CGAA AGGCCGCG 


1 


CGCGGCCT A GAGCGGCA 


529 


72 


AUCUCCAU CUGAUGAG X CGAA ACGG3CCA 




TGGCCCGT C ATGGAGAT 




92 


UCUGCUCG CUGAUGAG X CGAA ACUCCUUU 


3 


AAAGGAGT T CGAGCAGA 


531 


93 


AUCUGCUC CUGAUGAG X CGAA AACUCCUU 




AAGGAGTT C GAGCAGAT 


532 


102 


GACUUGUC CUGAUGAG X CGAA AUCUGCUC 


5 


GAGCAGAT C GACAAGTC 


533 




AGCUCCCG CUGAUGAG X CGAA ACUUGUCG 


6 


CGACAAGT C CGGGAGCT 




129 


UCCUGGUA CUGAUGAG X CGAA AUGGCCGC 




GCGGCCAT T TACCAGGA 


535 




AUCCUGGU CUGAUGAG X CGAA AAUG3CCG 


g 


CGGCCATT T ACCAGGAT 




— 151 — 


UAUCCUGG CUGAUGAG X CGAA AAAUGGCC 


g 


GGCCATTT A CCAGGATA 


— 5T7 — 


139 


AUGUCGGA CUGAUGAG X CGAA AUCCUGGU 


10 


ACCAGGAT A TCCGACAT 


538 


141 


UCAUGUCG CUGAUGAG X CGAA AUAUCCUG 




CAGGATAT C CGACATGA 


539 


161 


UACAUGGG CUGAUGAG X CGAA AGUCACUG 


— n 

— — 


CAGTGACT T CCCATGTA 


540 


152 


CUACAUGG CUGAUGAG X CGAA AAGUCACU 




AGTGACTT C CCATGTAG 


541 


169 


GGCCACUC CUGAUGAG X CGAA ACAU3GGA 


14 


TCCCATGT A GAGTGGCC 


542 


183 


UUCUUAGG CUGAUGAG X CGAA AGCUUGGC 


— 


GCCAAGCT T CCTAAGAA 


543 


184 


GUUCUUAG CUGAUGAG X CGAA AAGCUUGG 


16 


CCAAGCTT C CTAAGAAC 


544 


187 


UUUGUUCU CUGAUGAG X CGAA AGGAAGCU 


17 


AGCTTCCT A AGAACAAA 


545 


205 




18 




546 


209 


CGUCUCUG CUGAUGAG X CGAA ACCUAUUU 


19 


AAATAGGT A CAGAGACG 


547 


219 


AAGGGACU CUGAUGAG X CGAA ACGUCUCU 


20 


AGAGACGT C AGTCCCTT 


548 


223 


GUCAAAGG CUGAUGAG X CGAA ACUGACGU 


21 


ACGTCAGT C CCTTTGAC 


549 


227 


UAUGGUCA CUGAUGAG X CGAA AGGGACUG 


22 


CAGTCCCT T TGACCATA 


550 


22B 


CUAUGGUC CUGAUGAG X CGAA AAGGGACU 


23 


AGTCCCTT T GACCATAG 




235 


AAUCCGAC CUGAUGAG X CGAA AUGGUCAA 


24 


TTGACCAT A GTCGGATT 


5T1 


238 


UUUAAUCC CUGAUGAG X CGAA ACUAUGGU 


25 


ACCATAGT C GGATTAAA 


55! 


243 


UGUAGUUU CUGAUGAG X CGAA AUCC3ACU 


26 


AGTCGGAT T AAACTACA 


554 


244 


AUGUAGUU CUGAUGAG X CGAA AAUCCGAC 


27 


GTCGGATT A AACTACAT 




249 


UCUUGAUG CUGAUGAG X CGAA AGUUUAAU 


28 


ATTAAACT A CATCAAGA 


556 


253 


AUCUUCUU CUGAUGAG X CGAA AUGUAGUU 




AACTACAT C AAGAAGAT 


557 


262 


AUAGUCAU CUGAUGAG X CGAA AUCUUCUU 


3^ 


AAGAAGAT A ATGACTAT 


558 




CGUUGAUA CUGAUGAG X CGAA AGUCAUUA 


31 


TAATGACT A TATCAACG 


559 


271 


AGCGUUGA CUGAUGAG X CGAA AUAGUCAU 


32 


ATGACTAT A TCAACGCT 


560 


273 


CUAGCGUU CUGAUGAG X CGAA AUAUAGUC 


33 


GACTATAT C AACGCTAG 


561 


280 


UAUCAAAC CUGAUGAG X CGAA AGCGUUGA 


34 


TCAACGCT A GTTTGATA 


562 


283 


UUUUAUCA CUGAUGAG X CGAA ACUAGCGU 


35 


ACGCTAGT T TGATAAAA 


563 


284 


UUUUUAUC CUGAUGAG X CGAA AACUAGCG 


36 


CGCTAGTT T GATAAAAA 


564 


288 


UCCAUUUU CUGAUGAG X CGAA AUCAAACU 


37 


AGTTTGAT A AAAATGGA 


565 


313 


AAGAAUGU CUGAUGAG X CGAA ACUCCUUU 


38 


AAAGGAGT T ACATTCTT 


566 


314 


UAAGAAUG CUGAUGAG X CGAA AACUCCUU 


39 


AAGGAGTT A CATTCTTA 


567 


318 


UGGGUAAG CUGAUGAG X CGAA AUGUAACU 


40 


AGTTACAT T CTTACCCA 


568 


319 


CUGGGUAA CUGAUGAG X CGAA AAUG'JAAC 


41 


GTTACATT C TTACCCAG 


569 


321 


CCCUGGGU CUGAUGAG X CGAA AGAAUGUA 


42 


TACATTCT T ACCCAGGG 


570 


322 


GCCCUGGG CUGAUGAG X CGAA AAGAAUGU 


43 


ACATTCTT A CCCAGGGC 


571 


334 


GUUAGGCA CUGAUGAG X CGAA AGGGCCCU 




AGGGCCCT T TGCCTAAC 


572 
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UGUUAGGC CUGAUGAG X CGAA AAGGGCCC 
CUGAUGAG X CGAA AGGCAAAG 



GGGCCCTT T GCCTAACA 
CTTTGCCT A ACACATGC 



CCAAAAGU CUGAUGAG X CGAA ACCGCAUG 



CATGCGGT C ACTTTTGG 



UCUCCCAA CUGAUGAG X CGAA AGUGACCG 



CGGTCACT T TTGGGAGA 



AUCUCCCA CUGAUGAG X CGAA AAGUGACC 



GGTCACTT T TGGGAGAT 



CAUCUCCC CUGAUGAG X CGAA AAAGUGAC 



GTCACTTT T GGGAGATG 



AGCAUGAC CUGAUGAG X CGAA ACACCCCU 



AGGGGTGT C GTCATGCT 



UUGAGCAU CUGAUGAG X CGAA ACGACACC 



GGTGTCGT C ATGCTCAA 



ACUCUGUU CUGAUGAG X CGAA AGCAUGAC 



GTCATGCT C AACAGAGT 



UUUUAACG CUGAUGAG X CGAA ACCUUUCU 



AGAAAGGT T CGTTAAAA 



AUUUUAAC CUGAUGAG X CGAA AACCUUUC 



GAAAGGTT C GTTAAAAT 



CGCAUUUU CUGAUGAG X CGAA ACGAACCU 



AGGTTCGT T AAAATGCG 



GCGCAUUU CUGAUGAG X CGAA AACGAACC 



GGTTCGTT A AAATGCGC 



GUGGCCAG CUGAUGAG X CGAA AUUGUGCG 



CGCACAAT A CTGGCCAC 



UCUUCAAA CUGAUGAG X CGAA AUCAUCUC 



GAGATGAT C TTTGAAGA 



UGUCUUCA CUGAUGAG X CGAA AGAUCAUC 



GATGATCT T TGAAGACA 



GUGUCUUC CUGAUGAG X CGAA AAGAUCAU 



ATGATCTT T GAAGACAC 



UAAUUUCA CUGAUGAG X CGAA AUUUGUGU 



UUAAUUUC CUGAUGAG X CGAA AAUUUGUG 



CACAAATT T GAAATTAA 



UCAAUGUU CUGAUGAG X CGAA AUUUCAAA 



TTTGAAAT T AACATTGA 



AUCAAUGU CUGAUGAG X CGAA AAUUUCAA 



TTGAAATT A ACATTGAT 



CAGAGAUC CUGAUGAG X CGAA AUGUUAAU 



ATTAACAT T GATCTCTG 



UCUUCAGA CUGAUGAG X CGAA AUCAAUGU 



ACATTGAT C TCTGAAGA 



UAUCUUCA CUGAUGAG X CGAA AGAUCAAU 



UGACUUGA CUGAUGAG X CGAA AUCUUCAG 



CTGAAGAT A TCAAGTCA 



UAUGACUU CUGAUGAG X CGAA AUAUCUUC 



GAAGATAT C AAGTCATA 



UAUAAUAU CUGAUGAG X CGAA ACUUGAUA 



TATCAAGT C ATATTATA 



CUGUAUAA CUGAUGAG X CGAA AUGACUUG 



CAAGTCAT A TTATACAG 



618 
624 



CACUGUAU CUGAUGAG X CGAA 
GCACUGUA CUGAUGAG X CGAA AAUAUGAC 
UCGCACUG CUGAUGAG X CGAA AUAAUAUG 
UCCAAUUC CUGAUGAG X CGAA AGCUGUCG 
GGUUUUCC CUGAUGAG X CGAA AUUCUAGC 
UGGGUUGU CUGAUGAG X CGAA AGGUUUUC 
UUGGGUUG CUGAUGAG X CGAA AAGGUUUU 
GAUCUCUC CUGAUGAG X CGAA AGUUUCUU 
AAAUGUAA CUGAUGAG X CGAA AUCUCUCG 
GGAAAUGU CUGAUGAG X CGAA AGAUCUCU 
UGGAAAUG CUGAUGAG X CGAA AAGAUCUC 
AUAGUGGA CUGAUGAG X CGAA AUGUAAGA 
UAUAGUGG CUGAUGAG X CGAA AAUGUAAG 
GUAUAGUG CUGAUGAG X CGAA AAAUGUAA 
AUGUGGUA CUGAUGAG X CGAA AGUGGAAA 
CCAUGUGG CUGAUGAG X CGAA AUAGUGGA 
GGACUCCA CUGAUGAG X CGAA AGUCAGGC 
GGGACUCC CUGAUGAG X CGAA AAGUCAGG 
GAUUCAGG CUGAUGAG X CGAA ACUCCAAA 



AGTCATAT T ATACAGTG 
GTCATATT A TACAGTGC 
CATATTAT A CAGTGCGA 
CGACAGCT A GAATTGGA 
GCTAGAAT T GGAAAACC 
GAAAACCT T ACAACCCA 
AAAACCTT A CAACCCAA 
AAGAAACT C GAGAGATC 
CGAGAGAT C TTACATTT 
AGAGATCT T ACATTTCC 
GAGATCTT A CATTTCCA 
TCTTACAT T TCCACTAT 
CTTACATT T CCACTATA 
TTACATTT C CACTATAC 
TTTCCACT A TACCACAT 
TCCACTAT A CCACATGG 
GCCTGACT T TGGAGTCC 
CCTGACTT T GGAGTCCC 
TTTGGAGT C CCTGAATC 
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AGGCUGGU CUGAUGAG X CGAA AUUCAGGG 

UCAAGAAU CUGAUGAG X CGAA AGGCUGGU 

AGUUCAAG CUGAUGAG X CGAA AUGAGGCU 

AAGUUCAA CUGAUGAG X CGAA AAUGAGGC 

GAAAGUUC CUGAUGAG X CGAA AGAAUGAG 

UGAAAAGA CUGAUGAG X CGAA AGUUCAAG 

UUGAAAAG CUGAUGAG X CGAA AAGUUCAA 

UUUGAAAA CUGAUGAG X CGAA AAAGUUCA 

ACUUUGAA CUGAUGAG X CGAA AGAAAGUU 

GACUUUGA CUGAUGAG X CGAA AAGAAAGU 

GGACUUUG CUGAUGAG X CGAA AAAGAAAG 

CGGACUUU CUGAUGAG X CGAA AAAAGAAA 

GACUCUCG CUGAUGAG X CGAA ACUUUGAA 

GUGACCCU CUGAUGAG X CGAA ACUCUCGG 

GGCUGAGU CUGAUGAG X CGAA ACCCUGAC 

UCCGGGCU CUGAUGAG X CGAA AGUGACCC 

UGCACCAC CUGAUGAG X CGAA ACGGGCCC 

GACCUGCC CUGAUGAG X CGAA AUGCCUGC 

AGGUUCCA CUGAUGAG X CGAA ACCUGCCG 

CCAGACAG CUGAUGAG X CGAA AGGUUCCA 

GCCAGACA CUGAUGAG X CGAA AAGGUUCC 

AUCAGCCA CUGAUGAG X CGAA ACAGAAGG 

GAGGCAGG CUGAUGAG X CGAA AUCAGCCA 

AUCAGCAA CUGAUGAG X CGAA AGGCAGGU 

CCAUCAGC CUGAUGAG X CGAA AGAGGCAG 



CCCTGAAT C ACCAGCCT 
ACCAGCCT C ATTCTTGA 
AGCCTCAT T CTTGAACT 
GCCTCATT C TTGAACTT 
CTCATTCT T GAACTTTC 
CTTGAACT T TCTTTTCA 
TTGAACTT T CTTTTCAA 
TGAACTTT C TTTTCAAA 
AACTTTCT T TTCAAAGT 
ACTTTCTT T TCAAAGTC 
CTTTCTTT T CAAAGTCC 
TTTCTTTT C AAAGTCCG 
TTCAAAGT C CGAGAGTC 
CCGAGAGT C AGGGTCAC 
GTCAGGGT C ACTCAGCC 
GGGTCACT C AGCCCGGA 
GGGCCCGT T GTGGTGCA 
GCAGGCAT C GGCAGGTC 
CGGCAGGT C TGGAACCT 
TGGAACCT T CTGTCTGG 
GGAACCTT C TGTCTGGC 
CCTTCTGT C TGGCTGAT 
TGGCTGAT A CCTGCCTC 
ACCTGCCT C TTGCTGAT 
T GCTGATGG 



AACGGAAG CUGAUGAG X CGAA AGGGUCUU 



AAGACCCT T CTTCCGTT 



CAACGGAA CUGAUGAG X CGAA 

AUCAACGG CUGAUGAG X CGAA AGAAGGGU 

UAUCAACG CUGAUGAG X CGAA AAGAAGGG 

UUGAUAUC CUGAUGAG X CGAA ACGGAAGA 

UUUCUUGA CUGAUGAG X CGAA AUCAACGG 

ACUUUCUU CUGAUGAG X CGAA AUAUCAAC 

UCAUUUCU CUGAUGAG X CGAA ACAGCACU 

CUCAUUUC CUGAUGAG X CGAA AACAGCAC 

CCAUCCGA CUGAUGAG X CGAA ACUUCCUC 

CCCAUCCG CUGAUGAG X CGAA AACUUCCU 

CCCCAUCC CUGAUGAG X CGAA AAACUUCC 

GCUGUCUG CUGAUGAG X CGAA AUCAGCCC 

GGUAGGAG CUGAUGAG X CGAA AGCGCAGC 

AGGUAGGA CUGAUGAG X CGAA AAGCGCAG 

CCAGGUAG CUGAUGAG X CGAA AGAAGCGC 

CAGCCAGG CUGAUGAG X CGAA AGGAGAAG 

GCACCUUC CUGAUGAG X CGAA AUCACAGC 

CCAUGAUG CUGAUGAG X CGAA AUUUGGCA 

CCCAUGAU CUGAUGAG X CGAA AAUUUGGC 

UCCCCCAU CUGAUGAG X CGAA AUGAAUUU 

GCACGGAA CUGAUGAG X CGAA AGUCCCCC 



AGACCCTT C TTCCGTTG 



CCCTTCTT C CGTTGATA 
TCTTCCGT T GATATCAA 
CCGTTGAT A TCAAGAAA 
GTTGATAT C AAGAAAGT 
AGTGCTGT T AGAAATGA 
GTGCTGTT A GAAATGAG 
GAGGAAGT T TCGGATGG 
AGGAAGTT T CGGATGGG 
GGAAGTTT C GGATGGGG 
GGGCTGAT C CAGACAGC 
GCTGCGCT T CTCCTACC 
CTGCGCTT C TCCTACCT 
GCGCTTCT C CTACCTGG 
CTTCTCCT A CCTGGCTG 
GCTGTGAT C GAAGGTGC 
TGCCAAAT T CATCATGG 
GCCAAATT C ATCATGGG 
AAATTCAT C ATGGGGGA 
GGGGGACT C TTCCGTGC 
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CUGCACGG CUGAUGAG X CGAA AGAGUCCC 

CCUGCACG CUGAUGAG X CGAA AAGAGUCC 

CUUCCACU CUGAUGAG X CGAA AUCCUGCA 

UCGUGGGA CUGAUGAG X CGAA AGCUCCUU 

CUCGUGGG CUGAUGAG X CGAA AAGCUCCU 

CCUCGUGG CUGAUGAG X CGAA AAAGCUCC 

UGGGGGGA CUGAUGAG X CGAA AUGCUCGG 

GGUGGGGG CUGAUGAG X CGAA AUAUGCUC 

TOOO UGGCCGGG CUGAUGAG X CGAA AGGUGGGG 

1020 GGCUCCAG CUGAUGAG X CGAA AUUCGUUU 

1052 UUGGGAAG CUGAUGAG X CGAA ACUCCCUG 

1053 UUUGGGAA CUGAUGAG X CGAA AACUCCCU 
1055 I GAUUUGGG CUGAUGAG X CGAA AGAACUCC 

UGAUUUGG CUGAUGAG X CGAA AAGAACUC 

CCACUGGU CUGAUGAG X CGAA AUUUGGGA 

GCAGUCUU CUGAUGAG X CGAA AUCCUCCU 

UCUUCCUU CUGAUGAG X CGAA AUGGGGCA 

CGGCAUUU CUGAUGAG X CGAA AGGGGCUU 

GCGGCAUU CUGAUGAG X CGAA AAGGGGCU 

CGAUGCCG CUGAUGAG X CGAA AGGGUGCG 

AUGCUUUC CUGAUGAG X CGAA AUGCCGUA 

AGUGUCUU CUGAUGAG X CGAA ACUCAUGC 

CGACUUCU CUGAUGAG X CGAA ACUUCAGU 



GGGACTCT T CCGTGCAG 
GGACTCTT C CGTGCAGG 
TGCAGGAT C AGTGGAAG 
AAGGAGCT T TCCCACGA 
AGGAGCTT T CCCACGAG 
GGAGCTTT C CCACGAGG 
CCGAGCAT A TCCCCCCA 
GAGCATAT C CCCCCACC 
CCCCACCT C CCCGGCCA 
AAACGAAT C CTGGAGCC 
CAGGGAGT T CTTCCCAA 
AGGGAGTT C TTCCCAAA 
GGAGTTCT T CCCAAATC 
GAGTTCTT C CCAAATCA 
TCCCAAAT C ACCAGTGG 
AGGAGGAT A AAGACTGC 
TGCCCCAT C AAGGAAGA 
AAGCCCCT T AAATGCCG 
AGCCCCTT A AATGCCGC 
CGCACCCT A CGGCATCG 
TACGGCAT C GAAAGCAT 
' GCATGAGT C AAGACACT 
T AGAAGTCG 



CCGACUUC CUGAUGAG X CGAA AACUUCAG 



CTGAAGTT A GAAGTCGG 



CACGACCC CUGAUGAG X CGAA ACUUCUAA 



1189 

1194 CCCCCCAC CUGAUGAG X CGAA 

ACCUCGAA CUGAUGAG X CGAA ACUUCCCC 
1209~ GCACCUCG CUGAUGAG X CGAA AGACUUCC 
1210 I GGCACCUC CUGAUGAG X CGAA AAGAOJUC 
UGGCUGGG CUGAUGAG X CGAA AGGCAGCC 
CGGGCAGU CUGAUGAG X CGAA ACGGCUCC 

1285 I CUUCCAGU CUGAUGAG X CGAA ACUCAGUG 

1286 I GCUUCCAG CUGAUGAG X CGAA AACUCAGU 
UGACCAGG CUGAUGAG X CGAA AGGGCUUC 
UUGACCAG CUGAUGAG X CGAA AAGGGCUU 
CACAUGUU CUGAUGAG X CGAA ACCAGGAA 

1321 GAGGACCG CUGAUGAG X CGAA AGCCACGC 

1326 GCCGUGAG CUGAUGAG X CGAA ACCGUAGC 

1329 I CCGGCCGU CUGAUGAG X CGAA AGGACCGU 
GCAGAGGU CUGAUGAG X CGAA AGCGCCGG 
AGCAGAGG CUGAUGAG X CGAA AAGCGCCG 
CUGUAGCA CUGAUGAG X CGAA AGGUAAGC 
GGAACCUG CUGAUGAG X CGAA AGCAGAGG 
UGAACAGG CUGAUGAG X CGAA ACCUGUAG 
UUGAACAG CUGAUGAG X CGAA AACCUGUA 
UGCUGUUG CUGAUGAG X CGAA ACAGGAAC 
UUGCUGUU CUGAUGAG X CGAA AACAGGAA 



TTAGAAGT C GGGTCGTG 



AGTCGGGT C GTGGGGGG 
GGGGAAGT C TTCGAGGT 
GGAAGTCT T CGAGGTGC 
GAAGTCTT C GAGGTGCC 
GGCTGCCT C CCCAGCCA 
GGAGCCGT C ACTGCCCG 
CACTGAGT T ACTGGAAG 
ACTGAGTT A CTGGAAGC 
GAAGCCCT T CCTGGTCA 
AAGCCCTT C CTGGTCAA 
TTCCTGGT C AACATGTG 
GCGTGGCT A CGGTCCTC 
GCTACGGT C CTCACGGC 
ACGGTCCT C ACGGCCGG 
CCGGCGCT T ACCTCTGC 
CGGCGCTT A CCTCTGCT 
GCTTACCT C TGCTACAG 
CCTCTGCT A CAGGTTCC 
CTACAGGT T CCTGTTCA 
TACAGGTT C CTGTTCAA 
GTTCCTGT T CAACAGCA 
TTCCTGTT C AACAGCAA 
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GGUCAGGC CUGAUGAG X CGAA AUGUGUUG 
GAGUGGAG CUGAUGAG X CGAA AGGGUCAG 
GUGGAGUG CUGAUGAG X CGAA AGGAGGGU 
UGGAGGUG CUGAUGAG X CGAA AGUGGAGG 
AGUGGGUG CUGAUGAG X CGAA AGGUGGAG 
CAGAGGCG CUGAUGAG X CGAA ACAGUGGG 
UGCGGGCA CUGAUGAG X CGAA AGGCGGAC 
AUGCCUGC CUGAUGAG X CGAA AGUCGGGC 
CCCUUACC CUGAUGAG X CGAA ACCGCGGC 
GCGGCCCU CUGAUGAG X CGAA ACCUACCG 
CGGCUCUC CUGAUGAG X CGAA ACGCGGUC 
GCAGAACC CUGAUGAG X CGAA ACGUCCGU 
UAGUGCAG CUGAUGAG X CGAA ACCAACGU 
UUAGUGCA CUGAUGAG X CGAA AACCAACG 
AUGGGUUU CUGAUGAG X CGAA AGUGCAGA 
CCGGGGAA CUGAUGAG X CGAA AUGGGUUU 
AUCCGGGG CUGAUGAG X CGAA AGAUGGGU 



CAACACAT A GCCTGACC 
CTGACCCT C CTCCACTC 
ACCCTCCT C CACTCCAC 
CCTCCACT C CACCTCCA 
CTCCACCT C CACCCACT 
CCCACTGT C CGCCTCTG 
GTCCGCCT C TGCCCGCA 
GCCCGACT A GCAGGCAT 
GCCGCGGT A GGTAAGGG 
CGGTAGGT A AGGGCCGC 
GACCGCGT A GAGAGCCG 
ACGGACGT T GGTTCTGC 
ACGTTGGT T CTGCACTA 
CGTTGGTT C TGCACTAA 
TCTGCACT A AAACCCAT 
AAACCCAT C TTCCCCGG 
T CCCCGGAT 



CAUCCGGG CUGAUGAG X CGAA AAGAUGGG 



CCCATCTT C CCCGGATG 



AGGGGUGA CUGAUGAG X CGAA ACACACAU 



ATGTGTGT C TCACCCCT 



UGAGGGGU CUGAUGAG X CGAA AGACACAC 



GTGTGTCT C ACCCCTCA 



AAAAGGAU CUGAUGAG X CGAA AGGGGUGA 



TCACCCCT C ATCCTTTT 



AGUAAAAG CUGAUGAG X CGAA AUGAGGGG 



CCCCTCAT C CTTTTACT 



AAAAGUAA CUGAUGAG X CGAA AGGAUGAG 



CTCATCCT T TTACTTTT 



AAAAAGUA CUGAUGAG X CGAA AAGGAUGA 



TCATCCTT T TACTTTTT 



CAAAAAGU CUGAUGAG X CGAA AAAGGAUG 



CATCCTTT T ACTTTTTG 



GCAAAAAG CUGAUGAG X CGAA AAAAGGAU 



ATCCTTTT A CTTTTTGC 



GGGGCAAA CUGAUGAG X CGAA AGUAAAAG 



CTTTTACT T TTTGCCCC 



AGGGGCAA CUGAUGAG X CGAA AAGUAAAA 



TTTTACTT T TTGCCCCT 



AAGGGGCA CUGAUGAG X CGAA AAAGUAAA 



TTTACTTT T TGCCCCTT 



GAAGGGGC CUGAUGAG X CGAA AAAAGUAA 



TTACTTTT T GCCCCTTC 



CAAAGUGG CUGAUGAG X CGAA AGGGGCAA 



TTGCCCCT T CCACTTTG 



UCAAAGUG CUGAUGAG X CGAA AAGGGGCA 



TGCCCCTT C CACTTTGA 



GGUACUCA CUGAUGAG X CGAA AGUGGAAG 



CTTCCACT T TGAGTACC 



UGGUACUC CUGAUGAG X CGAA AAGUGGAA 



TTCCACTT T GAGTACCA 



GGAUUUGG CUGAUGAG X CGAA 

GGCUUGUG CUGAUGAG X CGAA AUUUGGUA 

CCUCAAAA CUGAUGAG X CGAA AUGGCUUG 

UCCUCAAA CUGAUGAG X CGAA AAUGGCUU 

CUCCUCAA CUGAUGAG X CGAA AAAUGGCU 

UCUCCUCA CUGAUGAG X CGAA AAAAUGGC 

CUCUCCUC CUGAUGAG X CGAA AAAAAUGG 

CAGCAUGG CUGAUGAG X CGAA ACUCUCUU 

GACGGGUG CUGAUGAG X CGAA AGGCCCCU 

AGCCCCAA CUGAUGAG X CGAA ACGGGUGU 

CGAGCCCC CUGAUGAG X CGAA AGACGGGU 

GGUGGGGC CUGAUGAG X CGAA AGCCCCAA 

CCAGGAGG CUGAUGAG X CGAA AGCCCUGG 



CTTTGAGT A CCAAATCC 
TACCAAAT C CACAAGCC 
CAAGCCAT T TTTTGAGG 
AAGCCATT T TTTGAGGA 
AGCCATTT T TTGAGGAG 
GCCATTTT T TGAGGAGA 
CCATTTTT T GAGGAGAG 
AAGAGAGT A CCATGCTG 
AGGGGCCT A CACCCGTC 
ACACCCGT C TTGGGGCT 
ACCCGTCT T GGGGCTCG 
TTGGGGCT C GCCCCACC 
CCAGGGCT C CCTCCTGG 
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UGCUCCAG CUGAUGAG X CGAA AGGGAGCC 
CCGCCUGG CUGAUGAG X CGAA AUGCUCCA 
GCAGAUUC CUGAUGAG X CGAA AGGGGGGG 
UCCCUGCA CUGAUGAG X CGAA AUUCAAGG 
GGAGUGGA CUGAUGAG X CGAA AGUUGCUC 
AUGGAGUG CUGAUGAG X CGAA AGAGUUGC 
UAAAUAUG CUGAUGAG X CGAA AGUGGAGA 
UAAAUAAA CUGAUGAG X CGAA AUGGAGUG 
UUUAAAUA CUGAUGAG X CGAA AUAUGGAG 
GUUUAAAU CUGAUGAG X CGAA AAUAUGGA 
UGUUUAAA CUGAUGAG X CGAA AAAUAUGG 
AUUGUUUA CUGAUGAG X CGAA AUAAAUAU 
AAUUGUUU CUGAUGAG X CGAA AAUAAAUA 
AAAUUGUU CUGAUGAG X CGAA AAAUAAAU 
GGGGAAAA CUGAUGAG X CGAA AUUGUUUA 
UGGGGAAA CUGAUGAG X CGAA AAUUGUUU 
UUGGGGAA CUGAUGAG X CGAA AAAUUGUU 
UUUGGGGA CUGAUGAG X CGAA AAAAUUGU 
CUUUGGGG CUGAUGAG X CGAA AAAAAUUG 
CCUUUGGG CUGAUGAG X CGAA AAAAAAUU 
GCACUAUG CUGAUGAG X CGAA AUGCCUUU 
UAGUGCAC CUGAUGAG X CGAA AUGGAUGC 
GAAAAUGC CUGAUGAG X CGAA AGUGCACU 
UUCAAGAA CUGAUGAG X CGAA AUGCUAGU 
GUUCAAGA CUGAUGAG X CGAA AAUGCUAG 
GGUUCAAG CUGAUGAG X CGAA AAAUGCUA 
UGGUUCAA CUGAUGAG X CGAA AAAAUGCU 
AUUGGUUC CUGAUGAG X CGAA AGAAAAUG 
UAAUACAU CUGAUGAG X CGAA AUUGGUUC 
AAUUUUAA CUGAUGAG X CGAA ACAUUAUU 
AAAAUUUU CUGAUGAG X CGAA AUACAUUA 
AAAAAUUU CUGAUGAG X CGAA AAUACAUU 
CAUCAAAA CUGAUGAG X CGAA AUUUUAAU 
ACAUCAAA CUGAUGAG X CGAA AAUUUUAA 
GACAUCAA CUGAUGAG X CGAA AAAUUUUA 
UGACAUCA CUGAUGAG X CGAA AAAAUUUU 
CUGACAUC CUGAUGAG X CGAA AAAAAUUU 
GCAAGGCU CUGAUGAG X CGAA ACAUCAAA 
CUUGAUGC CUGAUGAG X CGAA AGGCUGAC 
AAGCCCUU CUGAUGAG X CGAA AUGCAAGG 
UUUUGAUA CUGAUGAG X CGAA AGCCCUUG 
UUUUUGAU CUGAUGAG X CGAA AAGCCCUU 
CUUUUUGA CUGAUGAG X CGAA AAAGCCCU 
UACUUUUU CUGAUGAG X CGAA AUAAAGCC 
UAUUAUUG CUGAUGAG X CGAA ACUUUUUG 
GGAUUUAU CUGAUGAG X CGAA AUUGUACU 
UGAGGAUU CUGAUGAG X CGAA AUUAUUGU 



GGCTCCCT C CTGGAGCA 
TGGAGCAT C CCAGGCGG 
CCCCCCCT T GAATCTGC 
CCTTGAAT C TGCAGGGA 
GAGCAACT C TCCACTCC 
GCAACTCT C CACTCCAT 
TCTCCACT C CATATTTA 
CACTCCAT A TTTATTTA 
CTCCATAT T TATTTAAA 
TCCATATT T ATTTAAAC 
CCATATTT A TTTAAACA 
ATATTTAT T TAAACAAT 
TATTTATT T AAACAATT 
ATTTATTT A AACAATTT 
TAAACAAT T TTTTCCCC 
AAACAATT T TTTCCCCA 
AACAATTT T TTCCCCAA 
ACAATTTT T TCCCCAAA 
CAATTTTT T CCCCAAAG 
AATTTTTT C CCCAAAGG 
AAAGGCAT C CATAGTGC 
GCATCCAT A GTGCACTA 
AGTGCACT A GCATTTTC 
ACTAGCAT T TTCTTGAA 
CTAGCATT T TCTTGAAC 
TAGCATTT T CTTGAACC 
AGCATTTT C TTGAACCA 
CATTTTCT T GAACCAAT 
GAACCAAT A ATGTATTA 
AATAATGT A TTAAAATT 
TAATGTAT T AAAATTTT 
AATGTATT A AAATTTTT 
ATTAAAAT T TTTTGATG 
TTAAAATT T TTTGATGT 
TAAAATTT T TTGATGTC 
AAAATTTT T TGATGTCA 
AAATTTTT T GATGTCAG 
TTTGATGT C AGCCTTGC 
GTCAGCCT T GCATCAAG 
CCTTGCAT C AAGGGCTT 
CAAGGGCT T TATCAAAA 
AAGGGCTT T ATCAAAAA 
AGGGCTTT A TCAAAAAG 
GGCTTTAT C AAAAAGTA 
CAAAAAGT A CAATAATA 
AGTACAAT A ATAAATCC 
ACAATAAT A AATCCTCA 
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UACCUGAG CUGAUGAG X CGAA AUUUAUUA 
UACUACCU CUGAUGAG X CGAA 



TAATAAAT C CTCAGGTA 
TAAATCCT C AGGTAGTA 



CCCAGUAC CUGAUGAG X CGAA ACCUGAGG 



CCTCAGGT A GTACTC 



AUUCCCAG CUGAUGAG X CGAA ACUACCUG 



CAGGTAGT A CTGGGAAT 



CCAUGGCA CUGAUGAG X CGAA AGCCUUCC 



GGAAGGCT T TGCCATGG 



CCCAUGGC CUGAUGAG X CGAA AAGCCUUC 



GAAGGCTT T GCCATGGG 



ACUGGUCU CUGAUGAG X CGAA ACGCAGCA 



TGCTGCGT C AGACCAGT 



CUUCCCAG CUGAUGAG X CGAA ACUGGUCU 



AGACCAGT A CTGGGAAG 



CUGCUUAC CUGAUGAG X CGAA ACCGUCCU 



AGGACGGT T GTAAGCAG 



CAACUGCU CUGAUGAG X CGAA ACAACCGU 



ACGGTTGT A AGCAGTTG 



UAAAUAAC CUGAUGAG X CGAA ACUGCUUA 



TAAGCAGT T GTTATTTA 



CACUAAAU CUGAUGAG X CGAA ACAACUGC 



GCAGTTGT T ATTTAGTG 



UCACUAAA CUGAUGAG X CGAA AACAACUG 



CAGTTGTT A 'i'TTAGTGA 



UAUCACUA CUGAUGAG X CGAA AUAACAAC 



GTTGTTAT T TAGTGATA 



AUAUCACU CUGAUGAG X ( 



i AAUAACAA 



TTGTTATT T AGTGATAT 



AAUAUCAC CUGAUGAG X CGAA AAAUAACA 



TGTTATTT A GTGATATT 



ACCCACAA CUGAUGAG X CGAA AUCACUAA 



TTAGTGAT A TTGTGGGT 



UUACCCAC CUGAUGAG X CGAA AUAUCACU 



AGTGATAT T GTGGGTAA 



UCUCACGU CUGAUGAG X CGAA ACCCACAA 



TTGTGGGT A ACGTGAGA 



CAUUGUUC CUGAUGAG X CGAA AUCUUCUC 



GAGAAGAT A GAACAATG 



AUAUAUUA CUGAUGAG X CGAA AGCAUUGU 



ACAATGCT A TAATATAT 



UUAUAUAU CUGAUGAG X CGAA AUAGCAUU 



AATGCTAT A ATATATAA 



UCAUUAUA cug; 



I X CGAA AUUAUAGC 



GCTATAAT A TATAATGA 



3AUGAG X CGAA AUAUUAUA 
GUGUUCAU CUGAUGAG X CGAA AUAUAUUA 
UUAUUAAA CUGAUGAG X CGAA ACCCACGU 
UCUUAUUA CUGAUGAG X CGAA AUACCCAC 
UUCUUAUU CUGAUGAG X CGAA AAUACCCA 
UUUCUUAU CUGAUGAG X CGAA AAAUACCC 
AUGUUUCU CUGAUGAG X CGAA AUUAAAUA 
GACAAAGU CUGAUGAG X CGAA AUCUCACA 
GGACAAAG CUGAUGAG X CGAA AAUCUCAC 
GCGGGACA CUGAUGAG X CGAA AGUAAUCU 
AGCGGGAC CUGAUGAG X CGAA AAGUAAUC 
AUAAGCGG CUGAUGAG X CGAA ACAAAGUA 
AGCAGAAU CUGAUGAG X CGAA AGCGGGAC 
GAGCAGAA CUGAUGAG X CGAA AAGCGGGA 
GGGAGCAG CUGAUGAG X CGAA AUAAGCGG 
AGGGAGCA CUGAUGAG X CGAA AAUAAGCG 
AUAACAGG CUGAUGAG X CGAA AGCAGAAU 
UAGCAGAU CUGAUGAG X CGAA ACAGGGAG 
CUAGCAGA CUGAUGAG X CGAA AACAGGGA 
AUCUAGCA CUGAUGAG X CGAA AUAACAGG 
ACUAGAUC CUGAUGAG X CGAA AGCAGAUA 
GAGAACUA CUGAUGAG X CGAA AUCUAGCA 
UUGAGAAC CUGAUGAG X CGAA AGAUCUAG 
UGAUUGAG CUGAUGAG X CGAA ACUAGAUC 



TATAATAT A TAATGAAC 
TAATATAT A ATGAACAC 
ACGTGGGT A TTTAATAA 
GTGGGTAT T TAATAAGA 
TGGGTATT T AATAAGAA 
GGGTATTT A ATAAGAAA 
TATTTAAT A AGAAACAT 
TGTGAGAT T ACTTTGTC 
GTGAGATT A CTTTGTCC 
AGATTACT T TGTCCCGC 
GATTACTT T GTCCCGCT 
TACTTTGT C CCGCTTAT 
GTCCCGCT T ATTCTGCT 
TCCCGCTT A TTCTGCTC 
CCGCTTAT T CTGCTCCC 
CGCTTATT C TGCTCCCT 
ATTCTGCT C CCTGTTAT 
CTCCCTGT T ATCTGCTA 
TCCCTGTT A TCTGCTAG 
CCTGTTAT C TGCTAGAT 
TATCTGCT A GATCTAGT 
TGCTAGAT C TAGTTCTC 
CTAGATCT A GTTCTCAA 
GATCTAGT T CTCAATCA 
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GUGAUUGA CUGAUGAG X CGAA AACUAGAU 



ATCTAGTT C TCAATCAC 



CAGUGAUU CUGAUGAG X CGAA AGAACUAG 



CTAGTTCT C AATCACTG 



GGAGCAGU CUGAUGAG X CGAA AUUGAGAA 



TTCTCAAT C ACTGCTCC 



ACACGGGG CUGAUGAG X CGAA AGCAGUGA 



TCACTGCT C CCCCGTGT 



CAUUCUAA CUGAUGAG X CGAA ACACACGG 



CCGTGTGT A TTAGAATG 



UGCAUUCU CUGAUGAG X CGAA AUACACAC 



GTGTGTAT T AGAATGCA 



AUGCAUUC CUGAUGAG X CGAA AAUACACA 



TGTGTATT A GAATGCAT 



GAAGACCU CUGAUGAG X CGAA ACAUGCAU 



ATGCATGT A AGGTCTTC 



CACAAGAA CUGAUGAG X CGAA ACCUUACA 



TGTAAGGT C TTCTTGTG 



GACACAAG CUGAUGAG X CGAA AGACCUUA 



TAAGGTCT T CTTGTGTC 



GGACACAA CUGAUGAG X CGAA AAGACCUU 



AAGGTCTT C TTGTGTCC 



CAGGACAC CUGAUGAG X CGAA AGAAGACC 



GGTCTTCT T GTGTCCTG 



UUCAUCAG CUGAUGAG X CGAA ACACAAGA 



TCTTGTGT C CTGATGAA 



CAAGCACA CUGAUGAG X CGAA AUUUUUCA 



TGAAAAAT A TGTGCTTG 



CUCAUUUC CUGAUGAG X CGAA AGCACAUA 



TATGTGCT T GAAATGAG 



AGAGAUCA CUGAUGAG X CGAA AGUUUCUC 



GAGAAACT T TGATCTCT 



CAGAGAUC CUGAUGAG X CGAA AAGUUUCU 



AGAAACTT T GATCTCTG 



UAAGCAGA CUGAUGAG X CGAA AUCAAAGU 



ACTTTGAT C TCTGCTTA 



AGUAAGCA CUGAUGAG X CGAA AGAUCAAA 



TTTGATCT C TGCTTACT 



ACAUUAGU CUGAUGAG X CGAA AGCAGAGA 



TCTCTGCT T ACTAATGT 



CACAUUAG CUGAUGAG X CGAA AAGCAGAG 



CTCTGCTT A CTAATGTG 



GGGCACAU CUGAUGAG X CGAA AGUAAGCA 



TGCTTACT A ATGTGCCC 



UGGACUUG CUGAUGAG X CGAA ACAUGGGG 



CCCCATGT C CAAGTCCA 



GCAGGUUG CUGAUGAG X CGAA ACUUGGAC 



GTCCAAGT C CAACCTGC 



CAUGUAAU CUGAUGAG X CGAA AUCAGGUC 



GACCTGAT C ATTACATG 



AGCCAUGU CUGAUGAG X CGAA AUGAUCAG 



CTGATCAT T ACATGGCT 



CAGCCAUG CUGAUGAG X CGAA AAUGAUCA 



TGATCATT A CATGGCTG 



GGCUUAGG CUGAUGAG X CGAA ACCACAGC 



GCTGTGGT T CCTAAGCC 



AGGCUUAG CUGAUGAG X CGAA AACCACAG 



CTGTGGTT C CTAAGCCT 



AACAGGCU CUGAUGAG X CGAA AGGAACCA 



TGGTTCCT A AGCCTGTT 



ACUUCAGC CUGAUGAG X CGAA ACAGGCUU 



AAGCCTGT T GCTGAAGT 



GCGACAAU CUGAUGAG X CGAA ACUUCAGC 



GCTGAAGT C ATTGTCGC 



UGAGCGAC CUGAUGAG X CGAA AUGACUUC 



GAAGTCAT T GTCGCTCA 



UGCUGAGC CUGAUGAG X CGAA ACAAUGAC 



GTCATTGT C GCTCAGCA 



CUAUUGCU CUGAUGAG X CGAA AGCGACAA 



TTGTCGCT C AGCAATAG 



CUGCACCC CUGAUGAG X C 



TCAGCAAT A GGGTGCAG 



UCCUGGAA CUGAUGAG X CGAA ACUGCACC 



GGTGCAGT T TTCCAGGA 



UUCCUGGA CUGAUGAG X CGAA AACUGCAC 



GTGCAGTT T TCCAGGAA 



AUUCCUGG CUGAUGAG X CGAA AAACUGCA 



TGCAGTTT T CCAGGAAT 



UAUUCCUG CUGAUGAG X CGAA AAAACUGC 



GCAGTTTT C CAGGAATA 



CAAAUGCC CUGAUGAG X CGAA AUUCCUGG 



CCAGGAAT A GGCATTTG 



AUUAGGCA CUGAUGAG X CGAA AUGCCUAU 
AAUUAGGC CUGAUGAG X CGAA AAUGCCUA 



ATAGGCAT T TGCCTAAT 



TAGGCATT T GCCTAATT 



CCAGGAAU CUGAUGAG X CGAA AGGCAAAU 



ATTTGCCT A ATTCCTGG 



AUGCCAGG CUGAUGAG X CGAA AUUAGGCA 



TGCCTAAT T CCTGGCAT 



CAUGCCAG CUGAUGAG X CGAA AAUUAGGC 



GCCTAATT C CTGGCATG 



AGUCACUA CUGAUGAG X CGAA AGUGUCAU 



ATGACACT C TAGTGACT 
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2557 
2563 
2570 
2572 



2680 
2681 



GAAGUCAC CUGAUGAG X CGAA AGAGUGUC 
UCACCAGG CUGAUGAG X CGAA AGUCACUA 
CUCACCAG CUGAUGAG X CGAA AAGUCACU 
UGUACCAG CUGAUGAG X CGAA ACAGGCUG 
CCCUGCUG CUGAUGAG X CGAA ACCAGGAC 
UACAGCAA CUGAUGAG X CGAA ACCCUGCU 
GUUACAGC CUGAUGAG X CGAA AGACCCUG 
GUCUGAGU CUGAUGAG X CGAA ACAGCAAG 
GAAUGUCU CUGAUGAG X CGAA AGUUACAG 
ACCCUUGG CUGAUGAG X CGAA AUGUCUGA 
UACCCUUG CUGAUGAG X CGAA AAUGUCUG 
GCUUCCCA CUGAUGAG X CGAA ACCCUUGG 
GGUGUGAA CUGAUGAG X CGAA AUGGCUUC 
GAGGUGUG CUGAUGAG X CGAA AUAUGGCU 
UGAGGUGU CUGAUGAG X CGAA AAUAUGGC 
CAGAGCGU CUGAUGAG X CGAA AGGUGUGA 
CAUGUCCA CUGAUGAG X CGAA AGCGUGAG 
CUUCCCUA CUGAUGAG X CGAA AUCAUGUC 
GCUUCCCU CUGAUGAG X CGAA AAUCAUGU 
UGCUUCCC CUGAUGAG X CGAA AAAUCAUG 
UGAUCCCA CUGAUGAG X CGAA AGGUGGGG 
CUGAUCCC CUGAUGAG X CGAA AAGGUGGG 
CGGAGGCU CUGAUGAG X CGAA AUCCCAAA 
GAAUGGCG CUGAUGAG X CGAA AGGCUGAU 
CGACUUGG CUGAUGAG X CGAA AUGGCGGA 
UCGACUUG CUGAUGAG X CGAA AAUGGCGG 
AGAGUGUC CUGAUGAG X CGAA ACUUGGAA 
CUCAAGAA CUGAUGAG X CGAA AGUGUCGA 
UGCUCAAG CUGAUGAG X CGAA AGAGUGUC 
CUGCUCAA CUGAUGAG X CGAA AAGAGUGU 
GUCUGCUC CUGAUGAG X CGAA AGAAGAGU 
CUCUUCCA CUGAUGAG X CGAA AUCACGGU 
UCUCUUCC CUGAUGAG X CGAA AAUCACGG ' 
GUUUCAAG CUGAUGAG X CGAA AGUGUGGU 
UGUUUCAA CUGAUGAG X CGAA AAGUGUGG 
GCUGUUUC CUGAUGAG X CGAA AGAAGUGU 
GCCUAAAG CUGAUGAG X CGAA ACCGUCAC 
GCUGCCUA CUGAUGAG X CGAA AGGACCGU 
GGCUGCCU CUGAUGAG X CGAA AAGGACCG 
AGGCUGCC CUGAUGAG X CGAA AAAGGACC 
GGGACAGA CUGAUGAG X CGAA ACGGCGGC 
CCGGGACA CUGAUGAG X CGAA AGACGGCG 
UGAACCGG CUGAUGAG X CGAA ACAGAGAC 
GCAAGGUG CUGAUGAG X CGAA ACCGGGAC 
GGCAAGGU CUGAUGAG X CGAA AACCGGGA 
CUCUCGGC CUGAUGAG X CGAA AGGUGAAC 
GUGGGGCA CUGAUGAG X CGAA ACGCGCCU 



GACACTCT A GTGACTTC 
TAGTGACT T CCTGGTGA 
AGTGACTT C CTGGTGAG 
CAGCCTGT C CTGGTACA 
GTCCTGGT A CAGCAGGG 
AGCAGGGT C TTGCTGTA 
CAGGGTCT T GCTGTAAC 
CTTGCTGT A ACTCAGAC 
CTGTAACT C AGACATTC 
TCAGACAT T CCAAGGGT 
CAGACATT C CAAGGGTA 
CCAAGGGT A TGGGAAGC 
GAAGCCAT A TTCACACC 
AGCCATAT T CACACCTC 
GCCATATT C ACACCTCA 
TCACACCT C ACGCTCTG 
CTCACGCT C TGGACATG 
GACATGAT T TAGGGAAG 
ACATGATT T AGGGAAGC 
CATGATTT A GGGAAGCA 
CCCCACCT T TGGGATCA 
CCCACCTT T GGGATCAG 
TTTGGGAT C AGCCTCCG 
ATCAGCCT C CGCCATTC 
TCCGCCAT T CCAAGTCG 
CCGCCATT C CAAGTCGA 



TCGACACT C TTCTTGAG 
GACACTCT T CTTGAGCA 
ACACTCTT C TTGAGCAG 
ACTCTTCT T GAGCAGAC 
ACCGTGAT T TGGAAGAG 
CCGTGATT T GGAAGAGA 
ACCACACT T CTTGAAAC 
CCACACTT C TTGAAACA 
ACACTTCT T GAAACAGC 



C CTTTAGGC 
ACGGTCCT T TAGGCAGC 
CGGTCCTT T AGGCAGCC 
GGTCCTTT A GGCAGCCT 
GCCGCCGT C TCTGTCCC 
CGCCGTCT C TGTCCCGG 
GTCTCTGT C CCGGTTCA 
GTCCCGGT T CACCTTGC 
TCCCGGTT C ACCTTGCC 
GTTCACCT T GCCGAGAG 
AGGCGCGT C TGCCCCAC 
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CAGGGUUU CUGAUGAG X CGAA AGGGUGGG 



CCCACCCT C AAACCCTG 



AGAGUCGU CUGAUGAG X CGAA AGCACCAU 



ATGGTGCT C ACGACTCT 



UGCAGGAA CUGAUGAG X CGAA AGUCGUGA 



TCACGACT C TTCCTGCA 



UUUGCAGG CUGAUGAG X CGAA AGAGUCGU 



ACGACTCT T CCTGCAAA 



CUUUGCAG CUGAUGAG X CGAA AAGAGUCG 



CGACTCTT C CTGCAAAG 



UUAAUGUG CUGAUGAG X CGAA AGGUCUUC 



GAAGACCT C CACATTAA 



AGCCACUU CUGAUGAG X CGAA AUGUGGAG 



CTCCACAT T AAGTGGCT 



AAGCCACU CUGAUGAG X CGAA AAUGUGGA 



TCCACATT A AGTGGCTT 



AUGUUAAA CUGAUGAG X CGAA AGCCACUU 



AAGTGGCT T TTTAACAT 



CAUGUUAA CUGAUGAG X CGAA AAGCCACU 



AGTGGCTT T TTAACATG 



UCAUGUUA CUGAUGAG X CGAA AAAGCCAC 



GTGGCTTT T TAACATGA 



UUCAUGUU CUGAl 



I AAAAGCCA 



TGGCTTTT T AACATGAA 



UUUCAUGU CUGAUGAG X CGAA AAAAAGCC 



GGCTTTTT A ACATGAAA 



UCGGGAGC CUGAUGAG X CGAA ACAGCUGC 



GCAGCTGT A GCTCCCGA 



UAGCUCGG CUGAUGAG X CGAA AGCUACAG 



CTGTAGCT C CCGAGCTA 



CAAGAGAG CUGAUGAG X CGAA AGCUCGGG 



CCCGAGCT A CTCTCTTG 



UGGCAAGA CUGAUGAG X CGAA AGUAGCUC 



GAGCTACT C TCTTGCCA 



GCUGGCAA CUGAUGAG X CGAA AGAGUAGC 



GCTACTCT C TTGCCAGC 



AUGCUGGC CUGAUGAG X CGAA AGAGAGUA 



TACTCTCT T GCCAGCAT 



AAUGUGAA CUGAUGAG X CGAA AUGCUGGC 



GCCAGCAT T TTCACATT 



AAAUGUGA CUGAUGAG X CGAA AAUGCUGG 



CCAGCATT T TCACATTT 



AAAAUGUG CUGAUGAG X CGAA AAAUGCUG 



CAGCATTT T CACATTTT 



CAAAAUGU CUGAUGAG X CGAA AAAAUGCU 



AGCATTTT C ACATTTTG 



AAAGGCAA CUGAUGAG X CGAA AUGUGAAA 



TTTCACAT T TTGCCTTT 



GAAAGGCA CUGAUGAG X CGAA AAUGUGAA 



TTCACATT T TGCCTTTC 



AGAAAGGC CUGAUGAG X CGAA AAAUGUGA 



TCACATTT T GCCTTTCT 



CCACGAGA CUGAUGAG X CGAA AGGCAAAA 



TTTTGCCT T TCTCGTGG 



ACCACGAG CUGAUGAG X CGAA AAGGCAAA 



TTTGCCTT T CTCGTGGT 



UACCACGA CUGAUGAG X CGAA AAAGGCAA 



TTGCCTTT C TCGTGGTA 



UCUACCAC CUGAUGAG X CGAA AGAAAGGC 



GCCTTTCT C GTGGTAGA 



CUGGCUUC CUGAUGAG X CGAA ACCACGAG 



CTCGTGGT A GAAGCCAG 



UUUCUCUG CUGAUGAG X CGAA ACUGGCUU 



AAGCCAGT A CAGAGAAA 



CACCACAG CUGAUGAG X CGAA AUUUCUCU 



AGAGAAAT T CTGTGGTG 



CCACCACA CUGAUGAG X CGAA AAUUUCUC 



GAGAAATT C TGTGGTGG 



ACACCUCG CUGAUGAG X CGAA AUGUUCCC 



GGGAACAT T CGAGGTGT 



GACACCUC CUGAUGAG X CGAA AAUGUUCC 



GGAACATT C GAGGTGTC 



UGCAGGGU CUGAUGAG X CGAA ACACCUCG 



CGAGGTGT C ACCCTGCA 



CCUCACCA CUGAUGAG X CGAA AGCUCUGC 



GCAGAGCT A TGGTGAGG 



CUAAGCCU CUGAUGAG X CGAA AUCCACAC 



GTGTGGAT A AGGCTTAG 



UGGCACCU CUGAUGAG X CGAA AGCCUUAU 



ATAAGGCT T AGGTGCCA 



CUGGCACC CUGAUGAG X CGAA AAGCCUUA 



TAAGGCTT A GGTGCCAG 



AGAAUGCU CUGAUGAG X CGAA ACAGCCUG 



CAGGCTGT A AGCATTCT 



CAGCUCAG CUGAUGAG X CGAA AUGCUUAC 



GTAAGCAT T CTGAGCTG 



CCAGCUCA CUGAUGAG X CGAA AAUGCUUA 



TAAGCATT C TGAGCTGG 



AAAACAAC CUGAUGAG X CGAA AGCCAGCU 



AGCTGGCT T GTTGTTTT 



UUAAAAAC CUGAUGAG X CGAA ACAAGCCA 



TGGCTTGT T GTTTTTAA 



GACUUAAA CUGAUGAG X CGAA ACAACAAG 



CTTGTTGT T TTTAAGTC 
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GGACUUAA CUGAUGAG X CGAA AACAACAA 



AGGACUUA CUGAUGAG X C 



I AAACAACA 



3125 
3136 
3141 
3145 



3159 
3160 

3161 



CAGGACUU CUGAUGAG X CGAA AAAACAAC 
ACAGGACU CUGAUGAG X CGAA AAAAACAA 
AUAUACAG CUGAUGAG X CGAA ACUUAAAA 
CAUACAUA CUGAUGAG X CGAA ACAGGACU 
UACAUACA CUGAUGAG X CGAA AUACAGGA 
CUACUACA CUGAUGAG X CGAA ACAUAUAC 
CAAACUAC CUGAUGAG X CGAA ACAUACAU 
ACCCAAAC CUGAUGAG X CGAA ACUACAUA 
CACACCCA CUGAUGAG X CGAA ACUACUAC 
ACACACCC CUGAUGAG X CGAA AACUACUA 
ACUAUAUA CUGAUGAG X CGAA ACACACCC 
CUACUAUA CUGAUGAG X CGAA AUACACAC 
UGCUACUA CUGAUGAG X CGAA AUAUACAC 
AAUGCUAC CUGAUGAG X CGAA AUAUAUAC 
UGAAAUGC CUGAUGAG X CGAA ACUAUAUA 
CAUUUUGA CUGAUGAG X CGAA AUGCUACU 
CCAUUUUG CUGAUGAG X CGAA AAUGCUAC 
UCCAUUUU CUGAUGAG X CGAA AAAUGCUA 
UAAACCAG CUGAUGAG X CGAA ACGUCCAU 
GGAGGUUA CUGAUGAG X CGAA ACCAGUAC 
AGGAGGUU CUGAUGAG X CGAA AACCAGUA 
UAGGAGGU CUGAUGAG X CGAA AAACCAGU 
AAGGAUAG CUGAUGAG X CGAA AGGUUAAA 
UCCAAGGA CUGAUGAG X CGAA AGGAGGUU 
UCUCCAAG CUGAUGAG X CGAA AUAGGAGG 
UGCUCUCC CUGAUGAG X CGAA AGGAUAGG 
AAGGUGGA CUGAUGAG X CGAA AGCCAGCU 
ACAAGGUG CUGAUGAG X CGAA AGAGCCAG 
UGUGUAAC CUGAUGAG X CGAA AGGUG3AG 
UAAUGUGU CUGAUGAG X CGAA ACAAGGUG 
AUAAUGUG CUGAUGAG X CGAA AACAAGGU 
UCUAACAU CUGAUGAG X CGAA AUGUGUAA 
CUCUAACA CUGAUGAG X CGAA AAUGUGUA 
ACCUCUCU CUGAUGAG X CGAA ACAUAAUG 
UACCUCUC CUGAUGAG X CGAA AACAUAAU 
CAGCUCGC CUGAUGAG X CGAA ACCUCUCU 
ACAUAGCA CUGAUGAG X CGAA AGCAGCUC 
UAAGGACA CUGAUGAG X CGAA AGCAGAGC 
GGCUUAAG CUGAUGAG X CGAA ACAUAGCA 
AUUGGCUU CUGAUGAG X CGAA AGGACAUA 
UAUUGGCU CUGAUGAG X CGAA AAGGACAU 
UGAGUAAA CUGAUGAG X CGAA AUUGGCUU 
GAUGAGUA CUGAUGAG X CGAA AUAUUGGC 
UGAUGAGU CUGAUGAG X CGAA AAUAUUGG 
CUGAUGAG CUGAUGAG X CGAA AAAUAUUG 



TTGTTGTT T TTAAGTCC 
TGTTGTTT T TAAGTCCT 
GTTGTTTT T AAGTCCTG 
TTGTTTTT A AGTCCTGT 
TTTTAAGT C CTGTATAT 
AGTCCTGT A TATGTATG 
TCCTGTAT A TGTATGTA 
GTATATGT A TGTAGTAG 
ATGTATGT A GTAGTTTG 
TATGTAGT A GTTTGGGT 
GTAGTAGT T TGGGTGTG 
TAGTAGTT T GGGTGTGT 
GGGTGTGT A TATATAGT 
GTGTGTAT A TATAGTAG 
GTGTATAT A TAGTAGCA 
GTATATAT A GTAGCATT 



A GCATTTCA 
AGTAGCAT T TCAAAATG 
GTAGCATT T CAAAATGG 
TAGCATTT C AAAATGGA 
ATGGACGT A CTGGTTTA 
GTACTGGT T TAACCTCC 
TACTGGTT T AACCTCCT 
ACTGGTTT A ACCTCCTA 
TTTAACCT C CTATCCTT 
AACCTCCT A TCCTTGGA 
CCTCCTAT C CTTGGAGA 
CCTATCCT T GGAGAGCA 
AGCTGGCT C TCCACCTT 
CTGGCTCT C CACCTTGT 
CTCCACCT T GTTACACA 
CACCTTGT T ACACATTA 
ACCTTGTT A CACATTAT 
TTACACAT T ATGTTAGA 
TACACATT A TGTTAGAG 
CATTATGT T AGAGAGGT 
ATTATGTT A GAGAGGTA 
AGAGAGGT A GCGAGCTG 
GAGCTGCT C TGCTATGT 
GCTCTGCT A TGTCCTTA 
TGCTATGT C CTTAAGCC 
TATGTCCT T AAGCCAAT 
ATGTCCTT A AGCCAATA 
AAGCCAAT A TTTACTCA 
GCCAATAT T TACTCATC 
CCAATATT T ACTCATCA 
CAATATTT A CTCATCAG 
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Table 3 



3164 


GACCUGAU CUGAUGAG X CGAA AGUAAAUA 


515 


TATTTACT C ATCAGGTC 


1043 




3167 


AAUGACCU CUGAUGAG X CGAA AUGAGUAA 


516 


TTACTCAT C AGGTCATT 


1044 


3172 


AAAAUAAU CUGAUGAG X CGAA ACCUGAUG 


517 


CATCAGGT C ATTATTTT 


1045 


3175 


UAAAAAAU CUGAUGAG X CGAA AUGACCUG 


518 


CAGGTCAT T ATTTTTTA 


1046 


3176 


GUAAAAAA CUGAUGAG X CGAA AAUGACCU 


519 


AGGTCATT A TTTTTTAC 


1047 


3173 


UUGUAAAA CUGAUGAG X CGAA AUAAUGAC 


520 


GTCATTAT T TTTTACAA 


1048 


3173 


AUUGUAAA CUGAUGAG X CGAA AAUAAUGA 


521 


TCATTATT T TTTACAAT 


1049 


3180 


CAUUGUAA CUGAUGAG X CGAA AAAUAAUG 


522 


CATTATTT T TTACAATG 


1050 


3181 


CCAUUGUA CUGAUGAG X CGAA AAAAUAAU 


523 


ATTATTTT T TACAATGG 


1051 


3182 


GCCAUUGU CUGAUGAG X CGAA AAAAAUAA 


524 


TTATTTTT T ACAATGGC 


1052 


3183 


GGCCAUUG CUGAUGAG X CGAA AAAAAAUA 


525 


TATTTTTT A CAATGGCC 


1053 


3199 


AAAUGGUU CUGAUGAG X CGAA AUUCCAUG 


52S 


CATGGAAT A AACCATTT 


1054 


3205 


UITUGUAAA CUGAUGAG X CGAA AUGGUUUA 


527 


TAAACCAT T TTTACAAA 


1055 


3207 


UUUUGUAA CUGAUGAG X CGAA AAUGGUUU 


528 


AAACCATT T TTACAAAA 


1056 






Input Sequence = PTPN1 (Homo sapiens protein tyrosine phosphatase, non-receptor type 1 (PTPN1 
3215 bp) 


Cut Site = UH. i 
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Table 4 



Table 4: Human PTP-1B NCH Ribozyme and Target Sequence 



Nt. 
Position 


Ribozyme Sequence 


Seq. ID 


Substrate Sequence 


Seq. ID 




CCGCUCUA CUGAUGAG X CGAA ICCGCGUC 


1057 


GACGCGGC C TAGAGCGG 


1781 


14 


GCCGCUCU CUGAUGAG X CGAA IGCCGCGU 


1058 


ACGCGGCC T AGAGCGGC 


1782 




GCGCCGUC CUGAUGAG X CGAA ICCGCUCU 


1059 


AGAGCGGC A GACGGCGC 


1783 


— — 


CGGCCCAC CUGAUGAG X CGAA ICGCCGUC 


1060 


GACGGCGC A GTGGGCCG 




39 


UCCUUCUC CUGAUGAG X CGAA ICCCACUG 


1061 


CAGTGGGC C GAGAAGGA 


1785 


53 


GCGGCUGC CUGAUGAG X CGAA ICGCCUCC 




GGAGGCGC A GCAGCCGC 


1786 


56 


AGGGCGGC CUGAUGAG X CGAA ICUGCGCC 


To75 


GGCGCAGC A GCCGCCCT 




59 


GCCAGGGC CUGAUGAG X CGAA 1CUGCUGC 


Toil 


GCAGCAGC C GCCCTGGC 


TTsi 


62 


CGGGCCAG CUGAUGAG X CGAA ICGGCUGC 


Toes — 


GCAGCCGC C CTGGCCCG 


1789 


63 


ACGGGCCA CUGAUGAG X CGAA IGCGGCUG 


To?6 


CAGCCGCC C TGGCCCGT 


1790 


64 


GACGGGCC CUGAUGAG X CGAA IGGCGGCU 


To67 


AGCCGCCC T GGCCCGTC 


T791 


68 


CCAUGACG CUGAUGAG X CGAA ICCAGGGC 


To?8 


GCCCTGGC C CGTCATGG 


TTr^ — 


69 


UCCAUGAC CUGAUGAG X CGAA IGCCAGGG 


ToS9 


CCCTGGCC C GTCATGGA 


^793 — 


— 


CAUCUCCA CUGAUGAG X CGAA IACGGGCC 


T070 


GGCCCGTC A TGGAGATG 


1794 


98 


UGUCGAUC CUGAUGAG X CGAA ICUCGAAC 


ToTT — 


GTTCGAGC A GATCGACA 


V795 




CCCGGACU CUGAUGAG X CGAA IUCGAUCU 


T072 


AGATCGAC A AGTCCGGG 


1796 


"TIT - 


CAGCUCCC CUGAUGAG X CGAA IACUUGUC 


To73 


GACAAGTC C GGGAGCTG 


T797 






TotI — 


CCGGGAGC T GGGCGGCC 


"1798 




UGGUAAAU CUGAUGAG X CGAA ICCGCCCA 


To75 


TGGGCGGC C ATTTACCA 


T799 

__ 


~T27 


CUGGUAAA CUGAUGAG X CGAA IGCCGCCC 


T076 


GGGCGGCC A TTTACCAG 




"^33 


GAUAUCCU CUGAUGAG X CGAA IUAAAUGG 


T077 


CCATTTAC C AGGATATC 


Tsoi — 


"T34 


GGAUAUCC CUGAUGAG X CGAA IGUAAAUG 


TF78 — 


CATTTACC A GGATATCC 


Tio2 






To79 


AGGATATC C GACATGAA 


T?ol — 


— ITi — 


UGGCUUCA CUGAUGAG X CGAA IUCGGAUA 


1080 


TATCCGAC A TGAAGCCA 


Tio4 — 


— — — 


AAGUCACU CUGAUGAG X CGAA ICUUCAUG 


1081 


CATGAAGC C AGTGACTT 


Tioi — 


^L54 


GAAGUCAC CUGAUGAG X CGAA IGCUUCAU 


1082 


ATGAAGCC A GTGACTTC 


1806 


— 160 — 


ACAUGGGA CUGAUGAG X CGAA IUCACUGG 


Tosl — 


CCAGTGAC T TCCCATGT 


1807 


— 163 


UCUACAUG CUGAUGAG X CGAA IAAGUCAC 


1084 


GTGACTTC C CATGTAGA 




— 


CUCUACAU CUGAUGAG X CGAA IGAAGUCA 


1085 


TGACTTCC C ATGTAGAG 


1809 


— 165 — 


ACUCUACA CUGAUGAG X CGAA IGGAAGUC 


1086 


GACTTCCC A TGTAGAGT 


1810 


"177 — 


GGAAGCUU CUGAUGAG X CGAA ICCACUCU 


1087 


AGAGTGGC C AAGCTTCC 


1811 


178 


AGGAAGCU CUGAUGAG X CGAA IGCCACUC 


1088 


GAGTGGCC A AGCTTCCT 


1812 


182 


UCUUAGGA CUGAUGAG X CGAA ICUUGGCC 


1089 


GGCCAAGC T TCCTAAGA 


1313 


185 


UGUUCUUA CUGAUGAG X CGAA IAAGCUUG 


1090 


CAAGCTTC C TAAGAACA 


1814 


186 


UUGUUCUU CUGAUGAG X CGAA IGAAGCUU 


1091 


AAGCTTCC T AAGAACAA 


1815 


193 


UCGGUUUU CUGAUGAG X CGAA IUUCUUAG 


1092 


CTAAGAAC A AAAACCGA 


1816 


199 


CCUAUUUC CUGAUGAG X CGAA IUUUUUGU 


1093 


ACAAAAAC C GAAATAGG 


1817 


211 


GACGUCUC CUGAUGAG X CGAA IUACCUAU 


1094 


ATAGGTAC A GAGACGTC 


1818 


220 


AAAGGGAC CUGAUGAG X CGAA IACGUCUC 


1095 


GAGACGTC A GTCCCTTT 


1819 


224 


GGUCAAAG CUGAUGAG X CGAA IACUGACG 


1096 


CGTCAGTC C CTTTGACC 


1320 


225 


UGGUCAAA CUGAUGAG X CGAA IGACUGAC 


1097 


GTCAGTCC C TTTGACCA 


1321 


226 


AUGGUCAA CUGAUGAG X CGAA IGGACUGA 


1098 


TCAGTCCC T TTGACCAT 


1822 


232 


CCGACUAU CUGAUGAG X CGAA IUCAAAGG 


1099 


CCTTTGAC C ATAGTCGG 


1323 



WO 01/16312 



193 



PCT/US00/23998 



UCCGACUA CUGAUGAG X CGAA IGUCAAAG 
CUUGAUGU CUGAUGAG X CGAA 



CTTTGACC A TAGTCGGA 
GATTAAAC T ACATCAAG 



CUUCUUGA CUGAUGAG X CGAA IUAGUUUA 



TAAACTAC A TCAAGAAG 



UAUCUUCU CUGAUGAG X CGAA IAUGUAGU 



ACTACATC A AGAAGATA 



GUUGAUAU CUGAUGAG X CGAA IUCAUUAU 



ATAATGAC T ATATCAAC 



ACUAGCGU CUGAUGAG X CGAA IAUAUAGU 



ACTATATC A ACGCTAGT 



AUCAAACU CUGAUGAG X CGAA ICGUUGAU 



ATCAACGC T AGTTTGAT 



CUCCUUUG CUGAUGAG X CGAA ICUUCUUC 



GAAGAAGC C CAAAGGAG 



ACUCCUUU CUGAUGAG X CGAA IGCUUCUU 



AAGAAGCC C AAAGGAGT 



AACUCCUU CUGAUGAG X CGAA IGGCUUCU 



AGAAGCCC A AAGGAGTT 



GGUAAGAA CUGAUGAG X CGAA IUAACUCC 



GGAGTTAC A TTCTTACC 



CCUGGGUA CUGAUGAG X CGAA IAAUGUAA 



TTACATTC T TACCCAGG 



GGGCCCUG CUGAUGAG X CGAA IUAAGAAU 



ATTCTTAC C CAGGGCCC 



AGGGCCCU CUGAUGAG X CGAA IGUAAGAA 



TTCTTACC C AGGGCCCT 



AAGGGCCC CUGAUGAG X CGAA IGGUAAGA 



TCTTACCC A GGGCCCTT 



AGGCAAAG CUGAUGAG X CGAA ICCCUGGG 



CCCAGGGC C CTTTGCCT 



UAGGCAAA CUGAUGAG X CGAA IGCCCUGG 



CCAGGGCC C TTTGCCTA 



UUAGGCAA CUGAUGAG X CGAA IGGCCCUG 



CAGGGCCC T TTGCCTAA 



AUGUGUUA CUGAUGAG X CGAA I CAAAGGG 



CCCTTTGC C TAACACAT 



CAUGUGUU CUGAUGAG X CGAA IGCAAAGG 



CCTTTGCC T AACACATG 



ACCGCAUG CUGAUGAG X CGAA IUUAGGCA 



TGCCTAAC A CATGCGGT 



UGACCGCA CUGAUGAG X CGAA IUGUUAGG 



CCTAACAC A TGCGGTCA 



CCCAAAAG CUGAUGAG X CGAA IACCGCAU 



ATGCGGTC A CTTTTGGG 



CUCCCAAA CUGAUGAG X CGAA IUGACCGC 



GCGGTCAC T TTTGGGAG 



UGCUUUUC CUGAUGAG X CGAA ICUCCCAC 



GTGGGAGC A GAAAAGCA 



GACACCCC CUGAUGAG X CGAA ICUUUUCU 



AGAAAAGC A GGGGTGTC 



GUUGAGCA CUGAUGAG X CGAA IACGACAC 



GTGTCGTC A TGCTCAAC 



CUCUGUUG CUGAUGAG X CGAA ICAUGACG 



CGTCATGC T CAACAGAG 



CACUCUGU CUGAUGAG X CGAA IAGCAUGA 



TCATGCTC A ACAGAGTG 



CAUCACUC CUGAUGAG X CGAA IUUGAGCA 

CAGUAUUG CUGAUGAG X CGAA ICGCAUUU 

GCCAGUAU CUGAUGAG X CGAA IUGCGCAU 

UUGUGGCC CUGAUGAG X CGAA IUAUUGUG 

CUUUUUGU CUGAUGAG X CGAA ICCAGUAU 

UCUUUUUG CUGAUGAG X CGAA IGCCAGUA 

CUUCUUUU CUGAUGAG X CGAA IUGGCCAG 

GUCUUCAA CUGAUGAG X CGAA IAUCAUCU 

CAAAUUUG CUGAUGAG X CGAA IUCUUCAA 

UUCAAAUU CUGAUGAG X CGAA IUGUCUUC 

GAGAUCAA CUGAUGAG X CGAA IUUAAUUU 

AUCUUCAG CUGAUGAG X CGAA IAUCAAUG 

AUAUCUUC CUGAUGAG X CGAA IAGAUCAA 

AUAUGACU CUGAUGAG X CGAA IAUAUCUU 

GUAUAAUA CUGAUGAG X CGAA IACUUGAU 

UGUCGCAC CUGAUGAG X CGAA IUAUAAUA 

AUUCUAGC CUGAUGAG X CGAA IUCGCACU 

CCAAUUCU CUGAUGAG X CGAA ICUGUCGC 



TGCTCAAC A 
AAATGCGC A CAATACTG 
ATGCGCAC A ATACTGGC 
CACAATAC T GGCCACAA 
ATACTGGC C ACAAAAAG 
TACTGGCC A CAAAAAGA 
CTGGCCAC A AAAAGAAG 
AGATGATC T TTGAAGAC 
TTGAAGAC A CAAATTTG 
GAAGACAC A AATTTGAA 
AAATTAAC A TTGATCTC 
CATTGATC T CTGAAGAT 
TTGATCTC T GAAGATAT 
AAGATATC A AGTCATAT 
ATCAAGTC A TATTATAC 
TATTATAC A GTGCGACA 
AGTGCGAC A GCTAGAAT 
GCGACAGC T AGAATTGG 
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GGUUGUAA CUGAUGAG X CGAA IUUUUCCA 

GGGUUGUA CUGAUGAG X CGAA IGUUUUCC 

UCUUGGGU CUGAUGAG X CGAA IUAAGGUU 

567 I GUUUCUUG CUGAUGAG X CGAA IUUGUAAG 

568 AGUUUCUU CUGAUGAG X CGAA IGUUGUAA 

569 I GAGUUUCU CUGAUGAG X CGAA IGGUUGUA 
AUCUCUCG CUGAUGAG X CGAA IUUUCUUG 
GAAAUGUA CUGAUGAG X CGAA IAUCUCUC 
AGUGGAAA CUGAUGAG X CGAA IUAAGAUC 
GGUAUAGU CUGAUGAG X CGAA IAAAUGUA 
UGGUAUAG CUGAUGAG X CGAA IGAAAUGU 
UGUGGUAU CUGAUGAG X CGAA IUGGAAAU 
GGCCAUGU CUGAUGAG X CGAA IUAUAGUG 
AGGCCAUG CUGAUGAG X CGAA IGUAUAGU 
UCAGGCCA CUGAUGAG X CGAA IUGGUAUA 
CAAAGUCA CUGAUGAG X CGAA ICCAUGUG 
CCAAAGUC CUGAUGAG X CGAA IGCCAUGU 
GACUCCAA CUGAUGAG X CGAA IUCAGGCC 
UGAUUCAG CUGAUGAG X CGAA IACUCCAA 
GUGAUUCA CUGAUGAG X CGAA IGACUCCA 
GGUGAUUC CUGAUGAG X CGAA IGGACUCC 
GAGGCUGG CUGAUGAG X CGAA IAUUCAGG 
AUGAGGCU CUGAUGAG X CGAA IUGAUUCA 
AAUGAGGC CUGAUGAG X CGAA IGUGAUUC 

_ 639 I AAGAAUGA CUGAUGAG X CGAA ICUGGUGA 

640 I CAAGAAUG CUGAUGAG X CGAA IGCUGGUG 

UUCAAGAA CUGAUGAG X CGAA IAGGCUGG 

AAAGUUCA CUGAUGAG X CGAA IAAUGAGG 

GAAAAGAA CUGAUGAG X CGAA IUUCAAGA 

CUUUGAAA CUGAUGAG X CGAA IAAAGUUC 

UCGGACUU CUGAUGAG X CGAA IAAAAGAA 

667 I UGACUCUC CUGAUGAG X CGAA IACUUUGA 

675 I AGUGACCC CUGAUGAG X CGAA IACUCUCG 

GGGCUGAG CUGAUGAG X CGAA IACCCUGA 

CCGGGCUG CUGAUGAG X CGAA IUGACCCU 

CUCCGGGC CUGAUGAG X CGAA IAGUGACC 

GUGCUCCG CUGAUGAG X CGAA ICUGAGUG 

CGUGCUCC CUGAUGAG X CGAA IGCUGAGU 

CGGGCCCG CUGAUGAG X CGAA ICUCCGGG 

CCACAACG CUGAUGAG X CGAA ICCCGUGC 

ACCACAAC CUGAUGAG X CGAA IGCCCGUG 

CACUGCAG CUGAUGAG X CGAA ICACCACA 

UGCACUGC CUGAUGAG X CGAA IUGCACCA 

GCCUGCAC CUGAUGAG X CGAA ICAGUGCA 

CCGAUGCC CUGAUGAG X CGAA ICACUGCA 

CCUGCCGA CUGAUGAG X CGAA ICCUGCAC 

UCCAGACC CUGAUGAG X CGAA ICCGAUGC 



TGGAAAAC C TTACAACC 
GGAAAACC T TACAACCC 
AACCTTAC A ACCCAAGA 
CTTACAAC C CAAGAAAC 
TTACAACC C AAGAAACT 
TACAACCC A AGAAACTC 
CAAGAAAC T CGAGAGAT 
GAG AG AT C T TACATTTC 
GATCTTAC A TTTCCACT 
TACATTTC C ACTATACC 
ACATTTCC A CTATACCA 
ATTTCCAC T ATACCACA 
CACTATAC C ACATGGCC 
ACTATACC A CATGGCCT 
TATACCAC A TGGCCTGA 
CACATGGC C TGACTTTG 
ACATGGCC T GACTTTGG 
GGCCTGAC T TTGGAGTC 



TTGGAGTC 
TGGAGTCC 



C CTGAATCA 
C TGAATCAC 



GGAGTCCC T GAATCACC 
CCTGAATC A CCAGCCTC 
TGAATCAC C AGCCTCAT 
GAATCACC A GCCTCATT 
TCACCAGC C TCATTCTT 
CACCAGCC T CATTCTTG 
CCAGCCTC A TTCTTGAA 
CCTCATTC T TGAACTTT 
TCTTGAAC T TTCTTTTC 
GAACTTTC T TTTCAAAG 
TTCTTTTC A AAGTCCGA 
TCAAAGTC C GAGAGTCA 
CGAGAGTC A GGGTCACT 
TCAGGGTC A CTCAGCCC 
AGGGTCAC T CAGCCCGG 
GGTCACTC A GCCCGGAG 
CACTCAGC C CGGAGCAC 
ACTCAGCC C GGAGCACG 
CCCGGAGC A CGGGCCCG 
GCACGGGC C CGTTGTGG 
CACGGGCC C GTTGTGGT 
TGTGGTGC A CTGCAGTG 
TGGTGCAC T GCAGTGCA 
TGCACTGC A GTGCAGGC 
TGCAGTGC A GGCATCGG 
GTGCAGGC A TCGGCAGG 
GCATCGGC A GGTCTGGA 
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AAGGUUCC CUGAUGAG X CGAA IACCUGCC 

AGACAGAA CUGAUGAG X CGAA IUUCCAGA 

CAGACAGA CUGAUGAG X CGAA IGUUCCAG 

AGCCAGAC CUGAUGAG X CGAA IAAGGUUC 

UAUCAGCC CUGAUGAG X CGAA IACAGAAG 

CAGGUAUC CUGAUGAG X CGAA ICCAGACA 

AAGAGGCA CUGAUGAG X CGAA IUAUCAGC 

CAAGAGGC CUGAUGAG X CGAA IGUAUCAG 

CAGCAAGA CUGAUGAG X CGAA ICAGGUAU 

767 UCAGCAAG CUGAUGAG X CGAA IGCAGGUA ' 

769 I CAUCAGCA CUGAUGAG X CGAA IAGGCAGG 

UGUCCAUC CUGAUGAG X CGAA ICAAGAGG 

UUUCCUCU CUGAUGAG X CGAA IUCCAUCA 

GGAAGAAG CUGAUGAG X CGAA IUCUUUCC 

CGGAAGAA CUGAUGAG X CGAA IGUCUUUC 

ACGGAAGA CUGAUGAG X CGAA IGGUCUUU 

UCAACGGA CUGAUGAG X CGAA IAAGGGUC 

AUAUCAAC CUGAUGAG X CGAA IAAGAAGG 

CACUUUCU CUGAUGAG X CGAA IAUAUCAA 

UUUCUAAC CUGAUGAG X CGAA ICACUUUC 



3AUGAG X CGAA. ICCCCAUC 
GGCUGUCU CUGAUGAG X CGAA IAUCAGCC 
CGGCUGUC CUGAUGAG X CGAA IGAUCAGC 
UGGUCGGC CUGAUGAG X CGAA IUCUGGAU 
AGCUGGUC CUGAUGAG X CGAA ICUGUCUG 
GCGCAGCU CUGAUGAG X CGAA IUCGGCUG 
AGCGCAGC CUGAUGAG X CGAA IGUCGGCU 
AGAAGCGC CUGAUGAG^X CGAA~I CUGGUCG _ 
GUAGGAGA CUGAUGAG X CGAA ICGCAGCU 
CAGGUAGG CUGAUGAG X CGAA IAAGCGCA 
GCCAGGUA CUGAUGAG X CGAA IAGAAGCG 
AGCCAGGU CUGAUGAG X CGAA I GAGAAGC 
CACAGCCA CUGAUGAG X CGAA IUAGGAGA 
UCACAGCC CUGAUGAG X CGAA IGUAGGAG 
UCGAUCAC CUGAUGAG X CGAA ICCAGGUA 
AUGAAUUU CUGAUGAG X CGAA ICACCUUC 
GAUGAAUU CUGAUGAG X CGAA IGCACCUU 
CCCCAUGA CUGAUGAG X CGAA IAAUUUGG 
GUCCCCCA CUGAUGAG X CGAA IAUGAAUU 
CACGGAAG CUGAUGAG X CGAA IUCCCCCA 
UGCACGGA CUGAUGAG X CGAA IAGUCCCC 
UCCUGCAC CUGAUGAG X CGAA IAAGAGUC 
ACUGAUCC CUGAUGAG X CGAA ICACGGAA 
CCUUCCAC CUGAUGAG X CGAA IAUCCUGC 
CGUGGGAA CUGAUGAG X CGAA ICUCCUUC 
UCCUCGUG CUGAUGAG X CGAA IAAAGCUC 
GUCCUCGU CUGAUGAG X CGAA I GAAAGCU 



GGCAGGTC T GGAACCTT 
TCTGGAAC C TTCTGTCT 
CTGGAACC T TCTGTCTG 
GAACCTTC T GTCTGGCT 
CTTCTGTC T GGCTGATA 
TGTCTGGC T GATACCTG 
GCTGATAC C TGCCTCTT 
CTGATACC T GCCTCTTG 
ATACCTGC C TCTTGCTG 
TACCTGCC T CTTGCTGA 
CCTGCCTC T TGCTGATG 
CCTCTTGC T GATGGACA 
TGATGGAC A AGAGGAAA 
GGAAAGAC C CTTCTTCC 
GAAAGACC C TTCTTCCG 
AAAGACCC T TCTTCCGT 
GACCCTTC T TCCGTTGA 
CCTTCTTC C GTTGATAT 
TTGATATC A AGAAAGTG 
GAAAGTGC T GTTAGAAA 
GATGGGGC T GATCCAGA 
GGCTGATC C AGACAGCC 
GCTGATCC A GACAGCCG 
ATCCAGAC A GCCGACCA 
CAGACAGC C GACCAGCT 
CAGCCGAC C AGCTGCGC 
AGCCGACC A GCTGCGCT 
CGACCAGC T GCGCTTCT 
AGCTGCGC T TCTCCTAC 
TGCGCTTC T CCTACCTG 
CGCTTCTC C TACCTGGC 
GCTTCTCC T ACCTGGCT 
TCTCCTAC C TGGCTGTG 
CTCCTACC T GGCTGTGA 
TACCTGGC T GTGATCGA 
GAAGGTGC C AAATTCAT 
AAGGTGCC A AATTCATC 
CCAAATTC A TCATGGGG 
AATTCATC A TGGGGGAC 
TGGGGGAC T CTTCCGTG 
GGGGACTC T TCCGTGCA 
GACTCTTC C GTGCAGGA 
TTCCGTGC A GGATCAGT 
GCAGGATC A GTGGAAGG 
GAAGGAGC T TTCCCACG 
GAGCTTTC C CACGAGGA 
AGCTTTCC C ACGAGGAC 
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1002 
1003 



1012 
1021 



GGUCCUCG CUGAUGAG X CGAA IGGAAAGC 
GGGCUCCA CUGAUGAG X CGAA IUCCUCGU 
GGGGCUCC CUGAUGAG X CGAA IGUCCUCG 
CGGGUGGG CUGAUGAG X CGAA ICUCCAGG 
UCGGGUGG CUGAUGAG X CGAA IGCUCCAG 
CUCGGGUG CUGAUGAG X CGAA IGGCUCCA 
GCUCGGGU CUGAUGAG X CGAA IGGGCUCC 
UGCUCGGG CUGAUGAG X CGAA IGGGGCUC 
UAUGCUCG CUGAUGAG X CGAA IUGGGGGC 
AUAUGCUC CUGAUGAG X CGAA IGUGGGGG 
GGGGGAUA CUGAUGAG X CGAA ICUCGGGU 
AGGUGGGG CUGAUGAG X CGAA IAUAUGCU 
GAGGUGGG CUGAUGAG X CGAA IGAUAUGC 
GGAGGUGG CUGAUGAG X CGAA IGGAUAUG 
GGGAGGUG CUGAUGAG X CGAA I GGGAUAU 
GGGGAGGU CUGAUGAG X CGAA IGGGGAUA 
■ CGGGGAGG CUGAUGAG X CGAA IGGGGGAU 
GCCGGGGA CUGAUGAG X CGAA IUGGGGGC 
GGCCGGGG CUGAUGAG X CGAA IGUGGGGG 
GUGGCCGG CUGAUGAG X CGAA IAGGUGGG 
GGUGGCCG CUGAUGAG X CGAA IGAGGUGG 
GGGUGGCC CUGAUGAG X CGAA IGGAGGUG 
GUUUGGGU CUGAUGAG X CGAA I CCGGGGA 
CGUUUGGG CUGAUGAG X CGAA IGCCGGGG 
UUCGUUUG CUGAUGAG X CGAA IUGGCCGG 
AUUCGUUU CUGAUGAG X CGAA IGUGGCCG 
GAUUCGUU CUGAUGAG X CGAA IGGUGGCC 
UGGCUCCA CUGAUGAG X CGAA IAUUCGUU 
X CGAA IGAUUCGU 



GCTTTCCC A CGAGGACC 
ACGAGGAC C TGGAGCCC 
CGAGGACC T GGAGCCCC 
CCTGGAGC C CCCACCCG 
CTGGAGCC C CCACCCGA 
TGGAGCCC C CACCCGAG 
GGAGCCCC C ACCCGAGC 
GAGCCCCC A CCCGAGCA 
GCCCCCAC C CGAGCATA 
CCCCCACC C GAGCATAT 
ACCCGAGC A TATCCCCC 
AGCATATC C CCCCACCT 
GCATATCC C CCCACCTC 
CATATCCC C CCACCTCC 
ATATCCCC C CACCTCCC 
TATCCCCC C ACCTCCCC 
ATCCCCCC A CCTCCCCG 
CCCCCCAC C TCCCCGGC 
CCCCCACC T CCCCGGCC 



CCACCTCC C CGGCCACC 
CACCTCCC C GGCCACCC 
TCCCCGGC C ACCCAAAC 
CCCCGGCC A CCCAAACG 
CCGGCCAC C CAAACGAA 
CGGCCACC C AAACGAAT 
GGCCACCC A AACGAATC 
AACGAATC C TGGAGCCA 
ACGAATCC T GGAGCCAC 



CAUUGUGU CUGAUGAG X CGAA ICUCCAGG 



CCTGGAGC C ACACAATG 



CCAUUGUG CUGAUGAG X CGAA IGCUCCAG 



CTGGAGCC A CACAATGG 



1031 
1033 
1045 



UCCCAUUG CUGAUGAG X CGAA IUGGCUCC 



GGAGCCAC A CAATGGGA 



UUUCCCAU CUGAUGAG X CGAA IUGUGGCU 



AGCCACAC A ATGGGAAA 



GAACUCCC CUGAUGAG X CGAA ICAUUUCC 



GGAAATGC A GGGAGTTC 



AUUUGGGA CUGAUGAG X CGAA IAACUCCC 



GGGAGTTC T TCCCAAAT 



1057 
1058 
1059 
10S4 
1066 



1086 
1087 
1088 
1102 



GUGAUUUG CUGAUGAG X CGAA IAAGAACU 



AGTTCTTC C CAAATCAC 



GGUGAUUU CUGAUGAG X CGAA IGAAGAAC 



GTTCTTCC C AAATCACC 



UGGUGAUU CUGAUGAG X CGAA I GGAAGAA 



TTCTTCCC A AATCACCA 



CCCACUGG CUGAUGAG X CGAA IAUUUGGG 



CCCAAATC A CCAGTGGG 



X CGAA IUGAUUUG 
UCACCCAC CUGAUGAG X CGAA IGUGAUUU 
UCCUCCUG CUGAUGAG X CGAA IUCUCUUC 
AUCCUCCU CUGAUGAG X CGAA IGUCUCUU 
UAUCCUCC CUGAUGAG X CGAA IGGUCUCU 
GAUGGGGC CUGAUGAG X CGAA IUCUUUAU 
CUUGAUGG CUGAUGAG X CGAA ICAGUCUU 
CCUUGAUG CUGAUGAG X CGAA IGCAGUCU 



CAAATCAC C AGTGGGTG 
AAATCACC A GTGGGTGA 
GAAGAGAC C CAGGAGGA 
AAGAGACC C AGGAGGAT 
AGAGACCC A GGAGGATA 
ATAAAGAC T GCCCCATC 
AAGACTGC C CCATCAAG 
AGACTGCC C CATCAAGG 
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UCCUUGAU CUGAUGAG X CGAA IGGCAGUC 

UUCCUUGA CUGAUGAG X CGAA IGGGCAGU 

UUCUUCCU CUGAUGAG X CGAA IAUGGGGC 

AUUUAAGG CUGAUGAG X CGAA ICUUCCUU 

CAUUUAAG CUGAUGAG X CGAA IGCUUCCU 

GCAUUUAA CUGAUGAG X CGAA IGGCUUCC 

GGCAUUUA CUGAUGAG X CGAA IGGGCUUC 

UAGGGUGC CUGAUGAG X CGAA ICAUUUAA 

CCGUAGGG CUGAUGAG X CGAA ICGGCAUU 

UGCCGUAG CUGAUGAG X CGAA IUGCGGCA 

AUGCCGUA CUGAUGAG X CGAA IGUGCGGC 

GAUGCCGU CUGAUGAG X CGAA IGGUGCGG 

GCUUUCGA CUGAUGAG X CGAA ICCGUAGG 

UUGACUCA CUGAUGAG X CGAA ICUUUCGA 

CAGUGUCU CUGAUGAG X CGAA IACUCAUG 

AACUUCAG CUGAUGAG X CGAA IUCUUGAC 

CUAACUUC CUGAUGAG X CGAA IUGUCUUG 

CACCUCGA CUGAUGAG X CGAA IACUUCCC 

GCAGCCUG CUGAUGAG X CGAA ICACCUCG 

GGCAGCCU CUGAUGAG X CGAA IGCACCUC 

AGGCAGCC CUGAUGAG X CGAA IGGCACCU 

GGGGAGGC CUGAUGAG X CGAA ICCUGGGC 

GCUGGGGA CUGAUGAG X CGAA ICAGCCUG 

GGCUGGGG CUGAUGAG X CGAA IGCAGCCU 

UUGGCUGG CUGAUGAG X CGAA IAGGCAGC 

UUUGGCUG CUGAUGAG X CGAA IGAGGCAG 

CUUUGGCU CUGAUGAG X CGAA IGGAGGCA 

CCUUUGGC CUGAUGAG X CGAA IGGGAGGC 

UCCCCUUU CUGAUGAG X CGAA I CUGGGGA 

CUCCCCUU CUGAUGAG X CGAA I GCUGGGG 

GCAGUGAC CUGAUGAG X CGAA ICUCCCCU 

UCGGGCAG CUGAUGAG X CGAA IACGGCUC 

UCUCGGGC CUGAUGAG X CGAA IUGACGGC 

CCUUCUCG CUGAUGAG X CGAA ICAGUGAC 

UCCUUCUC CUGAUGAG X CGAA IGCAGUGA 

CAGUGCAU CUGAUGAG X CGAA IUCCUCGU 

UCAGUGCA CUGAUGAG X CGAA IGUCCUCG 

UAACUCAG CUGAUGAG X CGAA ICAUGGUC 

AGUAACUC CUGAUGAG X CGAA IUGCAUGG 

GGGCUUCC CUGAUGAG X CGAA IUAACUCA 

CCAGGAAG CUGAUGAG X CGAA ICUUCCAG 

ACCAGGAA CUGAUGAG X CGAA IGCUUCCA 

GACCAGGA CUGAUGAG X CGAA IGGCUUCC 

GUUGACCA CUGAUGAG X CGAA IAAGGGCU 

UGUUGACC CUGAUGAG X CGAA IGAAGGGC 

GCACAUGU CUGAUGAG X CGAA IACCAGGA 

CACGCACA CUGAUGAG X CGAA IUUGACCA 



GACTGCCC C ATCAAGGA 
ACTGCCCC A TCAAGGAA 
GCCCCATC A AGGAAGAA 
AAGGAAGC C CCTTAAAT 
AGGAAGCC C CTTAAATG 
GGAAGCCC C TTAAATGC 
GAAGCCCC T TAAATGCC 
TTAAATGC C GCACCCTA 
AATGCCGC A CCCTACGG 
TGCCGCAC C CTACGGCA 
GCCGCACC C TACGGCAT 
CCGCACCC T ACGGCATC 
CCTACGGC A TCGAAAGC 
TCGAAAGC A TGAGTCAA 
CATGAGTC A AGACACTG 
GTCAAGAC A CTGAAGTT 
CAAGACAC T GAAGTTAG 
GGGAAGTC T TCGAGGTG 
CGAGGTGC C CAGGCTGC . 
GAGGTGCC C AGGCTGCC 
AGGTGCCC A GGCTGCCT 
GCCCAGGC T GCCTCCCC 
CAGGCTGC C TCCCCAGC 
AGGCTGCC T CCCCAGCC 
GCTGCCTC C CCAGCCAA 
CTGCCTCC C CAGCCAAA 
TGCCTCCC C AGCCAAAG 
GCCTCCCC A GCCAAAGG 
TCCCCAGC C AAAGGGGA 
CCCCAGCC A AAGGGGAG 
AGGGGAGC C GTCACTGC 
GAGCCGTC A CTGCCCGA 
GCCGTCAC T GCCCGAGA 
GTCACTGC C CGAGAAGG 
TCACTGCC C GAGAAGGA 
ACGAGGAC C ATGCACTG 



A TGCACTGA 
GACCATGC A CTGAGTTA 
CCATGCAC T GAGTTACT 
TGAGTTAC T GGAAGCCC 
CTGGAAGC C CTTCCTGG 
TGGAAGCC C TTCCTGGT 
GGAAGCCC T TCCTGGTC 
AGCCCTTC C TGGTCAAC 
GCCCTTCC T GGTCAACA 
TCCTGGTC A ACATGTGC 
TGGTCAAC A TGTGCGTG 
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AGGACCGU CUGAUGAG X CGAA ICCACGCA 



TGCGTGGC T ACGGTCCT 



GGCCGUGA CUGAUGAG X CGAA IACCGUAG 



CTACGGTC C TCACGGCC 



1328 
1330 
1335 



CGGCCGUG CUGAUGAG X CGAA IGACCGUA 



TACGGTCC T CACGGCCG 



GCCGGCCG CUGAUGAG X CGAA IAGGACCG 



CGGTCCTC A CGGCCGGC 



UAAGCGCC CUGAUGAG X CGAA ICCGUGAG 



CTCACGGC C GGCGCTTA 



CAGAGGUA CUGAUGAG X CGAA ICGCCGGC 



GCCGGCGC T TACCTCTG 



1345 
1346 



GUAGCAGA CUGAUGAG X CGAA IUAAGCGC 



GCGCTTAC C TCTGCTAC 



UGUAGCAG CUGAUGAG X CGAA IGUAAGCG 



CGCTTACC T CTGCTACA 



CCUGUAGC CUGAUGAG X CGAA IAGGUAAG 



CTTACCTC T GCTACAGG 



1351 
1354 



GAACCUGU CUGAUGAG X CGAA I CAGAGGU 



ACCTCTGC T ACAGGTTC 



CAGGAACC CUGAUGAG X CGAA IUAGCAGA 



TCTGCTAC A GGTTCCTG 



GUUGAACA CUGAUGAG X CGAA IAACCUGU 



ACAGGTTC C TGTTCAAC 



1361 
1366 
1369 
1372 
1375 
1377 
1382 



UGUUGAAC CUGAUGAG X CGAA IGAACCUG 



CAGGTTCC T GTTCAACA 



GUUGCUGU CUGAUGAG X CGAA IAACAGGA 



TCCTGTTC A ACAGCAAC 



UGUGUUGC CUGAUGAG X CGAA IUUGAACA 



TGTTCAAC A GCAACACA 



CUAUGUGU CUGAUGAG X CGAA ICUGUUGA 



TCAACAGC A ACACATAG 



AGGCUAUG CUGAUGAG X CGAA IUUGCUGU 



ACAGCAAC A CATAGCCT 



UCAGGCUA CUGAUGAG X CGAA IUGUUGCU 



AGCAACAC A TAGCCTGA 



GAGGGUCA CUGAUGAG X CGAA ICUAUGUG 



CACATAGC C TGACCCTC 



GGAGGGUC CUGAUGAG X CGAA IGCUAUGU 



ACATAGCC T GACCCTCC 



UGGAGGAG CUGAUGAG X CGAA IUCAGGCU 



AGCCTGAC C CTCCTCCA 



GUGGAGGA CUGAUGAG X CGAA IGUCAGGC 



GCCTGACC C TCCTCCAC 



1389 
1391 



AGUGGAGG CUGAUGAG X CGAA IGGUCAGG 



CCTGACCC T CCTCCACT 



GGAGUGGA CUGAUGAG X CGAA IAGGGUCA 



TGACCCTC C TCCACTCC 



UGGAGUGG CUGAUGAG X CGAA IGAGGGUC 



GACCCTCC T CCACTCCA 



GGUGGAGU CUGAUGAG X CGAA IAGGAGGG 



CCCTCCTC C ACTCCACC 



1395 
1397 



AGGUGGAG CUGAUGAG X CGAA IGAGGAGG 



CCTCCTCC A CTCCACCT 



GGAGGUGG CUGAUGAG X CGAA IUGGAGGA 



TCCTCCAC T CCACCTCC 



GUGGAGGU CUGAUGAG J 



CTCCACTC C ACCTCCAC 



1400 
1402 
1403 
1405 
1406 



GGUGGAGG CUGAUGAG X CGAA IGAGUGGA 



TCCACTCC A CCTCCACC 



UGGGUGGA CUGAUGAG X CGAA IUGGAGUG 



CACTCCAC C TCCACCCA 



GUGGGUGG CUGAUGAG X CGAA IGUGGAGU 



ACTCCACC T CCACCCAC 



CAGUGGGU CUGAUGAG X CGAA IAGGUGGA 



TCCACCTC C ACCCACTG 



ACAGUGGG CUGAUGAG X CGAA IGAGGUGG 



CCACCTCC A CCCACTGT 



GGACAGUG CUGAUGAG X CGAA IUGGAGGU 



ACCTCCAC C CACTGTCC 



1409 
1410 



CGGACAGU CUGAUGAG X CGAA IGUGGAGG 



CCTCCACC C ACTGTCCG 



GCGGACAG CUGAUGAG X CGAA IGGUGGAG 



CTCCACCC A CTGTCCGC 



AGGCGGAC CUGAUGAG X CGAA IUGGGUGG 



CCACCCAC T GTCCGCCT 



GCAGAGGC CUGAUGAG X CGAA IACAGUGG 



CCACTGTC C GCCTCTGC 



1419 
1420 
1422 
1425 



CGGGCAGA CUGAUGAG X CGAA ICGGACAG 



CTGTCCGC C TCTGCCCG 



GCGGGCAG CUGAUGAG X CGAA IGCGGACA 



TGTCCGCC T CTGCCCGC 



CUGCGGGC CUGAUGAG X CGAA IAGGCGGA 



TCCGCCTC T GCCCGCAG 



GCUCUGCG CUGAUGAG X CGAA ICAGAGGC 



GCCTCTGC C CGCAGAGC 



GGCUCUGC CUGAUGAG X CGAA IGCAGAGG 



CCTCTGCC C GCAGAGCC 



GUGGGCUC CUGAUGAG X CGAA ICGGGCAG 



CTGCCCGC A GAGCCCAC 



CGGGCGUG CUGAUGAG X CGAA ICUCUGCG 



CGCAGAGC C CACGCCCG 



UCGGGCGU CUGAUGAG X CGAA IGCUCUGC 



GCAGAGCC C ACGCCCGA 
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GUCGGGCG CUGAUGAG X CGAA IGGCUCUG 



CAGAGCCC A CGCCCGAC 



GCUAGUCG CUGAUGAG X CGAA ICGUGGGC 



GCCCACGC C CGACTAGC 



UGCUAGUC CUGAUGAG X CGAA IGCGUGGG 



CCCACGCC C GACTAGCA 



UGCCUGCU CUGAUGAG X CGAA IUCGGGCG 



CGCCCGAC T AGCAGGCA 



1449 
1453 
1457 
1473 
1476 
1481 
1493 



GGCAUGCC CUGAUGAG X CGAA ICUAGUCG 



CGACTAGC A GGCATGCC 



CCGCGGCA CUGAUGAG X CGAA ICCUGCUA 



TAGCAGGC A TGCCGCGG 



CCUACCGC CUGAUGAG X CGAA ICAUGCCU 



AGGCATGC C GCGGTAGG 



GGUCCGGC CUGAUGAG X CGAA ICCCUUAC 



GTAAGGGC C GCCGGACC 



CGCGGUCC CUGAUGAG X CGAA ICGGCCCU 



AGGGCCGC C GGACCGCG 



CUCUACGC CUGAUGAG X CGAA IUCCGGCG 



CGCCGGAC C GCGTAGAG 



CGGGGCCC CUGAUGAG X CGAA ICUCUCUA 



TAGAGAGC C GGGCCCCG 



CCGUCCGG CUGAUGAG X CGAA ICCCGGCU 



AGCCGGGC C CCGGACGG 



1499 
1500 
1517 
1520 



UCCGUCCG CUGAUGAG X CGAA IGCCCGGC 



GQCGGGCC C CGGACGGA 



GUCCGUCC CUGAUGAG X CGAA- IGGCCCGG 



CCGGGCCC C GGACGGAC 



UUUAGUGC CUGAUGAG X CGAA IAACCAAC 



GTTGGTTC T GCACTAAA 



GGUUUUAG CUGAUGAG X CGAA ICAGAACC 



GGTTCTGC A CTAAAACC 



UGGGUUUU CUGAUGAG X CGAA IUGCAGAA 



TTCTGCAC T AAAACCCA 



GGAAGAUG CUGAUGAG X CGAA IUUUUAGU 



ACTAAAAC C CATCTTCC 



GGGAAGAU CUGAUGAG X CGAA IGUUUUAG 



CTAAAACC C ATCTTCCC 



GGGGAAGA CUGAUGAG X CGAA IGGUUUUA 



TAAAACCC A TCTTCCCC 



UCCGGGGA CUGAUGAG X CGAA IAUGGGUU 



AACCCATC T TCCCCGGA 



ACAUCCGG CUGAUGAG X CGAA IAAGAUGG 



CCATCTTC C CCGGATGT 



CACAUCCG CUGAUGAG X CGAA I GAAGAUG 



CATCTTCC C CGGATGTG 



ACACAUCC CUGAUGAG X CGAA IGGAAGAU 



ATCTTCCC C GGATGTGT 



GAGGGGUG CUGAUGAG X CGAA IACACACA 



TGTGTGTC T CACCCCTC 



AUGAGGGG CUGAUGAG X CGAA IAGACACA 



TGTGTCTC A CCCCTCAT 



1554 
1555 
1556 



GGAUGAGG CUGAUGAG X CGAA IUGAGACA 



TGTCTCAC C CCTCATCC 



AGGAUGAG CUGAUGAG X CGAA IGUGAGAC 



GTCTCACC C CTCATCCT 



AAGGAUGA CUGAUGAG X CGAA IGGUGAGA 



TCTCACCC C TCATCCTT 



AAAGGAUG CUGAUGAG X CGAA IGGGUGAG 



CTCACCCC T CATCCTTT 



1559 
1562 
1563 
1569 
1576 



UAAAAGGA CUGAUGAG X CGAA IAGGGGUG 



CACCCCTC A TCCTTTTA 



AAGUAAAA CUGAUGAG X CGAA IAUGAGGG 



CCCTCATC C TTTTACTT 



AAAGUAAA CUGAUGAG X CGAA IGAUGAGG 



CCTCATCC T TTTACTTT 



GGGCAAAA CUGAUGAG X CGAA IUAAAAGG 



CCTTTTAC T TTTTGCCC 



GUGGAAGG CUGAUGAG X CGAA ICAAAAAG 



CTTTTTGC C CCTTCCAC 



AGUGGAAG CUGAUGAG X CGAA IGCAAAAA 



TTTTTGCC C CTTCCACT 



AAGUGGAA CUGAUGAG X CGAA IGGCAAAA 



TTTTGCCC C TTCCACTT 



AAAGUGGA CUGAUGAG X CGAA IGGGCAAA 



TTTGCCCC T TCCACTTT 



CUCAAAGU CUGAUGAG X CGAA IAAGGGGC 



GCCCCTTC C ACTTTGAG 



1583 
1585 
1594 
1595 
1600 



ACUCAAAG CUGAUGAG X CGAA IGAAGGGG 



CCCCTTCC A CTTTGAG r 



GUACUCAA CUGAUGAG X CGAA IUGGAAGG 



CCTTCCAC T TTGAGTAC 



GUGGAUUU CUGAUGAG X CGAA IUACUCAA 



TTGAGTAC C AAATCCAC 



UGUGGAUU CUGAUGAG X CGAA IGUACUCA 



TGAGTACC A AATCCACA 



UGGCUUGU CUGAUGAG X CGAA IAUUUGGU 



ACCAAATC C ACAAGCCA 



AUGGCUUG CUGAUGAG X CGAA IGAUUUGG 



CCAAATCC A CAAGCCAT 



1603 
1607 



AAAUGGCU CUGAUGAG X CGAA IUGGAUUU 



AAATCCAC A AGCCATTT 



CAAAAAAU CUGAUGAG X CGAA ICUUGUGG 



CCACAAGC C ATTTTTTG 
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UCAAAAAA CUGAUGAG X CGAA IGCUUGUG 



CACAAGCC A TTTTTTGA 



GCCAGCAU CUGAUGAG X CGAA IUACUCUC 



GAGAGTAC C ATGCTGGC 



CGCCAGCA CUGAUGAG X CGAA IGUACUCU 



AGAGTACC A TGCTGGCG 



GCGCCGCC CUGAUGAG X CGAA ICAUGGUA 



TACCATGC T GGCGGCGC 



CUUCCCUC CUGAUGAG X CGAA ICGCCGCC 



GGCGGCGC A GAGGGAAG 



CGGGUGUA CUGAUGAG X CGAA ICCCCUUC 



GAAGGGGC C TACACCCG 



1S64 
1667 
1669 
1670 



ACGGGUGU CUGAUGAG X CGAA IGCCCCUU 



AAGGGGCC T ACACCCGT 



AAGACGGG CUGAUGAG X CGAA IUAGGCCC 



GGGCCTAC A CCCGTCTT 



CCAAGACG CUGAUGAG X CGAA IUGUAGGC 



GCCTACAC C CGTCT'f GG 



CCCAAGAC CUGAUGAG X CGAA IGUGUAGG 



CCTACACC C GTCTTGGG 



GAGCCCCA CUGAUGAG X CGAA I ACGGGUG 



CACCCGTC T TGGGGCTC 



GUGGGGCG CUGAUGAG X CGAA ICCCCAAG 



CTTGGGGC T CGCCCCAC 



1685 
1686 



CUGGGUGG CUGAUGAG X CGAA ICGAGCCC 



GGGCTCGC C CCACCCAG 



CCUGGGUG CUGAUGAG X CGAA IGCGAGCC 



GGCTCGCC C CACCCAGG 



CCCUGGGU CUGAUGAG X CGAA I GGCGAGC 



GCTCGCCC C ACCCAGGG 



GCCCUGGG CUGAUGAG X CGAA I GGGCGAG 



CTCGCCCC A CCCAGGGC 



GAGCCCUG CUGAUGAG X CGAA IUGGGGCG 



GGAGCCCU CUGAUGAG X CGAA I GUGGGGC 



GCCCCACC C AGGGCTCC 



GGGAGCCC CUGAUGAG X CGAA I GGUGGGG 



CCCCACCC A GGGCTCCC 



CAGGAGGG CUGAUGAG X CGAA ICCCUGGG 



CCCAGGGC T CCCTCCTG 



UCCAGGAG CUGAUGAG X CGAA IAGCCCUG 



CAGGGCTC C CTCCTGGA 



CUCCAGGA CUGAUGAG X CGAA IGAGCCCU 



AGGGCTCC C TCCTGGAG 



1701 

1703 
1704 



GCUCCAGG CUGAUGAG X CGAA IGGAGCCC 



GGGCTCCC T CCTGGAGC 



AUGCUCCA CUGAUGAG X CGAA IAGGGAGC 



GCTCCCTC C TGGAGCAT 



GAUGCUCC CUGAUGAG X CGAA IGAGGGAG 



CTCCCTCC T GGAGCATC 



GCCUGGGA CUGAUGAG X CGAA ICUCCAGG 



CCTGGAGC A TCCCAGGC 



CCCGCCUG CUGAUGAG X CGAA IAUGCUCC 



GGAGCATC C CAGGCGGG 



GCCCGCCU CUGAUGAG X CGAA IGAUGCUC 



GAGCATCC C AGGCGGGC 



CGCCCGCC CUGAUGAG X CGAA IGGAUGCU 



AGCATCCC A GGCGGGCG 



GUCUGGCG CUGAUGAG X CGAA ICCGCCCG 



CGGGCGGC A CGCCAGAC 



1730 
1731 
1735 
1738 



GGCUGUCU CUGAUGAG X CGAA ICGUGCCG 



CGGCACGC C AGACAGCC 



GGGCUGUC CUGAUGAG X CGAA IGCGUGCC 



GGCACGCC A GACAGCCC 



GGGGGGGC CUGAUGAG X CGAA IUCUGGCG 



CGCCAGAC A GCCCCCCC 



AAGGGGGG CUGAUGAG X CGAA ICUGUCUG 



CAGACAGC C CCCCCCTT 



CAAGGGGG CUGAUGAG X CGAA IGCUGUCU 



AGACAGCC C CCCCCTTG 



UCAAGGGG CUGAUGAG X CGAA IGGCUGUC 



GACAGCCC C CCCCTTGA 



UUCAAGGG CUGAUGAG X CGAA IGGGCUGU 



ACAGCCCC C CCCTTGAA 



1742 
1743 



AUUCAAGG CUGAUGAG X CGAA I GGGGCUG 



CAGCCCCC C CCTTGAAT 



GAUUCAAG CUGAUGAG X CGAA IGGGGGCU 



AGCCCCCC C CTTGAATC 



AGAUUCAA CUGAUGAG X CGAA IGGGGGGC 



GCCCCCCC C TTGAATCT 



1745 
1752 
1755 
1762 
1765 



CAGAUUCA CUGAUGAG X CGAA IGGGGGGG 



CCCCCCCC T TGAATCTG 



CUCCCUGC CUGAUGAG X CGAA IAUUCAAG 



CTTGAATC T GCAGGGAG 



UUGCUCCC CUGAUGAG X CGAA ICAGAUUC 



GAATCTGC A GGGAGCAA 



UGGAGAGU CUGAUGAG X CGAA ICUCCCUG 



CAGGGAGC A ACTCTCCA 



GAGUGGAG CUGAUGAG X CGAA IUUGCUCC 



GGAGCAAC T CTCCACTC 



UGGAGUGG CUGAUGAG X CGAA IAGUUGCU 



AGCAACTC T CCACTCCA 



UAUGGAGU CUGAUGAG X CGAA IAGAGUUG 



CAACTCTC C ACTCCATA 
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AUAUGGAG CUGAUGAG X CGAA IGAGAGUU 

AAAUAUGG CUGAUGAG X CGAA IUGGAGAG 

1774 AUAAAUAU CUGAUGAG X CGAA IAGUGGAG 

1775 AAUAAAUA CUGAUGAG X CGAA IGAGUGGA 
1789 | GAAAAAAU CUGAUGAG X CGAA IUUUAAAU 

GCCUUUGG CUGAUGAG X CGAA IAAAAAAU 

1799 UGCCUUUG CUGAUGAG X CGAA IGAAAAAA 

1800 AUGCCUUU CUGAUGAG X CGAA IGGAAAAA 

1801 [ GAUGCCUU CUGAUGAG X CGAA IGGGAAAA 
ACUAUGGA CUGAUGAG X CGAA ICCUUUGG 
UGCACUAU CUGAUGAG X CGAA IAUGCCUU 
GUGCACUA CUGAUGAG X CGAA IGAUGCCU 
AAUGCUAG CUGAUGAG X CGAA ICACUAUG 
AAAAUGCU CUGAUGAG X CGAA IUGCACUA 
CAAGAAAA CUGAUGAG X CGAA ICUAGUGC 
UUGGUUCA CUGAUGAG X CGAA IAAAAUGC 
ACAUUAUU CUGAUGAG X CGAA IUUCAAGA 
UACAUUAU CUGAUGAG X CGAA IGUUCAAG 
UGCAAGGC CUGAUGAG X CGAA IACAUCAA 
UGAUGCAA CUGAUGAG X CGAA ICUGACAU 
UUGAUGCA CUGAUGAG X CGAA IGCUGACA 
GCCCUUGA CUGAUGAG X CGAA ICAAGGCU 
AAAGCCCU CUGAUGAG X CGAA IAUGCAAG 
UUUGAUAA CUGAUGAG X CGAA ICCCUUGA 
GUACUUUU CUGAUGAG X CGAA IAUAAAGC 
UUUAUUAU CUGAUGAG X CGAA IUACUUUU 
CUACCUGA CUGAUGAG X CGAA IAUUUAUU 
ACUACCUG CUGAUGAG X CGAA IGAUUUAU 
GUACUACC CUGAUGAG X CGAA IAGGAUUU 
CCAUUCCC CUGAUGAG X CGAA IUACUACC 

193 3 CAUGGCAA CUGAUGAG X CGAA ICCUUCCA 

1938 AGGCCCAU CUGAUGAG X CGAA ICAAAGCC 

1939 CAGGCCCA CUGAUGAG X CGAA IGCAAAGC 
1945 I ACGCAGCA CUGAUGAG X CGAA ICCCAUGG 

GACGCAGC CUGAUGAG X CGAA IGCCCAUG 

1949 UCUGACGC CUGAUGAG X CGAA ICAGGCCC 

1955 UACUGGUC CUGAUGAG X CGAA IACGCAGC 

1959 CCAGUACU CUGAUGAG X CGAA IUCUGACG 

1960 | CCCAGUAC CUGAUGAG X CGAA IGUCUGAC 
UCCUUCCC CUGAUGAG X CGAA IUACUGGU 
AUAACAAC CUGAUGAG X CGAA ICUUACAA 
UAUAGCAU CUGAUGAG X CGAA IUUCUAUC 
UAUAUUAU CUGAUGAG X CGAA ICAUUGUU 

2054 UACCCACG CUGAUGAG X CGAA IUUCAUUA 

2076 UCACAUCA CUGAUGAG X CGAA IUUUCUUA 

2091 CGGGACAA CUGAUGAG X CGAA IUAAUCUC 

2097 AAUAAGCG CUGAUGAG X CGAA IACAAAGU 



AACTCTCC A CTCCATAT 
CTCTCCAC T CCATATTT 
CTCCACTC C ATATTTAT 
TCCACTCC A TATTTATT 
ATTTAAAC A ATTTTTTC 
ATTTTTTC C CCAAAGGC 
TTTTTTCC C CAAAGGCA 
TTTTTCCC C AAAGGCAT 
TTTTCCCC A AAGGCATC 
CCAAAGGC A TCCATAGT 
AAGGCATC C ATAGTGCA 
AGGCATCC A TAGTGCAC 
CATAGTGC A CTAGCATT 
TAGTGCAC T AGCATTTT 
GCACTAGC A TTTTCTTG 
GCATTTTC T TGAACCAA 
TCTTGAAC C AATAATGT 
CTTGAACC A ATAATGTA 
TTGATGTC A GCCTTGCA 
ATGTCAGC C TTGCATCA 
TGTCAGCC T TGCATCAA 
AGCCTTGC A TCAAGGGC 
CTTGCATC A AGGGCTTT 
TCAAGGGC T TTATCAAA 
GCTTTATC A AAAAGTAC 
AAAAGTAC A ATAATAAA 
AATAAATC C TCAGGTAG 
ATAAATCC T CAGGTAGT 
AAATCCTC A GGTAGTAC 
GGTAGTAC T GGGAATGG 
TGGAAGGC T TTGCCATG 
GGCTTTGC C ATGGGCCT 
GCTTTGCC A TGGGCCTG 
CCATGGGC C TGCTGCGT 
CATGGGCC T GCTGCGTC 
GGGCCTGC T GCGTCAGA 
GCTGCGTC A GACCAGTA 
CGTCAGAC C AGTACTGG 
GTCAGACC A GTACTGGG 
ACCAGTAC T GGGAAGGA 
TTGTAAGC A GTTGTTAT 
GATAGAAC A ATGCTATA 
AACAATGC T ATAATATA 
TAATGAAC A CGTGGGTA 
TAAGAAAC A TGATGTGA 
GAGATTAC T TTGTCCCG 
ACTTTGTC C CGCTTATT 
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2110 
2112 
2113 



2130 
2136 



2151 
2152 
2169 
2179 
2182 
2189 



2246 
2247 



2259 
2260 



2258 
2273 
2278 
2279 
2284 
2289 



GAAUAAGC CUGAUGAG X CGAA IGACAAAG 

GCAGAAUA CUGAUGAG X CGAA I CGGGACA 

CAGGGAGC CUGAUGAG X CGAA IAAUAAGC 

UAACAGGG CUGAUGAG X CGAA I CAGAAUA 

GAUAACAG CUGAUGAG X CGAA IAGCAGAA 

AGAUAACA CUGAUGAG X CGAA IGAGCAGA 

CAGAUAAC CUGAUGAG X CGAA IGGAGCAG 

GAUCUAGC CUGAUGAG X CGAA IAUAACAG 

CUAGAUCU CUGAUGAG X CGAA I CAGAUAA 

UGAGAACU CUGAUGAG X CGAA IAUCUAGC 

AGUGAUUG CUGAUGAG X CGAA IAACUAGA 

GCAGUGAU CUGAUGAG X CGAA IAGAACUA 

GGGAGCAG CUGAUGAG X CGAA IAUUGAGA 

GGGGGAGC CUGAUGAG X CGAA IUGAUUGA 

CACGGGGG CUGAUGAG X CGAA ICAGUGAU 

CACACGGG CUGAUGAG X CGAA IAGCAGUG 



ACACACGG CUGAUGAG : 



l IGAGCAGU 



UACACACG CUGAUGAG X CGAA IGGAGCAG 

AUACACAC CUGAUGAG X CGAA IGGGAGCA 

ACCUUACA CUGAUGAG X CGAA ICAUUCUA 

ACACAAGA CUGAUGAG X CGAA IACCUUAC 

AGGACACA CUGAUGAG X CGAA IAAGACCU 

UUUCAUCA CUGAUGAG X CGAA IACACAAG 

UUUUCAUC CUGAUGAG X CGAA IGACACAA 

UCAUUUCA CUGAUGAG X CGAA ICACAUAU 

GAGAUCAA CUGAUGAG X CGAA IUUUCUCA " 

GUAAGCAG CUGAUGAG X CGAA IAUCAAAG 

UAGUAAGC CUGAUGAG X CGAA IAGAUCAA 



X CGAA ICAGAGAU 

GGCACAUU CUGAUGAG X CGAA IUAAGCAG 

GGACAUGG CUGAUGAG X CGAA ICACAUUA 

UGGACAUG CUGAUGAG X CGAA IGCACAUU 

UUGGACAU CUGAUGAG X CGAA IGGCACAU 

CUUGGACA CUGAUGAG X CGAA IGGGCACA 

UUGGACUU CUGAUGAG X CGAA IACAUGGG 

GUUGGACU CUGAUGAG X CGAA IGACAUGG 

GGCAGGUU CUGAUGAG X CGAA IACUUGGA 

AGGCAGGU CUGAUGAG X CGAA IGACUUGG 

CACAGGCA CUGAUGAG X CGAA IUUGGACU 

GCACAGGC CUGAUGAG X CGAA IGUUGGAC 

CAUGCACA CUGAUGAG X CGAA ICAGGUUG 

UCAUGCAC CUGAUGAG X CGAA IGCAGGUU 

UCAGGUCA CUGAUGAG X CGAA ICACAGGC 

AAUGAUCA CUGAUGAG X CGAA IUCAUGCA 

UAAUGAUC CUGAUGAG X CGAA IGUCAUGC 

CCAUGUAA CUGAUGAG X CGAA IAUCAGGU 

CACAGCCA CUGAUGAG X CGAA IUAAUGAU 



CTTTGTCC C GCTTATTC 
TGTCCCGC T TATTCTGC 
GCTTATTC T GCTCCCTG 
TATTCTGC T CCCTGTTA 
TTCTGCTC C CTGTTATC 
TCTGCTCC C TGTTATCT 
CTGCTCCC T GTTATCTG 
CTGTTATC T GCTAGATC 
TTATCTGC T AGATCTAG 
GCTAGATC T AGTTCTCA 
TCTAGTTC T CAATCACT 
TAGTTCTC A ATCACTGC 
TCTCAATC A CTGCTCCC 
TCAATCAC T GCTCCCCC 
ATCACTGC T CCCCCGTG 
CACTGCTC C CCCGTGTG 
ACTGCTCC C CCGTGTGT 
CTGCTCCC C CGTGTGTA 
TGCTCCCC C GTGTGTAT 
TAGAATGC A TGTAAGGT 
GTAAGGTC T TCTTGTGT 
AGGTCTTC T TGTGTCCT 
CTTGTGTC C TGATGAAA 
TTGTGTCC T GATGAAAA 
ATATGTGC T TGAAATGA 
TGAGAAAC T TTGATCTC 
CTTTGATC T CTGCTTAC 
TTGATCTC T GCTTACTA 
ATCTCTGC T TACTAATG 
CTGCTTAC T AATGTGCC 
TAATGTGC C CCATGTCC 
AATGTGCC C CATGTCCA 
ATGTGCCC C ATGTCCAA 
TGTGCCCC A TGTCCAAG 
CCCATGTC C AAGTCCAA 
CCATGTCC A AGTCCAAC 
TCCAAGTC C AACCTGCC 
CCAAGTCC A ACCTGCCT 
AGTCCAAC C TGCCTGTG 
GTCCAACC T GCCTGTGC 
CAACCTGC C TGTGCATG 
AACCTGCC T GTGCATGA 
GCCTGTGC A TGACCTGA 
TGCATGAC C TGATCATT 
GCATGACC T GATCATTA 
ACCTGATC A TTACATGG 
ATCATTAC A TGGCTGTG 
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GGAACCAC CUGAUGAG X CGAA ICCAUGUA 



TACATGGC T GTGGTTCC 



2302 
2303 
2308 
2309 
2315 
2322 
2330 
2332 
2335 
2345 
2352 



CAGGCUUA CUGAUGAG X CGAA IAACCACA 



TGTGGTTC C TAAGCCTG 



ACAGGCUU CUGAUGAG X CGAA IGAACCAC 



GTGGTTCC T AAGCCTGT 



CAGCAACA CUGAUGAG X CGAA ICUUAGGA 



TCCTAAGC C TGTTGCTG 



UCAGCAAC CUGAUGAG X CGAA IGCUUAGG 



CCTAAGCC T GTTGCTGA 



AUGACUUC CUGAUGAG X CGAA ICAACAGG 



CCTGTTGC T GAAGTCAT 



AGCGACAA CUGAUGAG X CGAA IACUUCAG 



CTC-AAGTC A TTGTCGCT 



UAUUGCUG CUGAUGAG X CGAA ICGACAAU 



ATTGTCGC T CAGCAATA 



CCUAUUGC CUGAUGAG X CGAA IAGCGACA 



TGTCGCTC A GCAATAGG 



CACCCUAU CUGAUGAG X CGAA ICUGAGCG 



CGCTCAGC A ATAGGGTG 



UGGAAAAC CUGAUGAG X CGAA ICACCCUA 



TAGGGTGC A GTTTTCCA 



CUAUUCCU CUGAUGAG X CGAA IAAAACUG 



CAGTTTTC C AGGAATAG 



CCUAUUCC CUGAUGAG X CGAA IGAAAACU 



AGTTTTCC A GGAATAGG 



2363 
2369 



UAGGCAAA CUGAUGAG X CGAA ICCUAUUC 



GAATAGGC A TTTGCCTA 



AGGAAUUA CUGAUGAG X CGAA ICAAAUGC 



GCATTTGC C TAATTCCT 



CAGGAAUU CUGAUGAG X CGAA I GCAAAUG 



CATTTGCC T AATTCCTG 



2376 
2377 
2381 
2386 



UCAUGCCA CUGAUGAG X CGAA IAAUUAGG 



CCTAATTC C TGGCATGA 



GUCAUGCC CUGAUGAG X CGAA IGAAUUAG 



CTAATTCC T GGCATGAC 



GAGUGUCA CUGAUGAG X CGAA I CCAGGAA 



TTCCTGGC A TGACACTC 



CACUAGAG CUGAUGAG X CGAA IUCAUGCC 



GGCATGAC A CTCTAGTG 



GUCACUAG CUGAUGAG X CGAA IUGUCAUG 



CATGACAC T CTAGTGAC 



AAGUCACU CUGAUGAG X CGAA IAGUGUCA 



TGACACTC T AGTGACTT 



CACCAGGA CUGAUGAG X CGAA IUCACUAG 



CTAGTGAC T TCCTGGTG 



CCUCACCA CUGAUGAG X CGAA IAAGUCAC 



GTGACTTC C TGGTGAGG 



2401 
2410 
2411 
2412 
2415 
2416 



GCCUCACC CUGAUGAG X CGAA IGAAGUCA 



TGACTTCC T GGTGAGGC 



ACAGGCUG CUGAUGAG X CGAA ICGUCACC 



GGTGAGGC C CAGCCTGT 



GACAGGCU CUGAUGAG X CGAA IGCCUCAC 



GTGAGGCC C AGCCTGTC 



GGACAGGC CUGAUGAG X CGAA IGGCCUCA 



TGAGGCCC A GCCTGTCC 



CCAGGACA CUGAUGAG X CGAA ICUGGGCC 



GGCCCAGC C TGTCCTGG 



ACCAGGAC CUGAUGAG X CGAA IGCUGGGC 



GCCCAGCC T GTCCTGGT 



CUGUACCA CUGAUGAG X CGAA IACAGGCU 



AGCCTGTC C TGGTACAG 



GCUGUACC CUGAUGAG X CGAA I GACAGGC 



GCCTGTCC T GGTACAGC 



GACCCUGC CUGAUGAG X CGAA IUACCAGG 



CCTGGTAC A GCAGGGTC 



2430 
2436 



CAAGACCC CUGAUGAG X CGAA ICUGUACC 



GGTACAGC A GGGTCTTG 



UUACAGCA CUGAUGAG X CGAA IACCCUGC 



GCAGGGTC T TGCTGTAA 



UGAGUUAC CUGAUGAG X CGAA ICAAGACC 



GGTCTTGC T GTAACTCA 



AAUGUCUG CUGAUGAG X CGAA IUUACAGC 



GCTGTAAC T CAGACATT 



GGAAUGUC CUGAUGAG X CGAA IAGUUACA 



TGTAACTC A GACATTCC 



2452 
24 5 6 



CCUUGGAA CUGAUGAG X CGAA IUCUGAGU 



ACTCAGAC A TTCCAAGG 



AUACCCUU CUGAUGAG X CGAA IAAUGUCU 



AGACATTC C AAGGGTAT 



CAUACCCU CUGAUGAG X CGAA IGAAUGUC 



GACATTCC A AGGGTATG 



2472 
2473 
2479 



GUGAAUAU CUGAUGAG X CGAA ICUUCCCA 



TGGGAAGC C ATATTCAC 



UGUGAAUA CUGAUGAG X CGAA IGCUUCCC 



GGGAAGCC A TATTCACA 



GUGAGGUG CUGAUGAG X CGAA IAAUAUGG 



CCATATTC A CACCTCAC 



GCGUGAGG CUGAUGAG X CGAA IUGAAUAU 



ATATTCAC A CCTCACGC 



GAGCGUGA CUGAUGAG X CGAA IUGUGAAU 



ATTCACAC C TCACGCTC 



AGAGCGUG CUGAUGAG X CGAA IGUGUGAA 



TTCACACC T CACGCTCT 
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CCAGAGCG CUGAUGAG X CGAA IAGGUGUG 



CACACCTC A ■ CGCTCTGG 



2490 
2492 
2497 
2512 
2518 



AUGUCCAG CUGAUGAG X CGAA ICGUGAGG 



CCTCACGC T CTGGACAT 



UCAUGUCC CUGAUGAG X CGAA IAGCGUGA 



TCACGCTC T GGACATGA 



CUAAAUCA CUGAUGAG X CGAA IUCCAGAG 



CTCTGGAC A TGATTTAG 



GGUGUCCC CUGAUGAG X CGAA ICUUCCCU 



AGGGAAGC A GGGACACC 



GCGGGGGG CUGAUGAG X CGAA IUCCCUGC 



GCAGGGAC A CCCCCCGC 



GGGCGGGG CUGAUGAG X CGAA IUGUCCCU 



AGGGACAC C CCCCGCCC 



2521 
2522 
2523 
2524 
2527 



GGGGCGGG CUGAUGAG X CGAA IGUGUCCC 



GGGACACC C CCCGCCCC 



GGGGGCGG CUGAUGAG X CGAA IGGUGUCC 



GGACACCC C CCGCCCCC 



GGGGGGCG CUGAUGAG X CGAA IGGGUGUC 



GACACCCC C CGCCCCCC 



UGGGGGGC CUGAUGAG X CGAA IGGGGUGU 



ACACCCCC C GCCCCCCA 



AGGUGGGG CUGAUGAG X CGAA ICGGGGGG 



CCCCCCGC C CCCCACCT 



AAGGUGGG CUGAUGAG X CGAA IGCGGGGG 



CCCCCGCC C CCCACCTT 



2529 
2530 



AAAGGUGG CUGAUGAG X CGAA IGGCGGGG 



CCCCGCCC C CCACCTTT 



CAAAGGUG CUGAUGAG X CGAA IGGGCGGG 



CCCGCCCC C CACCTTTG 



CCAAAGGU CUGAUGAG X CGAA IGGGGCGG 



CCGCCCCC C ACCTTTGG 



CCCAAAGG CUGAUGAG X CGAA IGGGGGCG 



CGCCCCCC A CCTTTGGG 



AUCCCAAA CUGAUGAG X CGAA IUGGGGGG 



CCCCCCAC C TTTGGGAT 



GAUCCCAA CUGAUGAG X CGAA IGUGGGGG 



CCCCCACC T TTGGGATC 



GCGGAGGC CUGAUGAG X CGAA IAUCCCAA 



TTGGGATC A GCCTCCGC 



2547 
2548 
2550 
2553 
2554 
2558 
2559 
2567 
2569 
2571 



AUGGCGGA CUGAUGAG X CGAA ICUGAUCC 



GGATCAGC C TCCGCCAT 



AAUGGCGG CUGAUGAG X CGAA IGCUGAUC 



GATCAGCC T CCGCCATT 



GGAAUGGC CUGAUGAG X CGAA IAGGCUGA 



TCAGCCTC C GCCATTCC 



CUUGGAAU CUGAUGAG X CGAA ICGGAGGC 



GCCTCCGC C ATTCCAAG 



ACUUGGAA CUGAUGAG X CGAA IGCGGAGG 



CCTCCGCC A TTCCAAGT 



GUCGACUU CUGAUGAG X CGAA IAAUGGCG 



CGCCATTC C AAGTCGAC 



UGUCGACU CUGAUGAG X CGAA IGAAUGGC 



GCCATTCC A AGTCGACA 



AAGAAGAG CUGAUGAG X CGAA IUCGACUU 



AAGTCGAC A CTCTTCTT 



UCAAGAAG CUGAUGAG X CGAA IUGUCGAC 



GTCGACAC T CTTCTTGA 



GCUCAAGA CUGAUGAG X CGAA IAGUGUCG 



CGACACTC T TCTTGAGC 



UCUGCUCA CUGAUGAG X CGAA IAAGAGUG 



CACTCTTC T TGAGCAGA 



UCACGGUC CUGAUGAG X CGAA ICUCAAGA 



TCTTGAGC A GACCGTGA 



CAAAUCAC CUGAUGAG X CGAA IUCUGCUC 



GAGCAGAC C GTGATTTG 



2603 
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260S 
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2615 



CCAGCAGG CUGAUGAG X CGAA ICCUCUCU 



AGAGAGGC A CCTGCTGG 



UUCCAGCA CUGAUGAG X CGAA IUGCCUCU 



AGAGGCAC C TGCTGGAA 



UUUCCAGC CUGAUGAG X CGAA IGUGCCUC 



GAGGCACC T GCTGGAAA 



UGGUUUCC CUGAUGAG X CGAA ICAGGUGC 



GCACCTGC T GGAAACCA 



AGAAGUGU CUGAUGAG X CGAA IUUUCCAG 



CTGGAAAC C ACACTTCT 



AAGAAGUG CUGAUGAG X CGAA IGUUUCCA 



TGGAAACC A CACTTCTT 



2619 
2621 



UCAAGAAG CUGAUGAG X CGAA IUGGUUUC 



GAAACCAC A CTTCTTGA 



UUUCAAGA CUGAUGAG X CGAA IUGUGGUU 



AACCACAC T TCTTGAAA 



CUGUUUCA CUGAUGAG X CGAA IAAGUGUG 



CACACTTC T TGAAACAG 



ACCCAGGC CUGAUGAG X CGAA IUUUCAAG 



CTTGAAAC A GCCTGGGT 



GUCACCCA CUGAUGAG X CGAA ICUGUUUC 



GAAACAGC C TGGGTGAC 



2635 
2647 
2648 



CGUCACCC CUGAUGAG X CGAA IGCUGUUU 



AAACAGCC T GGGTGACG 



UGCCUAAA CUGAUGAG X CGAA IACCGUCA 



TGACGGTC C TTTAGGCA 



CUGCCUAA CUGAUGAG X CGAA IGACCGUC 



GACGGTCC T TTAGGCAG 
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2714 
2716 
2720 
2721 
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2788 
2796 
2805 
2810 



CGGCAGGC CUGAUGAG X CGAA ICCUAAAG 

CGGCGGCA CUGAUGAG X CGAA ICUGCCUA 

ACGGCGGC CUGAUGAG X CGAA IGCUGCCU 

GAGACGGC CUGAUGAG X CGAA ICAGGCUG 

ACAGAGAC CUGAUGAG X CGAA ICGGCAGG 

CGGGACAG CUGAUGAG X CGAA IACGGCGG 

ACCGGGAC CUGAUGAG X CGAA IAGACGGC 

GUGAACCG CUGAUGAG X CGAA IACAGAGA 

GGUGAACC CUGAUGAG X CGAA IGACAGAG 

CGGCAAGG CUGAUGAG X CGAA IAACCGGG 

CUCGGCAA CUGAUGAG X CGAA IUGAACCG 

UCUCGGCA CUGAUGAG X CGAA IGUGAACC 

CCUCUCUC CUGAUGAG X CGAA I CAAGGUG 

GGUGGGGC CUGAUGAG X CGAA IACGCGCC 

GAGGGUGG CUGAUGAG X CGAA ICAGACGC 

UGAGGGUG CUGAUGAG X CGAA IGCAGACG 



UUGAGGGU CUGAUGAG X C 



. IGGCAGAC 



UUUGAGGG CUGAUGAG X CGAA IGGGCAGA 
GGUUUGAG CUGAUGAG X CGAA IUGGGGCA 
GGGUUUGA CUGAUGAG X CGAA IGUGGGGC 
AGGGUUUG CUGAUGAG X CGAA IGGUGGGG 
ACAGGGUU CUGAUGAG X CGAA IAGGGUGG 
CCCCACAG CUGAUGAG X CGAA IUUUGAGG 
GCCCCACA CUGAUGAG X CGAA IGUUUGAG 
GGCCCCAC CUGAUGAG X CGAA IGGUUUGA 
CACCAUCA CUGAUGAG X CGAA ICCCCACA 
GCACCAUC CUGAUGAG X CGAA IGCCCCAC 
GAGUCGUG CUGAUGAG X CGAA ICACCAUC 
AAGAGUCG CUGAUGAG X CGAA IAGCACCA 
GCAGGAAG CUGAUGAG X CGAA IUCGUGAG 
UUGCAGGA CUGAUGAG X CGAA IAGUCGUG 
CCUUUGCA CUGAUGAG X CGAA IAAGAGUC 
CCCUUUGC CUGAUGAG X CGAA IGAAGAGU 
GUUCCCUU CUGAUGAG X CGAA I CAGGAAG 
AGGUCUUC CUGAUGAG X CGAA IUUCCCUU 
AAUGUGGA CUGAUGAG X CGAA IUCUUCAG 
UAAUGUGG CUGAUGAG X CGAA IGUCUUCA 
CUUAAUGU CUGAUGAG X CGAA IAGGUCUU 
ACUUAAUG CUGAUGAG X CGAA IGAGGUCU 
CCACUUAA CUGAUGAG X CGAA IUGGAGGU 
UGUUAAAA CUGAUGAG X CGAA ICCACUUA 
GUUUUUCA CUGAUGAG X CGAA IUUAAAAA 
AGCUGCCG CUGAUGAG X CGAA IUUUUUCA 
GCUACAGC CUGAUGAG X CGAA ICCGUGUU 
GGAGCUAC CUGAUGAG X CGAA ICUGCCGU 
AGCUCGGG CUGAUGAG X CGAA ICUACAGC 
GUAGCUCG CUGAUGAG X CGAA IAGCUACA 



CTTTAGGC A GCCTGCCG 
TAGGCAGC C TGCCGCCG 
AGGCAGCC T GCCGCCGT 
CAGCCTGC C GCCGTCTC 
CCTGCCGC C GTCTCTGT 
CCGCCGTC T CTGTCCCG 
GCCGTCTC T GTCCCGGT 
C CGGTTCAC 



CTCTGTCC C GGTTCACC 
CCCGGTTC A CCTTGCCG 
CGGTTCAC C TTGCCGAG 
GGTTCACC T TGCCGAGA 
CACCTTGC C GAGAGAGG 
GGCGCGTC T GCCCCACC 
GCGTCTGC C CCACCCTC 
CGTCTGCC C CACCCTCA 
GTCTGCCC C ACCCTCAA 
TCTGCCCC A CCCTCAAA 
TGCCCCAC C CTCAAACC 
GCCCCACC C TCAAACCC 
CCCCACCC T CAAACCCT 
CCACCCTC A AACCCTGT 
CCTCAAAC C CTGTGGGG 
CTCAAACC C TGTGGGGC 
TCAAACCC T GTGGGGCC 
TGTGGGGC C TGATGGTG 
GTGGGGCC T GATGGTGC 
GATGGTGC T CACGACTC 
TGGTGCTC A CGACTCTT 
CTCACGAC T CTTCCTGC 
CACGACTC T TCCTGCAA 
GACTCTTC C TGCAAAGG 
ACTCTTCC T GCAAAGGG 
CTTCCTGC A AAGGGAAC 
AAGGGAAC T GAAGACCT 
CTGAAGAC C TCCACATT 
TGAAGACC T CCACATTA 
AAGACCTC C ACATTAAG 
AGACCTCC A CATTAAGT 
ACCTCCAC A TTAAGTGG 
TAAGTGGC T TTTTAACA 
TTTTTAAC A TGAAAAAC 
TGAAAAAC A CGGCAGCT 
AACACGGC A GCTGTAGC 
ACGGCAGC T GTAGCTCC 
GCTGTAGC T CCCGAGCT 
TGTAGCTC C CGAGCTAC 
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2830 
2832 
2834 



2848 
2850 
2857 
2858 
2862 
2875 
2876 
2881 
2891 
2903 
2915 
2917 
2918 
2919 
2922 
2927 
2949 
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2958 
2962 
2969 
2973 
2978 
2982 
2998 
2999 
3040 

ToTiT 



3068 
3070 
3071 
3075 
3076 
3085 
3088 
3092 
3094 



AGUAGCUC CUGAUGAG X CGAA IGAGCUAC 

AAGAGAGU CUGAUGAG X CGAA ICUCGGGA 

GGCAAGAG CUGAUGAG X CGAA IUAGCUCG 

CUGGCAAG CUGAUGAG X CGAA IAGUAGCU 

UGCUGGCA CUGAUGAG X CGAA IAGAGUAG 

AAAAUGCU CUGAUGAG X CGAA ICAAGAGA 

GAAAAUGC CUGAUGAG X CGAA I GCAAGAG 

UGUGAAAA CUGAUGAG X CGAA ICUGGCAA 

GCAAAAUG CUGAUGAG X CGAA IAAAAUGC 

AGGCAAAA CUGAUGAG X CGAA IUGAAAAU 

ACGAGAAA CUGAUGAG X CGAA ICAAAAUG 

CACGAGAA CUGAUGAG X CGAA IGCAAAAU 

CUACCACG CUGAUGAG X CGAA IAAAGGCA 

UCUGUACU CUGAUGAG X CGAA ICUUCUAC 

CUCUGUAC CUGAUGAG X CGAA IGCUUCUA 

AAUUUCUC CUGAUGAG X CGAA IUACUGGC 

CCCACCAC CUGAUGAG X CGAA IAAUUUCU 

ACCUCGAA CUGAUGAG X CGAA IUUCCCAC 

CUGCAGGG CUGAUGAG X CGAA IACACCUC 

CUCUGCAG CUGAUGAG X CGAA IUGACACC 

GCUCUGCA CUGAUGAG X CGAA IGUGACAC 

AGCUCUGC CUGAUGAG X CGAA IGGUGACA 

CAUAGCUC CUGAUGAG X CGAA I CAGGGUG 

CUCACCAU CUGAUGAG X CGAA ICUCUGCA 

GGCACCUA CUGAUGAG X CGAA ICCUUAUC 

UACAGCCU CUGAUGAG X CGAA ICACCUAA 

UUACAGCC CUGAUGAG X CGAA IGCACCUA 

AUGCUUAC CUGAUGAG X CGAA ICCUGGCA 

GCUCAGAA CUGAUGAG X CGAA ICUUACAG 

GCCAGCUC CUGAUGAG X CGAA IAAUGCUU 



AACAAGCC CUGAUGAG X CGAA 
AAACAACA CUGAUGAG X CGAA ICCAGCUC 



CAUAUACA CUGAUGAG X ( 



, IACUUAAA 



ACAUAUAC CUGAUGAG X CGAA IGACUUAA 

UUUUGAAA CUGAUGAG X CGAA ICUACUAU 

GUCCAUUU CUGAUGAG X CGAA IAAAUGCU 

GUUAAACC CUGAUGAG X CGAA IUACGUCC 

GGAUAGGA CUGAUGAG X CGAA IUUAAACC 

AGGAUAGG CUGAUGAG X CGAA IGUUAAAC 

CAAGGAUA CUGAUGAG X CGAA IAGGUUAA 

CCAAGGAU CUGAUGAG X CGAA IGAGGUUA 

CUCUCCAA CUGAUGAG X CGAA IAUAGGAG 

GCUCUCCA CUGAUGAG X CGAA IGAUAGGA 

GAGCCAGC CUGAUGAG X CGAA ICUCUCCA 

GGAGAGCC CUGAUGAG X CGAA ICUGCUCU 

AGGUGGAG CUGAUGAG X CGAA ICCAGCUG 

CAAGGUGG CUGAUGAG X CGAA IAGCCAGC 



GTAGCTCC C GAGCTACT 
TCCCGAGC T ACTCTCTT 
CGAGCTAC T CTCTTGCC 
AGCTACTC T CTTGCCAG 
CTACTCTC T TGCCAGCA 
TCTCTTGC C AGCATTTT 
CTCTTGCC A GCATTTTC 
TTGCCAGC A TTTTCACA 
GCATTTTC A CATTTTGC 
ATTTTCAC A TTTTGCCT 
CATTTTGC C TTTCTCGT 
ATTTTGCC T TTCTCGTG 
TGCCTTTC T CGTGGTAG 
GTAGAAGC C AGTACAGA 
TAGAAGCC A GTACAGAG 
GCCAGTAC A GAGAAATT 
AGAAATTC T GTGGTGGG 
GTGGGAAC A TTCGAGGT 
GAGGTGTC A CCCTGCAG 
GGTGTCAC C CTGCAGAG 
GTGTCACC C TGCAGAGC 
TGTCACCC T GCAGAGCT 
CACCCTGC A GAGCTATG 
TGCAGAGC T ATGGTGAG 
GATAAGGC T TAGGTGCC 
TTAGGTGC C AGGCTGTA 
TAGGTGCC A GGCTGTAA 
TGCCAGGC T GTAAGCAT 
CTGTAAGC A TTCTGAGC 
AAGCATTC T GAGCTGGC 
TTCTGAGC T GGCTTGTT 
GAGCTGGC T TGTTGTTT 
TTTAAGTC C TGTATATG 
TTAAGTCC T GTATATGT 
ATAGTAGC A TTTCAAAA 
AGCATTTC A AAATGGAC 
GGACGTAC T GGTTTAAC 
GGTTTAAC C TCCTATCC 
GTTTAACC T CCTATCCT 
TTAACCTC C TATCCTTG 
TAACCTCC T ATCCTTGG 
CTCCTATC C TTGGAGAG 
TCCTATCC T TGGAGAGC 
TGGAGAGC A GCTGGCTC 
AGAGCAGC T GGCTCTCC 
CAGCTGGC T CTCCACCT 
GCTGGCTC T CCACCTTG 
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3096 AACAAGGU CUGAUGAG X CGAA IAGAGCCA 

3097 UAACAAGG CUGAUGAG X CGAA I GAGAGCC 

3099 UGUAACAA CUGAUGAG X CGAA IUGGAGAG 

3100 GUGUAACA CUGAUGAG X CGAA I GUGGAGA 
3107 ACAUAAUG CUGAUGAG X CGAA IUAACAAG ' 
3109 UAACAUAA CUGAUGAG X CGAA IUGUAACA 
3132 AGCAGAGC CUGAUGAG X CGAA ICUCGCUA . 
3135 CAUAGCAG CUGAUGAG X CGAA ICAGCUCG 
3137 GACAUAGC CUGAUGAG X CGAA IAGCAGCU 
3140 AAGGACAU CUGAUGAG X CGAA ICAGAGCA 

3146 UGGCUUAA CUGAUGAG X CGAA IACAUAGC 

3147 UUGGCUUA CUGAUGAG X CGAA IGACAUAG 

3153 UAAAUAUU CUGAUGAG X CGAA ICUUAAGG 

3154 GUAAAUAU CUGAUGAG X CGAA IGCUUAAG 
3163 ACCUGAUG CUGAUGAG X CGAA IUAAAUAU 
316 5 UGACCUGA CUGAUGAG X CGAA IAGUAAAU 
3168 UAAUGACC CUGAUGAG X CGAA IAUGAGUA 
3173 AAAAAUAA CUGAUGAG X CGAA IACCUGAU 

3155 AUGGCCAU CUGAUGAG X CGAA IUAAAAAA 

3191 UAUUCCAU CUGAUGAG X CGAA ICCAUUGU 

3192 UUAUUCCA CUGAUGAG X CGAA IGCCAUUG 

3203 GUAAAAAU CUGAUGAG X CGAA IUUUAUUC 

3204 UGUAAAAA CUGAUGAG X CGAA IGUUUAUU 



TGGCTCTC C ACCTTGTT 
GGCTCTCC A CCTTGTTA 
CTCTCCAC C TTGTTACA 
TCTCCACC T TGTTACAC 
CTTGTTAC A CATTATGT 
TGTTACAC A TTATGTTA 
TAGCGAGC T GCTCTGCT 
CGAGCTGC T CTGCTATG 
AGCTGCTC T GCTATGTC 
TGCTCTGC T ATGTCCTT 
GCTATGTC C TTAAGCCA 
CTATGTCC T TAAGCCAA 
CCTTAAGC C AATATTTA 
CTTAAGCC A' ATATTTAC 
ATATTTAC T CATCAGGT 
ATTTACTC A- TCAGGTCA 
TACTCATC A GGTCATTA 
ATCAGGTC A TTATTTTT 
TTTTTTAC A ATGGCCAT 
ACAATGGC C ATGGAATA 
CAATGGCC A TGGAATAA 
GAATAAAC C ATTTTTAC 
AATAAACC A TTTTTACA 
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CCUUACAA 1 
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TTGTAAGG GGCTAGCTACAACGA TTTCCAAT 1 


TTCTTGGG GGCTAGCTACAACGA TGTAAGGT 


CTCTCGAG GGCTAGCTACAACGA TTCTTGGG 


ATGTAAGA GGCTAGCTACAACGA CTCTCGAG 


TGGAAATG GGCTAGCTACAACGA AAGATCTC 


AGTGGAAA GGCTAGCTACAACGA GTAAGATC 1 


TGGTATAG GGCTAGCTACAACGA GGAAATGT 1 


ATGTGGTA GGCTAGCTACAACGA AGTGGAAA 1 


CCATGTGG GGCTAGCTACAACGA ATAGTGGA 1 


AGGCCATG GGCTAGCTACAACGA GGTATAGT 1 


TCAGGCCA GGCTAGCTACAACGA GTGGTATA 1 


AAGTCAGG GGCTAGCTACAACGA CATGTGGT 1 


CTCCAAAG GGCTAGCTACAACGA CAGGCCAT j 


GCTGGTGA GGCTAGCTACAACGA TCAGGGAC 


GAGGCTGG GGCTAGCTACAACGA GATTCAGG 


GAATGAGG GGCTAGCTACAACGA TGGTGATT 
TTCAAGAA GGCTAGCTACAACGA GAGGCTGG 


CTCTCGGA GGCTAGCTACAACGA TTTGAAAA 
GACCCTGA GGCTAGCTACAACGA TCTCGGAC 


CTGAGTGA GGCTAGCTACAACGA CCTGACTC 


GGGCTGAG GGCTAGCTACAACGA GACCCTGA 


GCTCCGGG GGCTAGCTACAACGA TGAGTGAC 


GGCCCGTG GGCTAGCTACAACGA TCCGGGCT 


CGGGCCCG GGCTAGCTACAACGA GCTCCGGG 


ACAACGGG GGCTAGCTACAACGA CCGTGCTC 


CACCACAA GGCTAGCTACAACGA GGGCCCGT 


GTGCACCA GGCTAGCTACAACGA AACGGGCC 


| GCAGTGCA GGCTAGCTACAACGA CACAACGG 
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Table 9 



Table 9: Human methionine aminopeptidase type 2 (Met AP-2) Hammerhead 
Ribozyme and Target Sequence 



Nt. 
position 


Ribozyme Sequence 


Seq ID 
nos. 


Substrate Sequence 


Scq ID 

nos. 


9 


CCGAGAGA CUGAUGAG X CGAA ACGAGGGA 


1 


TCCCTCGT C TCTCTCGG 


413 


11 


GCCCGAGA CUGAUGAG X CGAA AGACGAGG 


2 


CCTCGTCT C TCTCGGGC 


414 


13 


UUGCCCGA CUGAUGAG X CGAA AGAGACGA 


3 


TCGTCTCT C TCGGGCAA 


415 


15 


UGUUGCCC CUGAUGAG X CGAA AGAGAGAC 


4 


GTCTCTCT C GGGCAACA 


416 


43 


GAGGCCGC CUGAUGAG X CGAA ACCUCCUC 


5 


GAGGAGGT A GCGGCCTC 


417 


51 


GGCUCCCG CUGAUGAG X CGAA AGGCCGCU 


6 


AGCGGCCT C CGGGAGCC 


418 


80 


GUCGUCUG CUGAUGAG X CGAA AUCCAGGU 


7 


ACCTGGAT C CAGACGAC 


419 


108 


CAGCCGUA CUGAUGAG X CGAA AGGCAGCU 


8 


AGCTGCCT C TACGGCTG 


420 


110 


CUCAGCCG CUGAUGAG X CGAA AGAGGCAG 


9 


CTGCCTCT A CGGCTGAG 


421 


167 


UGCUGCAG CUGAUGAG X CGAA AGGCCCUU 


10 


AAGGGCCT T CTGCAGCA 


422 


168 


CUGCUGCA CUGAUGAG X CGAA AAGGCCCU 


11 


AGGGCCTT C TGCAGCAG 


423 


194 


UGAUUCUU CUGAUGAG X CGAA AUCAGGUU 


12 


AACCTGAT A AAGAATCA 


424 


201 


AGGCUCCU CUGAUGAG X CGAA AUUCUUUA 


13 


TAAAGAAT C AGGAGCCT 


425 


210 


CAUCCACU CUGAUGAG X CGAA AGGCUCCU 


14 


AGGAGCCT C AGTGGATG 


426 


223 


UGUCUUGC CUGAUGAG X CGAA ACUUCAUC 


15 


GATGAAGT A GCAAGACA 


427 


234 1 AUCUUUCC CUGAUGAG X CGAA ACUGUCUU 


16 


AAGACAGT T GGAAAGAT 


428 


243 


CCAAUGCU CUGAUGAG X CGAA AUCUUUCC 


17 


GGAAAGAT C AGCATTGG 


429 


249 


UAUCUUCC CUGAUGAG X CGAA AUGCUGAU 


18 


ATCAGCAT T GGAAGATA 


430 


257 


UCUUUCUU CUGAUGAG X CGAA AUCUUCCA 


19 


TGGAAGAT A AAGAAAGA 


431 


355 


UCUGUUUG CUGAUGAG X CGAA ACUUUUGG 


20 


CCAAAAGT T CAAACAGA 


432 


356 


GUCUGUUU CUGAUGAG X CGAA AACUUUUG 


21 


CAAAAGTT C AAACAGAC 


433 


368 


AACUGAGG CUGAUGAG X CGAA AGGGUCUG 


22 


CAGACCCT C CCTCAGTT 


434 


372 


UUGGAACU CUGAUGAG X CGAA AGGGAGGG 


23 


CCCTCCCT C AGTTCCAA 


435 


376 


CAUAUUGG CUGAUGAG X CGAA ACUGAGGG 


24 


CCCTCAGT T CCAATATG 


436 


377 


ACAUAUUG CUGAUGAG X CGAA AACUGAGG 


25 


CCTCAGTT C CAATATGT 


437 


382 


AGGUCACA CUGAUGAG X CGAA AUUGGAAC 


26 


GTTCCAAT A TGTGACCT 


438 


393 


CAUUAGGA CUGAUGAG X CGAA ACAGGUCA. 


27 


TGACCTGT A TCCTAATG 


439 


395 


ACCAUUAG CUGAUGAG X CGAA AUACAGGU 


28 


ACCTGTAT C CTAATGGT 


440 


398 


UACACCAU CUGAUGAG X CGAA AGGAUACA 


29 


TGTATCCT A ATGGTGTA 


441 


406 


UUGGGAAA CUGAUGAG X CGAA ACACCAUU 








408 


CUUUGGGA CUGAUGAG X CGAA AUACACCA 


31 


TGGTGTAT T TCCCAAAG 


443 


409 


CCUUUGGG CUGAUGAG X CGAA AAUACACC 


32 


GGTGTATT T CCCAAAGG 


444 


410 


UCCUUUGG CUGAUGAG X CGAA AAAUACAC 


33 


GTGTATTT C CCAAAGGA 


445 


432 


UGGGUGGG CUGAUGAG X CGAA AUUCGCAU 


34 


ATGCGAAT A CCCACCCA 


446 


464 


AGUUCUCC CUGAUGAG X CGAA AGCAGCUG 


35 


CAGCTGCT T GGAGAACT 


447 


473 


UUCACUUG CUGAUGAG X CGAA AGUUCUCC 


36 


GGAGAACT A CAAGTGAA 


448 


495 


CCUGAUCU CUGAUGAG X CGAA AUGCUUUC 


37 


GAAAGCAT T AGATCAGG 


449 


496 


GCCUGAUC CUGAUGAG X CGAA AAUGCUUU 


38 


AAAGCATT A GATCAGGC 


450 


500 


ACUUGCCU CUGAUGAG X CGAA AUCUAAUG 


39 


CATTAGAT C AGGCAAGT 


451 


517 


UCAUUCCA CUGAUGAG X CGAA AUCUCUUC 


40 


GAAGAGAT T TGGAATGA 


452 


518 


AUCAUUCC CUGAUGAG X CGAA AAUCUCUU 


41 


AAGAGATT T GGAATGAT 


453 


527 


UUCUCGAA CUGAUGAG X CGAA AUCAUUCC 


42 


GGAATGAT T TTCGAGAA 


454 
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CUUCUCGA CUGAUGAG X CGAA AAUCAUUC 



GAATGATT T TCGAGAAG 



GCUUCUCG CUGAUGAG X CGAA AAAUCAUU 



AATGATTT T CGAGAAGC 



AGCUUCUC CUGAUGAG X CGAA AAAAUCAU 



ATGATTTT C GAGAAGCT 



AACUUGUC CUGAUGAG X CGAA AUGUGCUU 



AAGCACAT C GACAAGTT 



UAUUUUCU CUGAUGAG X CGAA ACUUGUCG 



CGACAAGT T AGAAAATA 



GUAUUUUC CUGAUGAG X CGAA AACUUGUC 



GACAAGTT A GAAAATAC 



UCAUUACG CUGAUGAG X CGAA AUUUUCUA 



TAGAAAAT A CGTAATGA 



CAGCUCAU CUGAUGAG X CGAA ACGUAUUU 



AAATACGT A ATGAGCTG 



CCAGGCUU CUGAUGAG X CGAA AUCCAGCU 



AGCTGGAT C AAGCCTGG 



CAGAUUUC CUGAUGAG X CGAA AUCAUUGU 



ACAATGAT A GAAATCTG 



UUUUCACA CUGAUGAG X CGAA AUUUCUAU 



ATAGAAAT C TGTGAAAA 



AGUCUUCC CUGAUGAG X CGAA ACUUUUCA 



TGAAAAGT T GGAAGACT 



CUUGCGUG CUGAUGAG X CGAA ACAGUCUU 



AAGACTGT T CACGCAAG 



ACUUGCGU CUGAUGAG X CGAA AACAGUCU 



AGACTGTT C ACGCAAGT 



CUUUUAUU CUGAUGAG X CGAA ACUUGCGU 



ACGCAAGT T AATAAAAG 



UCUUUUAU CUGAUGAG X CGAA AACUUGCG 



CGCAAGTT A ATAAAAGA 



UUCUCUUU CUGAUGAG X CGAA AUUAACUU 



AAGTTAAT A AAAGAGAA 



679 
680 



CUGCAUUU CUGAUGAG X CGAA AUCCAUUC 



GAATGGAT T AAATGCAG 



CCUGCAUU CUGAUGAG X CGAA AAUCCAUU 



AATGGATT A AATGCAGG 



CAGUAGGA CUGAUGAG X CGAA AUGCCAGG 



CCTGGCAT T TCCTACTG 



CCAGUAGG CUGAUGAG X CGAA AAUGCCAG 



CTGGCATT T CCTACTGG 



UCCAGUAG CUGAUGAG X CGAA AAAUGCCA 



TGGCATTT C CTACTGGA 



ACAUCCAG CUGAUGAG X CGAA AGGAAAUG 



CATTTCCT A CTGGATGT 



AUUGAGAG CUGAUGAG X CGAA ACAUCCAG 



CTGGATGT T CTCTCAAT 



UAUUGAGA CUGAUGAG X CGAA AACAUCCA 



TGGATGTT C TCTCAATA ' 



AUUAUUGA CUGAUGAG X CGAA AGAACAUC 



GATGTTCT C TCAATAAT 



CAAUUAUU CUGAUGAG X CGAA AGAGAACA 



TGTTCTCT C AATAATTG 



AGCACAAU CUGAUGAG X CGAA AUUGAGAG 



CTCTCAAT A ATTGTGCT 



GGCAGCAC CUGAUGAG X CGAA AUUAUUGA 



TCAATAAT T GTGCTGCC 



GGGAGUAU CUGAUGAG X CGAA AUGGGCAG 



CTGCCCAT T ATACTCCC 



UGGGAGUA CUGAUGAG X CGAA AAUGGGCA 



TGCCCATT A TACTCCCA 



AUUGGGAG CUGAUGAG X CGAA AUAAUGGG 



CCCATTAT A CTCCCAAT 



GGCAUUGG CUGAUGAG X CGAA AGUAUAAU 



ATTATACT C CCAATGCC 



UACUGUAA CUGAUGAG X CGAA ACUGUUGU 



ACAACAGT A TTACAGTA 



CAUACUGU CUGAUGAG X CGAA AUACUGUU 



AACAGTAT T ACAGTATG 



UCAUACUG CUGAUGAG X CGAA AAUACUGU 



ACAGTATT A CAGTATGA 



UGUCAUCA CUGAUGAG X CGAA ACUGUAAU 



ATTACAGT A TGATGACA 



757 
772 



k CUGAUGAG X CGAA AUGUCAUC 
I CUGAUGAG X CGAA ACAGAUGU 



GATGACAT C TGTAAAAT 



ACATCTGT A AAATAGAC 



CCAAAGUC CUGAUGAG X CGAA AUUUUACA 



TGTAAAAT A GACTTTGG 



GUGUUCCA CUGAUGAG X CGAA AGUCUAUU 



AATAGACT T TGGAACAC 



UGUGUUCC CUGAUGAG X CGAA AAGUCUAU 



ATAGACTT T GGAACACA 



ACCACUUA CUGAUGAG X CGAA AUGUGUUC 



GAACACAT A TAAGTGGT 



CUACCACU CUGAUGAG X CGAA AUAUGUGU 



ACACATAT A AGTGGTAG 



AAUAAUCC CUGAUGAG X CGAA ACCACUUA 



TAAGTGGT A GGATTATT 



CAGUCAAU CUGAUGAG X CGAA AUCCUACC 



GGTAGGAT T ATTGACTG 



ACAGUCAA CUGAUGAG X CGAA AAUCCUAC 



GTAGGATT A TTGACTGT 
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GCACAGUC CUGAUGAG X CGAA AUAAUCCU 
GACAGUAA CUGAUGAG X CGAA AGCACAGU 



UGACAGUA CUGAUGAG X CGAA AAGCACAG 



AGGATTAT T GACTGTGC 
ACTGTGCT T TTACTGTC 



CTGTGCTT T TACTGTCA 



GUGACAGU CUGAUGAG X CGAA AAAGCACA 



GTGCTTT T ACTGTCAC 



AGUGACAG CUGAUGAG X CGAA AAAAGCAC 



GTGCTTTT A CTGTCACT 



UUAAAAGU CUGAUGAG X CGAA ACAGUAAA 



"TTACTGT C ACTTTTAA 



GGGAUUAA CUGAUGAG X CGAA AGUGACAG 



CTGTCACT T TTAATCCC 



UGGGAUUA CUGAUGAG X CGAA • AAGUGACA 



TGTCACTT T TAATCCCA 



UUGGGAUU CUGAUGAG X CGAA AAAGUGAC 



GTCACTTT T AATCCCAA 



1 CUGAUGAG X CGAA AAAAGUGA 



: A ATCCCAAA 



AUAUUUGG CUGAUGAG X CGAA AUUAAAAG 



CTTTTAAT C CCAAATAT 



ACGUAUCA CUGAUGAG X CGAA AUUUGGGA 



TCCCAAAT A TGATACGT 



UAAUAACG CUGAUGAG X CGAA AUCAUAUU 



AATATGAT A CGTTATTA 



CUUUUAAU CUGAUGAG X CGAA ACGUAUCA 



TGATACGT T ATTAAAAG 



GCUUUUAA CUGAUGAG X CGAA AACGUAUC 



GATACGTT A TTAAAAGC 



CAGCUUUU CUGAUGAG X CGAA AUAACGUA 



TACGTTAT T AAAAGCTG 



ACAGCUUU CUGAUGAG X CGAA AAUAACGU 



ACGTTATT A AAAGCTGT 



GCAUCUUU CUGAUGAG X CGAA ACAGCUUU 



AAAGCTGT A AAAGATGC 



AGUGUUAG CUGAUGAG X CGAA AGCAUCUU 



AAGATGCT A CTAACACT 



UCCAGUGU CUGAUGAG X CGAA AGUAGCAU 



ATGCTACT A ACACTGGA 



GCACACUU CUGAUGAG X CGAA AUUCCAGU 



ACTGGAAT A AAGTGTGC 



CGAACAUC CUGAUGAG X CGAA AUUCCAGC 
CACAGACG CUC 



GCTGGAAT T GATGTTCG 



C CGAA ACAUCAAU 



ATTGATGT T CGTCTGTG 



ACACAGAC CUGAUGAG X CGAA AACAUCAA 



TTGATGTT C GTCTGTGT 



AUCACACA CUGAUGAG X CGAA ACGAACAU 



ATGTTCGT C TGTGTGAT 



GCCUCACC CUGAUGAG X CGAA ACAUCACA 
ACUUCUUG CUGAUGAG X CGAA AUGGCCUC 



TGTGATGT T GGTGAGGC 



GAGGCCAT C CAAGAAGT 



GACUCCAU CUGAUGAG X CGAA ACUUCUUG 



CAAGAAGT T ATGGAGTC 



GGACUCCA CUGAUGAG X CGAA AACUUCUU 



AAGAAGTT A TGGAGTCC 



CUUCAUAG CUGAUGAG X CGAA ACUCCAUA 



CAACUUCA CUGAUGAG X CGAA AGGACUCC 
UCUAUUUC CUGAUGAG X CGAA ACUUCAUA 
UUCCCAUC CUGAUGAG X CGAA AUUUCAAC 
UCACUUGA CUGAUGAG X CGAA AUGUCUUC 
UUUCACUU CUGAUGAG X CGAA AUAUGUCU 
AGAUUACG CUGAUGAG X CGAA AUUGGUUU 
AUUUAGAU CUGAUGAG X CGAA ACGGAUUG 
UCCAUUUA CUGAUGAG X CGAA AUUACGGA 
UGUCCAUU CUGAUGAG X CGAA AGAUUACG 
CCCAAUUG CUGAUGAG X CGAA AUGUCCAU 
GCCCAAUU CUGAUGAG X CGAA AAUGUCCA 
UAUUGCCC CUGAUGAG X CGAA AUUGAAUG 
GUAUUCUA CUGAUGAG X CGAA AUUGCCCA 
AUGUAUUC CUGAUGAG X CGAA AUAUUGCC 
CCAGCAUG CUGAUGAG X CGAA AUUCUAUA 
CCUUUCAC CUGAUGAG X CGAA AUCGGCAC 
AUUGCAUA CUGAUGAG X CGAA ACUUCUCC 



TATGGAGT C CTATGAAG 



T A TGAAGTTG 
TATGAAGT T GAAATAGA 
GTTGAAAT A GATGGGAA 
GAAGACAT A TCAAGTGA 
AGACATAT C AAGTGAAA 
AAACCAAT C CGTAATCT 
CAATCCGT A ATCTAAAT 
TCCGTAAT C TAAATGGA 
CGTAATCT A AATGGACA 
ATGGACAT T CAATTGGG 
TGGACATT C AATTGGGC 
CATTCAAT T GGGCAATA 
TGGGCAAT A TAGAATAC 
GGCAATAT A GAATACAT 
TATAGAAT A CATGCTGG 
GTGCCGAT T GTGAAAGG 
GGAGAAGT A TATGCAAT 



529 
530 
531 



539 
540 



547 

548 
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CAAUUGCA CUGAUGAG X CGAA AUACUUCU 
AAGGUUUC CUGAUGAG X CGAA AUUGCAUA 



AGAAGTAT A TGCAATTG 
TATGCAAT T GAAACCTT 



UACUACCA CUGAUGAG X CGAA AGGUUUCA 



TGAAACCT T TGGTAGTA 



GUACUACC CUGAUGAG X CGAA AAGGUUUC 



GAAACCTT T GGTAGTAC 



UCCUGUAC CUGAUGAG X CGAA ACCAAAGG 



CCTTTGGT A GTACAGGA 



UUUUCCUG CUGAUGAG X CGAA ACUACCAA 



TTGGTAGT A CAGGAAAA 



UCAUGAAC CUGAUGAG X CGAA ACACCUUU 



AAAGGTGT T GTTCATGA 



UCAUCAUG CUGAUGAG X CGAA ACAACACC 



GGTGTTGT T CATGATGA 



AUCAUCAU CUGAUGAG X CGAA AACAACAC 



GTGTTGTT C ATGATGAT 



ACAUUCCA CUGAUGAG X CGAA AUCAUCAU 



ATGATGAT A TGGAATGT 



GUAAUGUG CUGAUGAG X CGAA ACAUUCCA 



TGGAATGT T CACATTAC 



UGUAAUGU CUGAUGAG X CGAA AACAUUCC 



GGAATGTT C ACATTACA 



1325 
1335 



UUUCAUGU CUGAUGAG X CGAA AUGUGAAC 



GTTCACAT T ACATGAAA 



UUUUCAUG CUGAUGAG X CGAA AAUGUGAA 
AACAUCAA CUGAUGAG X CGAA AUUUUUCA 
CAACAUCA CUGAUGAG X CG AA AAUUUUUC 
CCAACAUC CUGAUGAG X CGAA AAAUUUUU 
ACAUGUCC CUGAUGAG X CGAA ACAUCAAA 
GGAAGCCU CUGAUGAG X CGAA AUUGGCAC 
GUUCUUGG CUGAUGAG X CGAA AGCCUUAU 
UGUUCUUG CUGAUGAG X CGAA AAGCCUUA 
CAUUUAAC CUGAUGAG X CGAA AGUGUUUU 



UGACAUUU CUGAUGAG X CGAA ACAAGUGU 
AUGACAUU ( 



J CUGAUGAG X CGAA AACAAGUG 



UCAUUGAU CUGAUGAG X CGAA ACAUUUAA 
UUUUCAUU CUGAUGAG X CGAA AUGACAUU 
GGGUUCCA CUGAUGAG X CGAA AGUUUUCA 
AGGGUUCC CUGAUGAG ) 



CAGAAGGC CUGAUGAG 



X CGAA / 
X CGAA / 



^ CUGAUGAG X CGAA AAGGCAAG 



UGCGGCAG CUGAUGAG X CGAA AGGCAAGG 
CUGCGGCA C 
UCCCAAGC CUGAUGAG 
UUUCUCCC CUGAUGAG 



X CGAA 
X CGAA 

CAAGUAUU CUGAUGAG X CGAA 



AGCGAUCC 
ACUUUCUC 

CCAUCAAG CUGAUGAG X CGAA AUUUACUU 
GAGCCAUC CUGAUGAG X CGAA AGUAUUUA 
AUUCUUCA CUGAUGAG X CGAA AGCCAUCA 
GUCACACA CUGAUGAG X CGAA AUUCUUCA 
CAAUGCCC CUGAUGAG X CG AA AGUCACAC 
GGAUCUAC CUGAUGAG X CGAA AUGCCCAA 
UAUGGAUC CUGAUGAG X CGAA ACAAUGCC 
AUCUACAA 
AUGGAUCU 



UGGAUAUG CUGAUGAG X CGAA 
AUGGUGGA CUGAUGAG X CGAA 



UAAUGGUG CUGAUGAG X CGAA AUAUGGAU 
UGUCACAU CUGAUGAG X CG AA AUGGUGGA 
AUGUCACA CUGAUGAG X CGAA AAUGGUGG 
GAUCCUUU CUGAUGAG X CGAA AUGUCACA 



TTCACATT A CATGAAAA 
TGAAAAAT T TTGATGTT 
GAAAAATT T TGATGTTG 
AAAAATTT T GATGTTGG 
TTTGATGT T GGACATGT 
GTGCCAAT A AGGCTTCC 
ATAAGGCT T CCAAGAAC 
TAAGGCTT C CAAGAACA 
AAAACACT T GTTAAATG 
ACACTTGT T AAATGTCA 
CACTTGTT A AATGTCAT 
TTAAATGT C ATCAATGA 
AATGTCAT C AATGAAAA 
TGAAAACT T TGGAACCC 
GAAAACTT T GGAACCCT 
GGAACCCT T GCCTTCTG 
CCTTGCCT T CTGCCGCA 
CTTGCCTT C TGCCGCAG 
GGCTGGAT C GCTTGGGA 
GGATCGCT T GGGAGAAA 
GAGAAAGT A AATACTTG 
AAGTAAAT A CTTGATGG 
TAAATACT T GATGGCTC 
TGATGGCT C TGAAGAAT 
TGAAGAAT C TGTGTGAC 
GTGTGACT T GGGCATTG 
TTGGGCAT T GTAGATCC 
GGCATTGT A GATCCATA 
TTGTAGAT C CATATCCA 
AGATCCAT A TCCACCAT 
ATCCATAT C CACCATTA 
TCCACCAT T ATGTGACA 
CCACCATT A TGTGACAT 
TGTGACAT T AAAGGATC 



564 
565 



569 
570 



582 
5B3 



589 

590 
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UGAUCCUU CUGAUGAG X CGAA AAUGUCAC 



GTGACATT A AAGGATCA 



CUGUAUAU CUGAUGAG X CGAA AUCCUUUA 



TAAAGGAT C ATATACAG 



GCGCUGUA CUGAUGAG X CGAA AUGAUCCU 



AGGATCAT A TACAGCGC 



UUGCGCUG CUGAUGAG X CGAA AUAUGAUC 



GATCATAT A CAGCGCAA 



UAUGUUCA CUGAUGAG X CGAA AUUGCGCU 



AGCGCAAT T TGAACATA 



GUAUGUUC CUGAUGAG X CGAA AAUUGCGC 



GCGCAATT T GAACATAC 



CAGGAUGG CUGAUGAG X CGAA AUGUUCAA 



TTGAACAT A CCATCCTG 



CGCAACAG CUGAUGAG X CGAA AUGGUAUG 



CATACCAT C CTGTTGCG 



UUGGACGC CUGAUGAG X CGAA ACAGGAUG 



CATCCTGT T GCGTCCAA 



CGAA ACGCAACA 



TGTTGCGT C CAACATGT 



AACUUCUU CUGAUGAG X CGAA ACAUGUUG 



CAACATGT A AAGAAGTT 



CUGCUGAC CUGAUGAG X CGAA ACUUCUUU 



AAAGAAGT T GTCAGCAG 



CCUCUGCU CUGAUGAG X CGAA ACAACUUC 



GAAGTTGT C AGCAGAGG 



AAGUUUAA CUGAUGAG X CGAA AGUCAUCU 



AGATGACT A TTAAACTT 



CUAAGUUU CUGAUGAG X CGAA AUAGUCAU 



ATGACTAT T AAACTTAG 



ACUAAGUU CUGAUGAG X CGAA AAUAGUCA 



TGACTATT A AACTTAGT 



UUUGGACU CUGAUGAG X CGAA AGUUUAAU 



ATTAAACT T AGTCCAAA 



CUUUGGAC CUGAUGAG X CGAA AAGUUUAA 



TTAAACTT A GTCCAAAG 



UGGCUUUG CUGAUGAG X CGAA ACUAAGUU 



AACTTAGT C CAAAGCCA 



AAGGUGUU CUGAUGAG X CGAA AGGUGGCU 



AGCCACCT C AACACCTT 



AGAAAAUA CUGAUGAG X ( 



CAACACCT T TATTTTCT 



CAGAAAAU CUGAUGAG X CGAA AAGGUGUU 



AACACCTT T ATTTTCTG 



UCAGAAAA CUGAUGAG X CGAA AAAGGUGU 



ACACCTTT A TTTTCTGA 



GCUCAGAA CUGAUGAG X CGAA AUAAAGGU 



ACCTTTAT T TTCTGAGC 



AGCUCAGA CUGAUGAG X CGAA AAUAAAGG 



CCTTTATT T TCTGAGCT 



AAGCUCAG CUGAUGAG X CGAA AAAUAAAG 



CTTTATTT T CTGAGCTT 



AAAGCUCA CUGAUGAG X CGAA AAAAUAAA 



TTTATTTT C TGAGCTTT 



UUCCAACA CUGAUGAG X ( 



k AGCUCAGA 



TCTGAGCT T TGTTGGAA 



UUUCCAAC CUGAUGAG X CGAA AAGCUCAG 



CTGAGCTT T GTTGGAAA 



UGUUUUCC CUGAUGAG X CGAA ACAAAGCU 



AGCTTTGT T GGAAAACA 



AAUUCUGG CUGAUGAG X CGAA AUCAUGUU 



AACATGAT A CCAGAATT 



GGCAAAUU CUGAUGAG X CGAA AUUCUGGU 



ACCAGAAT T AATTTGCC 



UGGCAAAU CUGAUGAG X CGAA AAUUCUGG 



CCAGAATT A ATTTGCCA 



AUGUGGCA CUGAUGAG X CGAA AUUAAUUC 



GAATTAAT T TGCCACAT 



CAUGUGGC CUGAUGAG X CGAA AAUUAAUU 



AATTAATT T GCCACATG 



AAACAGAC CUGAUGAG X CGAA ACAUGUGG 



CCACATGT T GTCTGTTT 



UUAAAACA CUGAUGAG X CGAA ACAACAUG 



CATGTTGT C TGTTTTAA 



ACUGUUAA CUGAUGAG X CGAA ACAGACAA 



TTGTCTGT T TTAACAGT 



CACUGUUA CUGAUGAG X CGAA AACAGACA 



TGTCTGTT T TAACAGTG 



CCACUGUU CUGAUGAG X CGAA AAACAGAC 



1 T AACAGTGG 



UCCACUGU CUGAUGAG X CGAA AAAACAGA 



TCTGTTTT A ACAGTGGA 



AAAAGUAU CUGAUGAG X CGAA ACAUGGGU 



ACCCATGT A ATACTTTT 



GAUAAAAG CUGAUGAG X CGAA AUUACAUG 



CATGTAAT A CTTTTATC 



AUGGAUAA CUGAUGAG X CGAA AGUAUUAC 



GTAATACT T TTATCCAT 



CAUGGAUA CUGAUGAG X CGAA AAGUAUUA 



TAATACTT T TATCCATG 



ACAUGGAU CUGAUGAG X CGAA AAAGUAUU 



AATACTTT T ATCCATGT 



AACAUGGA CUGAUGAG X CGAA AAAAGUAU 



ATACTTTT A TCCATGTT 
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UAAACAUG CUGAUGAG X CGAA AUAAAAGU 



ACTTTTAT C CATGTTTA 



CUUUUUUA CUGAUGAG X CGAA ACAUGGAU 



ATCCATGT T TAAAAAAG 



UCUUUUUU CUGAUGAG X CGAA AACAUGGA 



TCCATGTT T AAAAAAGA 



UUCUUUUU CUGAUGAG X CGAA AAACAUGG 



CCATGTTT A AAAAAGAA 



UUUGUCCA CUGAUGAG X CGAA AUUCCUUC 



GAAGGAAT T TGGACAAA 



CUUUGUCC CUGAUGAG X CGAA AAUUCCUU 



AAGGAATT T GGACAAAG 



UUACAUUA CUGAUGAG X CGAA ACGGUUUG 



CAAACCGT C TAATGTAA 



AAUUACAU CUGAUGAG X CGAA AGACGGUU 



AACCGTCT A ATGTAATT 



UGGUUAAU CUGAUGAG X CGAA ACAUUAGA 



TCTAATGT A ATTAACCA 



CGUUGGUU CUGAUGAG X CGAA AUUACAUU 



AATGTAAT T AACCAACG 



UCGUUGGU CUGAUGAG X CGAA AAUUACAU 



ATGTAATT A ACCAACGA 



AGUCCGGA CUGAUGAG X CGAA AGCUUUUU 



AAAAAGCT T TCCGGACT 



AAGUCCGG CUGAUGAG X CGAA AAGCUUUU 



AAAAGCTT T CCGGACTT 



AAAGUCCG CUGAUGAG X CGAA AAAGCUUU 



AAAGCTTT C CGGAC^ 



GCAUUUAA CUGAUGAG X CGAA AGUCCGGA 



: TTAAATGC 



AGCAUUUA CUGAUGAG X CGAA AAGUCCGG 



CCGGACTT T TAAATGCT 



UAGCAUUU CUGAUGAG X CGAA AAAGUCCG 



CGGACTTT T AAATGCTA 



UUAGCAUU CUGAUGAG X CGAA AAAAGUCC 



GGACTTTT A AATGCTAA 



AAAACAGU CUGAUGAG X CGAA AGCAUUUA 



TAAATGCT A ACTGTTTT 



AGGGGAAA CUGAUGAG X CGAA ACAGUUAG 



CTAACTGT T TTTCCCCT 



AAGGGGAA CUGAUGAG X CGAA AACAGUUA 



TAACTGTT T TTCCCCTT 



GAAGGGGA CUGAUGAG X CGAA AAACAGUU 



AACTGTTT T TCCCCTTC 



GGAAGGGG CUGAUGAG X CGAA AAAACAGU 



ACTGTTTT T CCCCTTCC 



AGGAAGGG CUGAUGAG X CGAA AAAAACAG 



CTGTTTTT C CCCTTCCT 



UAGACAGG CUGAUGAG X CGAA AGGGGAAA 



TTTCCCCT T CCTGTCTA 



CUAGACAG CUGAUGAG X C 



TTCCCCTT C CTGTCTAG 



UUUUCCUA CUGAUGAG X CGAA ACAGGAAG 



CTTCCTGT C TAGGAAAA 



CAUUUUCC CUGAUGAG X CGAA AGACAGGA 



TCCTGTCT A GGAAAATG 



GAGCUUUA CUGAUGAG X CGAA AGCAUUUU 



i TAAAGCTC 



UUGAGCUU CUGAUGAG X CGAA AUAGCAUU 



AATGCTAT A AAGCTCAA 



ACUAAUUU CUGAUGAG X CGAA AGCUUUAU 



ATAAAGCT C AAATTAGT 



UCCUAACU CUGAUGAG X CGAA AUUUGAGC 



GCTCAAAT T AGTTAGGA 



: CUGAUGAG X CGAA AAUUUGAG 



CTCAAATT A GTTAGGAA 



UCAUUCCU CUGAUGAG X CGAA ACUAAUUU 



AAATTAGT T AGGAATGA 



GUCAUUCC CUGAUGAG X CGAA AACUAAUU 



AATTAGTT A GGAATGAC 



AAACGUAU CUGAUGAG X CGAA AGUCAUUC 



GAATGACT T ATACGTTT 



AAAACGUA CUGAUGAG X CGAA AAGUCAUU 



AATGACTT A TACGTTTT 



ACAAAACG CUGAUGAG X CGAA AUAAGUCA 



TGACTTAT A CGTTTTGT 



CAAAACAA CUGAUGAG X CGAA ACGUAUAA 



TTATACGT T TTGTTTTG 



UCAAAACA CUGAUGAG X CGAA AACGUAUA 



TATACGTT T TGTTTTGA 



UUCAAAAC CUGAUGAG X CGAA AAACGUAU 



ATACGTTT T GTTTTGAA 



GUAUUCAA CUGAUGAG X CGAA ACAAAACG 



:gttttgt t ttgaatac 



GGUAUUCA CUGAUGAG X CGAA AACAAAAC 



GTTTTGTT T TGAATACC 



AGGUAUUC CUGAUGAG X CGAA AAACAAAA 



TTTTGTTT T GAATACCT 



CUCUUAGG CUGAUGAG X CGAA AUUCAAAA 



TTTTGAAT A CCTAAGAG 



GUAUCUCU CUGAUGAG X CGAA AGGUAUUC 



CCAAAAAG CUGAUGAG X CGAA AUCUCUUA 



GAATACCT A AGAGATAC 
TAAGAGAT A CTTTTTGG 
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UAUCCAAA CUGAUGAG X CGAA AGUAUCUC 



GAGATACT T TTTGGATA 



AUAUCCAA CUGAUGAG X CGAA AAGUAUCU 



AGATACTT T TTGGATAT 



AAUAUCCA CUGAUGAG X CGAA AAAGUAUC 



GATACTTT T TGGATATT 



AAAUAUCC CUGAUGAG X CGAA AAAAGUAU 



ATACTTTT T GGATAT"] 



AAUAUAAA CUGAUGAG X CGAA AUCCAAAA 



TTTTGGAT A TTTATATT 



GCAAUAUA CUGAUGAG X CGAA AUAUCCAA 



TTGGATAT T TATATTGC 



GGCAAUAU CUGAUGAG X CGAA AAUAUCCA 



TGGATATT T ATATTGCC 



UGGCAAUA CUGAUGAG X CGAA AAAUAUCC 



GGATAT TT A TATTGCCA 



UAUGGCAA CUGAUGAG X CGAA AUAAAUAU 



ATATTTAT A TTGCCATA 



AAUAUGGC CUGAUGAG X CGAA AUAUAAAU 



ATTTATAT T GC CAT ATT 



AGUAAGAA CUGAUGAG X CGAA AUGGCAAU 



ATTGCCAT A TTCTTACT 



CAAGUAAG CUGAUGAG X CGAA AUAUGGCA 



TGCCATAT T CTTACTTG 



UCAAGUAA CUGAUGAG X C 



i. AAUAUGGC 



GCCATATT C TTACTTGA 



AUUCAAGU CUGAUGAG X CGAA AGAAUAUG 



CATATTCT T ACTTGAAT 



CAUUCAAG CUGAUGAG X CGAA AAGAAUAU 



ATATTCTT A CTTGAATG 



AAGCAUUC CUGAUGAG X CGAA AGUAAGAA 



TTCTTACT T GAATGCTT 



GUCAUUCA CUGAUGAG X CGAA AGCAUUCA 



TGAATGCT T TGAATGAC 



AGUCAUUC CUGAUGAG X CGAA AAGCAUUC 



GAATGCTT T GAATGACT 



ACUGGAUG CUGAUGAG X CGAA AGUCAUUC 



GAATGACT A CATCCAGT 



CAGAACUG CUGAUGAG X CGAA AUGUAGUC 



GACTACAT C CAGTTCTG 



AGGUGCAG CUGAUGAG X CGAA ACUGGAUG 



CATCCAGT T CTGCACCT 



UAGGUGCA CUGAUGAG X CGAA AACUGGAU 



ATCCAGTT C TGCACCTA 



AGAGGGUA CUGAUGAG X CGAA AGGUGCAG 



CTGCACCT A TACCCTCT 



CCAGAGGG CUGAUGAG X CGAA AUAGGUGC 



GCACCTAT A CCCTCTGG 



CAACACCA CUGAUGAG X CGAA AGGGUAUA 



TATACCCT C TGGTGTTG 



UAAAAAGC CUGAUGAG X CGAA ACACCAGA 



TCTGGTGT T GCTTTTTA 



AGGUUAAA CUGAUGAG X CGAA AGCAACAC 



GTGTTGCT T TTTAACCT 



AAGGUUAA CUGAUGAG X CGAA AAGCAACA 



TGTTGCTT T TTAACCTT 



GAAGGUUA CUGAUGAG X CGAA AAAGCAAC 



GTTGCTTT T TAACCTTC 



GGAAGGUU CUGAUGAG X CGAA AAAAGCAA 



TTGCTTTT T AACCTTCC 



AGGAAGGU CUGAUGAG X CGAA AAAAAGCA 



TGCTTTTT A ACCTTCCT 



AUUCCAGG CUGAUGAG X CGAA AGGUUAAA 



TTTAACCT T CCTGGAAT 



GAUUCCAG CUGAUGAG X CGAA AAGGUUAA 



TTAACCTT C CTGGAATC 



AGAAAAUG CUGAUGAG X CGAA AUUCCAGG 



CCTGGAAT C CATTTTCT 



UUUUAGAA CUGAUGAG X CGAA AUGGAUUC 



GAATCCAT T TTCTAAAA 



i AAUGGAUU 
UUUUUUAG CUGAUGAG X CGAA AAAUGGAU 



UUUUUAGA CUGAUGAG X CGAA J 



AATCCATT T TCTAAAAA 



ATCCATTT T CTAAAAAA 



AUUUUUUA CUGAUGAG X CGAA AAAAUGGA 



: C TAAAAAAT 



UUAUUUUU CUGAUGAG X CGAA AGAAAAUG 



CATTTTCT A AAAAATAA 



UGUGUCUU CUGAUGAG X CGAA AUUUUUUA 



TAAAAAAT A AAGACACA 



CUGAGAAG CUGAUGAG X CGAA AUGUGUCU 



AGACACAT T CTTCTCAG 



GCUGAGAA CUGAUGAG X CGAA AAUGUGUC 



GACACATT C TTCTCAGC 



; CUGAUGAG X CGAA AGAAUGUG 



GUGCUGAG < 
GGUGCUGA CUGAUGAG X CGAA AAGAAUGU 



CACATTCT T CTCAGCAC 



ACATTCTT C TCAGCACC 



GUGGUGCU CUGAUGAG X CGAA AGAAGAAU 



ATTCTTCT C AGCACCAC 



UUUUGGAA CUGAUGAG X CGAA AGGUGUUG 
GAUUUUGG CUGAUGAG X CGAA AUAGGUGU 



CAACACCT A TTCCAAAA 



ACACCTAT T CCAAAATC 
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CGAUUUUG CUGAUGAG X CGAA AAUAGGUG 
AUGUGGUC CUGAUGAG X CGAA AUUUUGGA 
CUUCCAAA CUGAUGAG X CGAA AUGUGGUC 
UACUUCCA CUGAUGAG X CGAA AUAUGUGG 
UUACUUCC CUGAUGAG X CGAA AAUAUGUG 
GAGAGCUU CUGAUGAG X CGAA ACUUCCAA 
GCUGAGGA CUGAUGAG X CGAA AGCUUUAC 
UUGCUGAG CUGAUGAG X CGAA AGAGCUUU 
CAUUUGCU CUGAUGAG X CGAA AGGAGAGC 
UGUUCUUU CUGAUGAG X CGAA ACAUUUGC 
UUUGUUAU CUGAUGAG X CGAA AUUUCUGU 
GUUUGUUA CUGAUGAG X CGAA AAUUUCUG 
CAGUUUGU CUGAUGAG X CGAA AUAAUUUC 
GUCUGAGA CUGAUGAG X CGAA ACAGUUUG 
UGGUCUGA CUGAUGAG X CGAA AGACAGUU 
UGUGGUCU CUGAUGAG X CGAA AGAGACAG 
UUUGGUUA CUGAUGAG X CGAA ACUGUGGU 
AGUUUGGU CUGAUGAG X CGAA AUACUGUG 
CUGAGUUC CUGAUGAG X CGAA AGUUUGGU 
UUAAUCCU CUGAUGAG X CGAA AGUUCUAG 
AGUUUCUU CUGAUGAG X CGAA AUCCUGAG 
GAGUUUCU CUGAUGAG X CGAA AAUCCUGA 
UUUUGAGU CUGAUGAG X CGAA AGUUUCUU 
GUGGUUUU CUGAUGAG X CGAA AGUGAGUU 
UUUCCAUG CUGAUGAG X CGAA AGUUGUGU 
UCAUUCAG CUGAUGAG X CGAA AGCAGGUU 
GUAUCCAG CUGAUGAG X CGAA AGUCAUUC 
UUGUUAUG CUGAUGAG X CGAA AUCCAGUA 
CAUUUUGU CUGAUGAG X CGAA AUGUAUCC 
AACAUCUU CUGAUGAG X CGAA AUUUCUGC 
UUUUAAAG CUGAUGAG X CGAA ACAUCUUU 
GUUUUAAA CUGAUGAG X CGAA AACAUCUU 
UGGUUUUA CUGAUGAG X CGAA AGAACAUC 
UUGGUUUU CUGAUGAG X CGAA AAGAACAU 
AUUGGUUU CUGAUGAG X CGAA AAAGAACA 
GAUUCUGG CUGAUGAG X CGAA AUGUUGUG 
GUCCCAGA CUGAUGAG X CGAA AUUCUGGU 
GUGUCCCA CUGAUGAG X CGAA AGAUUCUG 
CUGCUUUG CUGAUGAG X CGAA AUGUGUCC 
ACUGCUUU CUGAUGAG X CGAA AAUGUGUC 
UUUCCCUC CUGAUGAG X CGAA ACACACUG 
GUGCUAUA CUGAUGAG X CGAA AUUUCCCU 
AGUGCUAU CUGAUGAG X CGAA AAUUUCCC 
UAGUGCUA CUGAUGAG X CGAA AAAUUUCC 
UUUAGUGC CUGAUGAG X CGAA AUAAAUUU 
UGGGCAUU CUGAUGAG X CGAA AGUGCUAU 
AUUUUAGA CUGAUGAG X CGAA AUUUCCUG 



CACCTATT C CAAAATCG 
TCCAAAAT C GACCACAT 
GACCACAT A TTTGGAAG 
CCACATAT T TGGAAGTA 
CACATATT T GGAAGTAA 
TTGGAAGT A AAGCTCTC 
GTAAAGCT C TCCTCAGC 
AAAGCTCT C CTCAGCAA 
GCTCTCCT C AGCAAATG 
GCAAATGT A AAAGAACA 
ACAGAAAT T ATAACAAA 
CAGAAATT A TAACAAAC 
GAAATTAT A ACAAACTG 
CAAACTGT C TCTCAGAC 
AACTGTCT C TCAGACCA 
CTGTCTCT C AGACCACA 
ACCACAGT A TAACCAAA 
CACAGTAT A ACCAAACT 
ACCAAACT A GAACTCAG 
CTAGAACT C AGGATTAA 
CTCAGGAT T AAGAAACT 
TCAGGATT A AGAAACTC 
AAGAAACT C ACTCAAAA 
AACTCACT C AAAACCAC 
ACACAACT A CATGGAAA 
AACCTGCT C CTGAATGA 
GAATGACT A CTGGATAC 
TACTGGAT A CATAACAA 
GGATACAT A ACAAAATG 
GCAGAAAT A AAGATGTT 
AAAGATGT T CTTTAAAA 
AAGATGTT C TTTAAAAC 



T TAAAACCA 
ATGTTCTT T AAAACCAA 
TGTTCTTT A AAACCAAT 
CACAACAT A CCAGAATC 
ACCAGAAT C TCTGGGAC 
CAGAATCT C TGGGACAC 
GGACACAT T CAAAGCAG 
GACACATT C AAAGCAGT 
CAGTGTGT A GAGGGAAA 
AGGGAAAT T TATAGCAC 
GGGAAATT T ATAGCACT 
GGAAATTT A TAGCACTA 
AAATTTAT A GCACTAAA 
ATAGCACT A AATGCCCA 
CAGGAAAT A TCTAAAAT 
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2246 


CAAUUUUA CUGAUGAG X CGAA AUAUUUCC 


372 


GGAAATAT C TAAAATTG 


784 


2248 


GUCAAUUU CUGAUGAG X CGAA AGAUAUUU 


3 73 


AAATATCT A AAATTGAC 


785 


2253 


AGGGUGUC CUGAUGAG X CGAA AUUUUAGA 


374 


TCTAAAAT T GACACCCT 


786 


2262 


UGUGAUGU CUGAUGAG X CGAA AGGGUGUC 


375 


GACACCCT A ACATCACA 


787 


2267 


UUAAUUGU CUGAUGAG X CGAA AUGUUAGG 


376 


CCTAACAT C ACAATTAA 


788 


2273 


GUUCUUUU CUGAUGAG X CGAA AUUGUGAU 


377 


ATCACAAT T AAAAGAAC 


789 


2274 


AGUUCUUU CUGAUGAG X CGAA AAUUGUGA 


378 


TCACAATT A AAAGAACT 


790 


2283 


UGCUUCUC CUGAUGAG X CGAA AGUUCUUU 


379 


AAAGAACT A GAGAAGCA 


791 


2305 


AGCUUUUC CUGAUGAG X CGAA AUGUGUUU 


380 


AAACACAT T GAAAAGCT 


792 


2314 


CCUUCUCU CUGAUGAG X CGAA AGCUUUUC 


381 


GAAAAGCT A AGAGAAGG 


793 


2331 


AUCUUAGU CUGAUGAG X CGAA AUUUCUUG 


382 


CAAGAAAT A ACTAAGAT 


794 


2335 


UCUGAUCU CUGAUGAG X CGAA AGUUAUUU 


383 


AAATAACT A AGATCAGA 


795 


2340 


UCUGCUCU CUGAUGAG X CGAA AUCUUAGU 


384 


ACTAAGAT C AGAGCAGA 


796 


2361 


UGUGUCUC CUGAUGAG X CGAA AUUUCCUU 


385 


AAGGAAAT A GAGACACA 


797 


2377 


UUUUUGAA CUGAUGAG X CGAA AGUUUUUU 


386 


AAAAAACT C TTCAAAAA 


798 


2379 


AUUUUUUG CUGAUGAG X CGAA AGAGUUUU 


387 


AAAACTCT T CAAAAAAT 


799 


2380 


GAUUUUUU CUGAUGAG X CGAA AAGAGUUU 


388 


AAACTCTT C AAAAAATC 


800 


2388 


GAUUCAUU CUGAUGAG. X CGAA AUUUUUUG 


389 


CAAAAAAT C AATGAATC 


801 


2396 


AGCUCCUG CUGAUGAG X CGAA AUUCAUUG 


3 90 


CAATGAAT C CAGGAGCT 


802 


2408 


UUUCAAAA CUGAUGAG X CGAA ACCAGCUC 


391 


GAGCTGGT T TTTTGAAA 


803 


2409 


GUUUCAAA CUGAUGAG X CGAA AACCAGCU 


392 


AGCTGGTT T TTTGAAAC 


804 


2410 


CGUUUCAA CUGAUGAG X CGAA AAACCAGC 


393 


GCTGGTTT T TTGAAACG 


805 


2411 


UCGUUUCA CUGAUGAG X CGAA AAAACCAG 


394 


CTGGTTTT T TGAAACGA 


■ 806 


2412 


AUCGUUUC CUGAUGAG X CGAA AAAAACCA 


395 


TGGTTTTT T GAAACGAT 


807 


2421 


AUUUUGUU CUGAUGAG X CGAA AUCGUUUC 


3 96 


GAAACGAT C AACAAAAT 


808 


2430 


UGUCUAUC CUGAUGAG X CGAA AUUUUGUU 


397 


AACAAAAT T GATAGACA 


809 


2434 


CUAGUGUC CUGAUGAG X CGAA AUCAAUUU 


398 


AAATTGAT A GACACTAG 


810 


2441 


AGUCUUGC CUGAUGAG X CGAA AGUGUCUA 


399 


TAGACACT A GCAAGACT 


811 


2450 


. UUCUUUAU CUGAUGAG X CGAA AGUCUUGC 


400 


GCAAGACT A ATAAAGAA 


812 


2453 


UUCUUCUU CUGAUGAG X CGAA AUUAGUCU 


401 


AGACTAAT A AAGAAGAA 


813 


2475 


UUCUAUUU CUGAUGAG X CGAA AUUCUUCU 


402 


AGAAGAAT C AAATAGAA 


814 


2480 


AUUGCUUC CUGAUGAG X CGAA AUUUGAUU 


403 


AATCAAAT A GAAGCAAT 


815 


2489 


UCAUUUUU CUGAUGAG X CGAA AUUGCUUC 


404 


GAAGCAAT A AAAAATGA 


816 


2499 


AUCCCCUU CUGAUGAG X CGAA AUCAUUUU 


405 


AAAATGAT A AAGGGGAT 


817 


250B 


GGUGGUGA CUGAUGAG X CGAA AUCCCCUU 


406 


AAGGGGAT A TCACCACC 


818 


2510 


UUGGUGGU CUGAUGAG X CGAA AUAUCCCC 


407 


GGGGATAT C ACCACCAA 


819 


2520 


UUCUGUGG CUGAUGAG X CGAA AUUGGUGG 




CCACCAAT C CCACAGAA 


820 


2531 


UGGUGGUU CUGAUGAG X CGAA AUUUCUGU 


409 


ACAGAAAT A AACCACCA 


821 


2541 


UAUUCUCU CUGAUGAG X CGAA AUGGUGGU 


410 


ACCACCAT C AGAGAATA 


822 


2549 


GUUUGUAG CUGAUGAG X CGAA AUUCUCUG 


411 


CAGAGAAT A CTACAAAC 


823 


2552 | GGUGUUUG CUGAUGAG X CGAA AGUAUUCU 


412 | AGAATACT A CAAACACC 


824 


Input Sequence = HSU29607. Cut Site = (JH/. 



Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II) 
Seql = HSU29607 (Human methionine aminopeptidase mRNA, complete cds., 2569 bp) 
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Table 10: Human methionine aminopeptidase type 2 (MetAP-2) NCH Ribozyme and 
Target Sequence 



Nt. 
position 


Ribozyme Sequence 


Seq. ID 


Substrate Sequence 


Seq. ID Nos. 




CCCGAGAG CUGAUGAG X CGAA IACGAGGG 


825 


CCCTCGTC T CTCTCGGG 


1255 


— n — 


UGCCCGAG CUGAUGAG X CGAA IAGACGAG 


826 


CTCGTCTC T CTCGGGCA 





— n — 


GUUGCCCG CUGAUGAG X CGAA IAGAGACG 


827 


CGTCTCTC T CGGGCAAC 




20 


CGCCAUGU CUGAUGAG X CGAA ICCCGAGA 




TCTCGGGC A ACATGGCG 


12sa 


23 


GCCCGCCA CUGAUGAG X CGAA IUUGCCCG 


— 8~2~i — 


CGGGCAAC A TGGCGGGC 


1259 


49 


CUCCCGGA CUGAUGAG X CGAA ICCGCUAC 


830 


GTAGCGGC C TCCGGGAG 


~ 1260 




GCUCCCGG CUGAUGAG X CGAA IGCCGCUA 


831 


TAGCGGCC T CCGGGAGC 


12gl 


— si — 


UGGCUCCC CUGAUGAG X CGAA IAGGCCGC 


832 


GCGGCCTC C GGGAGCCA 


~T262 




AUUCAGGU CUGAUGAG X CGAA ICUCCCGG 


833 


CCGGGAGC C ACCTGAAT 


l2g3 




CAUUCAGG CUGAUGAG X CGAA IGCUCCCG 


834 


CGGGAGCC A CCTGAATG 


"~ 1264 








GGAGCCAC C TGAATGGC 


1265 


— e3 — 


CGCCAUUC CUGAUGAG X CGAA IGUGGCUC 


836 


GAGCCACC T GAATGGCG 


1266 




UGGAUCCA CUGAUGAG X CGAA IUCGCCAU 


837 


ATGGCGAC C TGGATCCA 


12S7 




CUGGAUCC CUGAUGAG X CGAA IGUCGCCA 


838 


TGGCGACC T GGATCCAG 


1268 


81 




839 


CCTGGATC C AGACGACA 


1269 




CUGUCGUC CUGAUGAG X CGAA IGAUCCAG 


840 


CTGGATCC A GACGACAG 


1270 


89 


UUCUUCCC CUGAUGAG X CGAA IUCGUCUG 


841 


CAGACGAC A GGGAAGAA 


1271 


103 


GUAGAGGC CUGAUGAG X CGAA ICUCCUUC 


842 


GAAGGAGC T GCCTCTAC 


1272 






843 


GGAGCTGC C TCTACGGC 


1273 




AGCCGUAG CUGAUGAG X CGAA IGCAGCUC 


844 


GAGCTGCC T CTACGGCT 


1274 


— Tol — 


UCAGCCGU CUGAUGAG X CGAA IAGGCAGC 


845 


GCTGCCTC T ACGGCTGA 


1275 


115 


GCUUCCUC CUGAUGAG X CGAA ICCGUAGA 


— iii 


TCTACGGC T GAGGAAGC 


1276 


124 


UUCUUGGC CUGAUGAG X CGAA ICUUCCUC 


847 


GAGGAAGC A GCCAAGAA | 12 77 


127 


UUUUUCUU CUGAUGAG X CGAA ICUGCUUC 


848 


GAAGCAGC C AAGAAAAA 


1278 


128 


UUUUUUCU CUGAUGAG X CGAA IGCUGCUU 


B49 


AAGCAGCC A AGAAAAAA 


1279 


158 


AGGCCCUU CUGAUGAG X CGAA ICUCUUCU 


850 


AGAAGAGC A AAGGGCCT 


1280 


165 


CUGCAGAA CUGAUGAG X CGAA ICCCUUUG 


851 


CAAAGGGC C TTCTGCAG 


1281 


166 


GCUGCAGA CUGAUGAG X CGAA IGCCCUUU 


852 


AAAGGGCC T TCTGCAGC 


1282 


169 


CCUGCUGC CUGAUGAG X CGAA IAAGGCCC 


853 


GGGCCTTC T GCAGCAGG 


1283 


172 


UCCCCUGC CUGAUGAG X CGAA ICAGAAGG 


854 


CCTTCTGC A GCAGGGGA 


1284 


175 


UGUUCCCC CUGAUGAG X CGAA ICUGCAGA 


855 


TCTGCAGC A GGGGAACA 


1285 


183 


CAGGUUCC CUGAUGAG X CGAA IUUCCCCU 


856 


AGGGGAAC A GGAACCTG 


1286 


189 


CUUUAUCA CUGAUGAG X CGAA IUUCCUGU 


857 


ACAGGAAC C TGATAAAG 


1287 


190 


UCUUUAUC CUGAUGAG X CGAA IGUUCCUG 


358 


CAGGAACC T GATAAAGA 


1288 


202 


GAGGCUCC CUGAUGAG X CGAA IAUUCUUU 


859 


AAAGAATC A GGAGCCTC 


1289 


20B 


UCCACUGA CUGAUGAG X CGAA ICUCCUGA 


860 


TCAGGAGC C TCAGTGGA 


1290 


209 


AUCCACUG CUGAUGAG X CGAA IGCUCCUG 


861 


CAGGAGCC T CAGTGGAT 


1291 


211 


UCAUCCAC CUGAUGAG X CGAA IAGGCUCC 


862 


GGAGCCTC A GTGGATGA 


1292 


226 


AACUGUCU CUGAUGAG X CGAA ICUACUUC 


863 


GAAGTAGC A AGACAGTT 


1293 


231 


UUUCCAAC CUGAUGAG X CGAA IUCUUGCU 


864 


AGCAAGAC A GTTGGAAA 


1294 


244 


UCCAAUGC CUGAUGAG X CGAA IAUCUUUC 


865 


GAAAGATC A GCATTGGA 


1295 


247 


UCUUCCAA CUGAUGAG X CGAA ICUGAUCU 


866 


AGATCAGC A TTGGAAGA 


1296 
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307 


UUUCCAGU CUGAUGAG X CGAA ICUCCAUC 


867 


GATGGAGC A ACTGGAAA 


1297 


310 


UUCUUUCC CUGAUGAG X CGAA IUUGCUCC 


868 


GGAGCAAC T GGAAAGAA 


1298 


348 


GAACUUUU CUGAUGAG X CGAA IUCCUCUC 


869 


GAGAGGAC C AAAAGTTC 


1299 


349 


UGAACUUU CUGAUGAG X CGAA IGUCCUCU 


870 


AGAGGACC A AAAGTTCA 


1300 


357 


GGUCUGUU CUGAUGAG X CGAA IAACUUUU 


871 


AAAAGTTC A AACAGACC 


1301 


361 


GGAGGGUC CUGAUGAG X CGAA IUUUGAAC 


872 


GTTCAAAC A GACCCTCC 


1302 


365 


UGAGGGAG CUGAUGAG X CGAA IUCUGUUU 


873 


AAACAGAC C CTCCCTCA 


1303 


... 366 


CUGAGGGA CUGAUGAG X CGAA IGUCUGUU 


S74 


AACAGACC C TCCCTCAG 


1304 


367 


ACUGAGGG CUGAUGAG X CGAA IGGUCUGU 


875 


ACAGACCC T CCCTCAGT 


1305 


369 


GAACUGAG CUGAUGAG X CGAA IAGGGUCU 


876 


AGACCCTC C CTCAGTTC 


1306 


370 


GGAACUGA CUGAUGAG X CGAA IGAGGGUC 


877 


GACCCTCC C TCAGTTCC 


1307 


371 


UGGAACUG CUGAUGAG X CGAA IGGAGGGU 


878 


ACCCTCCC T CAGTTCCA 


1308 


373 


AUUGGAAC CUGAUGAG X CGAA IAGGGAGG 


879 


CCTCCCTC A GTTCCAAT 


1309 


378 


CACAUAUU CUGAUGAG X CGAA IAACUGAG 


880 


CTCAGTTC C AATATGTG 


1310 


379 


UCACAUAU CUGAUGAG X CGAA IGAACUGA 


881 


TCAGTTCC A ATATGTGA 


1311 


389 


AGGAUACA CUGAUGAG X CGAA IUCACAUA 


882 


TATGTGAC C TGTATCCT 


1312 


390 


UAGGAUAC CUGAUGAG X CGAA IGUCACAU 


883 


ATGTGACC T GTATCCTA 


1313 


396 


CACCAUUA CUGAUGAG X CGAA IAUACAGG 


884 


CCTGTATC C TAATGGTG 


1314 


397 


ACACCAUU CUGAUGAG X CGAA IGAUACAG 


885 


CTGTATCC T AATGGTGT 


1315 


411 


GUCCUUUG CUGAUGAG X CGAA IAAAUACA 


886 


TGTATTTC C CAAAGGAC 


1316 


412 


UGUCCUUU CUGAUGAG X CGAA IGAAAUAC 


887 


GTATTTCC C AAAGGACA 


1317 


413 


UUGUCCUU CUGAUGAG X CGAA IGGAAAUA 


888 


TATTTCCC A AAGGACAA 


1318 


420 


CGCAUUCU CUGAUGAG X CGAA IUCCUUUG 


889 


CAAAGGAC A AGAATGCG 


1319 


434 


UGUGGGUG CUGAUGAG X CGAA IUAUUCGC 


890 


GCGAATAC C CACCCACA 


1320 


435 


GUGUGGGU CUGAUGAG X CGAA IGUAUUCG 


891 


CGAATACC C ACCCACAC 


1321 


436 


UGUGUGGG CUGAUGAG X CGAA IGGUAUUC 


892 


GAATACCC A CCCACACA 


1322 


438 


CUUGUGUG CUGAUGAG X CGAA IUGGGUAU 


893 


ATACCCAC C CACACAAG 


1323 


439 


UCUUGUGU CUGAUGAG X CGAA IGUGGGUA 


894 


TACCCACC C ACACAAGA 


1324 


440 


AUCUUGUG CUGAUGAG X CGAA IGGUGGGU 


895 


ACCCACCC A CACAAGAT 


1325 


442 


CCAUCUUG CUGAUGAG X CGAA IUGGGUGG 


896 


CCACCCAC A CAAGATGG 


1326 


444 


GCCCAUCU CUGAUGAG X CGAA IUGUGGGU 


897 


ACCCACAC A AGATGGGC 


1327 


457 


CAAGCAGC CUGAUGAG X CGAA IUUCGCCC 


898 


GGGCGAAC A GCTGCTTG 


1328 


460 


CUCCAAGC CUGAUGAG X CGAA ICUGUUCG 


899 


CGAACAGC T GCTTGGAG 


1329 


463 


GUUCUCCA CUGAUGAG X CGAA ICAGCUGU 


900 


ACAGCTGC T TGGAGAAC 


1330 


472 


UCACUUGU CUGAUGAG X CGAA IUUCUCCA 


901 


TGGAGAAC T ACAAGTGA 


1331 


475 


UCUUCACU CUGAUGAG X CGAA IUAGUUCU 


902 


AGAACTAC A AGTGAAGA 


1332 


493 


UGAUCUAA CUGAUGAG X CGAA ICUUUCUU 


903 


AAGAAAGC A TTAGATCA 


1333 




CACUUGCC CUGAUGAG X CGAA IAUCUAAU 


904 


ATTAGATC A GGCAAGTG 


1334 


505 


UCUUCACU CUGAUGAG X CGAA ICCUGAUC 




GATCAGGC A AGTGAAGA 


1335 


538 


GCUUCUGC CUGAUGAG X CGAA ICUUCUCG 


906 


CGAGAAGC T GCAGAAGC 


1336 


541 


UGUGCUUC CUGAUGAG X CGAA ICAGCUUC 


907 


GAAGCTGC A GAAGCACA 


1337 


547 


UGUCGAUG CUGAUGAG X CGAA ICUUCUGC 


908 


GCAGAAGC A CATCGACA 


1338 


549 


CUUGUCGA CUGAUGAG X CGAA IUGCUUCU 


909 


AGAAGCAC A TCGACAAG 


1339 


555 


UUCUAACU CUGAUGAG X CGAA IUCGAUGU 


910 


ACATCGAC A AGTTAGAA 


1340 


578 


CUUGAUCC CUGAUGAG X CGAA ICUCAUUA 


911 


TAATGAGC T GGATCAAG 


1341 


584 


CCCAGGCU CUGAUGAG X CGAA IAUCCAGC 


912 


GCTGGATC A AGCCTGGG 


1342 


588 


UCAUCCCA CUGAUGAG X CGAA ICUUGAUC 


913 


GATCAAGC C TGGGATGA 


1343 
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GUCAUCCC CUGAUGAG X CGAA IGCUUGAU 



ATCAAGCC T GGGATGAC 



UCUAUCAU CUGAUGAG X CGAA IUCAUCCC 



GGGATGAC A ATGATAGA 



CUUUUCAC CUGAUGAG X CGAA IAUUUCUA 



TAGAAATC T GTGAAAAG 



GCGUGAAC CUGAUGAG X CGAA IUCUUCCA 



TGGAAGAC T GTTCACGC 



AACUUGCG CUGAUGAG X CGAA IAACAGUC 



GACTGTTC A CGCAAGTT 



UAUUAACU CUGAUGAG X CGAA ICGUGAAC 



GTTCACGC A AGTTAATA 



GCCAGGCC CUGAUGAG X CGAA ICAUUUAA 



TTAAATGC A GGCCTGGC 



AAAUGCCA CUGAUGAG X CGAA ICCUGCAU 



ATGCAGGC C TGGCATTT 



GAAAUGCC CUGAUGAG X CGAA IGCCUGCA 



TGCAGGCC T GGCATTTC 



GUAGGAAA CUGAUGAG X CGAA ICCAGGCC 



GGCCTGGC A TTTCCTAC 



AUCCAGUA CUGAUGAG X CGAA IAAAUGCC 



GGCATTTC C TACTGGAT 



CAUCCAGU CUGAUGAG X CGAA IGAAAUGC 



GCATTTCC T ACTGGATG 



GAACAUCC CUGAUGAG X CGAA IUAGGAAA 



TTTCCTAC T GGATGTTC 



UUAUUGAG CUGAUGAG X CGAA IAACAUCC 



GGATGTTC T CTCAATAA 



AAUUAUUG CUGAUGAG X CGAA IAGAACAU 



ATGTTCTC T CAATAATT 



ACAAUUAU CUGAUGAG X CGAA IAGAGAAC 



GTTCTCTC A ATAATTGT 



UAAUGGGC CUGAUGAG X CGAA ICACAAUU 



AATTGTGC T GCCCATTA 



GUAUAAUG CUGAUGAG X CGAA ICAGCACA 



TGTGCTGC C CATTATAC 



AGUAUAAU CUGAUGAG X CGAA IGCAGCAC 



GTGCTGCC C ATTATACT 



GAGUAUAA CUGAUGAG X CGAA IGGCAGCA 



TGCTGCCC A TTATACTC 



GCAUUGGG CUGAUGAG X CGAA IUAUAAUG 



CATTATAC T CCCAATGC 



CGGCAUUG CUGAUGAG X CGAA IAGUAUAA 



TTATACTC C CAATGCCG 



CCGGCAUU CUGAUGAG X CGAA IGAGUAUA 



TATACTCC C AATGCCGG 



ACCGGCAU CUGAUGAG X CGAA IGGAGUAU 



ATACTCCC A ATGCCGGT 



GUGUCACC CUGAUGAG X CGAA ICAUUGGG 



CCCAATGC C GGTGACAC 



UACUGUUG CUGAUGAG X CGAA IUCACCGG 



CCGGTGAC A CAACAGTA 



AAUACUGU CUGAUGAG X CGAA IUGUCACC 



GGTGACAC A ACAGTATT 



UGUAAUAC CUGAUGAG X CGAA IUUGUGUC 



GACACAAC A. GT ATT AC A 



CAUCAUAC CUGAUGAG X C 



AGTATTAC A GTATGATG 



UUUACAGA CUGAUGAG X CGAA IUCAUCAU 



ATGATGAC A TCTGTAAA 



UAUUUUAC CUGAUGAG X CGAA IAUGUCAU 



ATGACATC T GTAAAATA 



UGUUCCAA CUGAUGAG X CGAA IUCUAUUU 



AAATAGAC T TTGGAACA 



CUUAUAUG CUGAUGAG X CGAA IUUCCAAA 



TTTGGAAC A CATATAAG 



CACUUAUA CUGAUGAG X CGAA IUGUUCCA 



TGGAACAC A TATAAGTG 



AAAAGCAC CUGAUGAG X CGAA IUCAAUAA 



TTATTGAC T GTGCTTTT 



ACAGUAAA CUGAUGAG X CGAA ICACAGUC 



GACTGTGC T TTTACTGT 



AAAGUGAC CUGAUGAG X CGAA IUAAAAGC 



GCTTTTAC T GTCACTTT 



AUUAAAAG CUGAUGAG X CGAA IACAGUAA 



TTACTGTC A CTTTTAAT 



GGAUUAAA CUGAUGAG X CGAA IUGACAGU 



ACTGTCAC T TTTAATCC 



CAUAUUUG CUGAUGAG X CGAA IAUUAAAA 



TTTTAATC C CAAATATG 



UCAUAUUU CUGAUGAG X CGAA IGAUUAAA 



TTTAATCC C AAATATGA 



AUCAUAUU CUGAUGAG X CGAA IGGAUUAA 



TTAATCCC A AATATGAT 



UCUUUUAC CUGAUGAG X CGAA ICUUUUAA 



TTAAAAGC T GTAAAAGA 



GUGUUAGU CUGAUGAG X CGAA ICAUCUUU 



AAAGATGC T ACTAACAC 



CCAGUGUU CUGAUGAG X CGAA IUAGCAUC 



GATGCTAC T AACACTGG 



UAUUCCAG CUGAUGAG X CGAA IUUAGUAG 



CTACTAAC A CTGGAATA 



UUUAUUCC CUGAUGAG X CGAA IUGUUAGU 



ACTAACAC T GGAATAAA 
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UCAAUUCC CUGAUGAG X CGAA ICACACUU 
CAUCACAC CUGAUGAG X CGAA IACGAACA 
UCUUGGAU CUGAUGAG X CGAA ICCUCACC 
UUCUUGGA CUGAUGAG X CGAA IGCCUCAC 
AACUUCUU CUGAUGAG X CGAA IAUGGCCU 
UAACUUCU CUGAUGAG X CGAA IGAUGGCC 
ACUUCAUA CUGAUGAG X CGAA IACUCCAU 
AACUUCAU CUGAUGAG X CGAA IGACUCCA 
ACUUGAUA CUGAUGAG X CGAA IUCUUCCC 
GUUUCACU CUGAUGAG X CGAA IAUAUGUC 
UACGGAUU CUGAUGAG X CGAA IUUUCACU 
UUACGGAU CUGAUGAG X CGAA IGUUUCAC 
UAGAUUAC CUGAUGAG X CGAA IAUUGGUU 
GUCCAUUU CUGAUGAG X CGAA IAUUACGG 



AAGTGTGC T GGAATTGA 
TGTTCGTC T GTGTGATG" 
GGTGAGGC C ATCCAAGA ~ 
GTGAGGCC A TCCAAGAA ~ 
AGGCCATC C AAGAAGTT 
GGCCATCC A AGAAGTTA ~ 
ATGGAGTC C TATGAAGT 
TGGAGTCC T ATGAAGTT 
GGGAAGAC A TATCAAGT 
GACATATC A AGTGAAAC 
AGTGAAAC C AATCCGTA 
GTGAAACC A ATCCGTAA 



T AAATGGAC 



CAAUUGAA CUGAUGAG X CGAA IUCCAUUU 



AAATGGAC A TTCAATTG 



UGCCCAAU CUGAUGAG X CGAA IAAUGUCC 



GGACATTC A ATTGGGCA 



UUCUAUAU CUGAUGAG X CGAA ICCCAAUU 



AATTGGGC A ATATAGAA 



UUCCAGCA CUGAUGAG X CGA^ IUAUUCUA 



TAGAATAC A TGCTGGAA 



GUUUUUCC CUGAUGAG X CGAA ICAUGUAU 



ATACATGC T GGAAAAAC 



AUCGGCAC CUGAUGAG X CGAA IUUUUUCC 



GGAAAAAC A GTGCCGAT 



UCACAAUC CUGAUGAG X CGAA ICACUGUU 



AACAGTGC C GATTGTGA 



AUUCUUGU CUGAUGAG X CGAA ICCUCCCC 



GGGGAGGC A ACAAGAAT 



UCCAUUCU CUGAUGAG X CGAA IUUGCCUC 



GAGGCAAC A AGAATGGA 



GUUUCAAU CUGAUGAG X CGAA ICAUAUAC 



GTATATGC A ATTGAAAC 



CUACCAAA CUGAUGAG X CGAA IUUUCAAU 



ATTGAAAC C TTTGGTAG 



ACUACCAA CUGAUGAG X CGAA IGUUUCAA 



TTGAAACC T TTGGTAGT 



CCUUUUCC CUGAUGAG X CGAA IUACUACC 



GGTAGTAC A GGAAAAGG 



UAUCAUCA CUGAUGAG X C 



TGTTGTTC A TGATGATA 



AUGUAAUG CUGAUGAG X CGAA IAACAUUC 



GAATGTTC A CATTACAT 



UCAUGUAA CUGAUGAG X CGAA IUGAACAU 



ATGTTCAC A TTACATGA 



AUUUUUCA CUGAUGAG X CGAA IUAAUGUG 



C AC ATT AC A TGAAAAAT 



UUGGCACA CUGAUGAG X CGAA IUCCAACA 



TGTTGGAC A TGTGCCAA 



3AUGAG X CGAA ICACAUGU 

AGCCUUAU CUGAUGAG X CGAA IGCACAUG 

UUCUUGGA CUGAUGAG X CGAA ICCUUAUU 

UUGUUCUU CUGAUGAG X CGAA IAAGCCUU 

UUUGUUCU CUGAUGAG X CGAA IGAAGCCU 

AAGUGUUU CUGAUGAG X CGAA IUUCUUGG 

UUAACAAG CUGAUGAG X CGAA IUUUUGUU 

AUUUAACA CUGAUGAG X CGAA IUGUUUUG 

UUCAUUGA CUGAUGAG X CGAA IACAUUUA 

GUUUUCAU CUGAUGAG X CGAA IAUGACAU 

GGUUCCAA CUGAUGAG X CGAA IUUUUCAU 

AAGGCAAG CUGAUGAG X CGAA IUUCCAAA 

GAAGGCAA CUGAUGAG X CGAA IGUUCCAA 

AGAAGGCA CUGAUGAG X CGAA IGGUUCCA 

CGGCAGAA CUGAUGAG X CGAA I CAAGGGU 



ACATGTGC C AATAAGGC 
CATGTGCC A ATAAGGCT 
AATAAGGC T TCCAAGAA 
AAGGCTTC C AAGAACAA 
AGGCTTCC A AGAACAAA 
CCAAGAAC A AAACACTT 
AACAAAAC A CTTGTTAA 
CAAAACAC T TGTTAAAT 
TAAATGTC A TCAATGAA 
ATGTCATC A ATGAAAAC 
ATGAAAAC T TTGGAACC 
TTTGGAAC C CTTGCCTT 
TTGGAACC C TTGCCTTC 
TGGAACCC T TGCCTTCT 
ACCCTTGC C TTCTGCCG 
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GCGGCAGA CUGAUGAG X CGAA IGCAAGGG 



CCCTTGCC T TCTGCCGC 



UCUGCGGC CUGAUGAG X CGAA IAAGGCAA 



TTGCCTTC T GCCGCAGA 



CCAUCUGC CUGAUGAG X CGAA I CAGAAGG 



CCTTCTGC C GCAGATGG 



CAGCCAUC CUGAUGAG X CGAA I CGGCAGA 



TCTGCCGC A GATGGCTG 



AGCGAUCC CUGAUGAG X CGAA ICCAUCUG 



CAGATGGC T GGATCGCT 



UUCUCCCA CUGAUGAG X CGAA ICGAUCCA 



TGGATCGC T TGGGAGAA 



AGCCAUCA CUGAUGAG X CGAA IUAUUUAC 



GTAAATAC T TGATGGCT 



UUCUUCAG CUGAUGAG X CGAA ICCAUCAA 



TTGATGGC T CTGAAGAA 



GAUUCUUC CUGAUGAG X CGAA IAGCCAUC 



GATGGCTC T GAAGAATC 



AGUCACAC CUGAUGAG X CGAA IAUUCUUC 



GAAGAATC T GTGTGACT 



AAUGCCCA CUGAUGAG X CGAA IUCACACA 



TGTGTGAC T TGGGCATT 



AUCUACAA CUGAUGAG X CGAA ICCCAAGU 



ACTTGGGC A TTGTAGAT 



GUGGAUAU CUGAUGAG X CGAA IAUCUACA 



TGTAGATC C ATATCCAC 



GGUGGAUA CUGAUGAG X CGAA IGAUCUAC 



GTAGATCC A TATCCACC 



AUAAUGGU CUGAUGAG X CGAA IAUAUGGA 



TCCATATC C ACCATTAT 



CAUAAUGG CUGAUGAG X CGAA IGAUAUGG 



CCATATCC A CCATTATG 



CACAUAAU CUGAUGAG X CGAA IUGGAUAU 



ATATCCAC C ATTATGTG 



UCACAUAA CUGAUGAG X CGAA IGUGGAUA 



TATCCACC A TTATGTGA 



UCCUUUAA CUGAUGAG X CGAA IUCACAUA 



TATGTGAC A TTAAAGGA 



GCUGUAUA CUGAUGAG X CGAA IAUCCUUU 



AAAGGATC A TATACAGC 



AAUUGCGC CUGAUGAG X CGAA IUAUAUGA 



TCATATAC A GCGCAATT 



GUUCAAAU CUGAUGAG X CGAA ICGCUGUA 



TACAGCGC A ATTTGAAC 



GGAUGGUA CUGAUGAG X CGAA IUUCAAAU 



ATTTGAAC A TACCATCC 



AACAGGAU CUGAUGAG X CGAA IUAUGUUC 



GAACATAC C ATCCTGTT 



CAACAGGA CUGAUGAG X CGAA IGUAUGUU 



AACATACC A TCCTGTTG 



ACGCAACA CUGAUGAG X CGAA IAUGGUAU 



ATACCATC C TGTTGCGT 



GACGCAAC CUGAUGAG X CGAA IGAUGGUA 



TACCATCC T GTTGCGTC 



UACAUGUU CUGAUGAG X CGAA IACGCAAC 



GTTGCGTC C AACATGTA 



UUACAUGU CUGAUGAG X CGAA IGACGCAA 



TTGCGTCC A ACATGTAA 



UCUUUACA CUGAUGAG X CGAA IUUGGACG 



CGTCCAAC A TGTAAAGA 



UCCUCUGC CUGAUGAG X CGAA IACAACUU 



AAGTTGTC A GCAGAGGA 



AUCUCCUC CUGAUGAG X CGAA ICUGACAA 



AGUUUAAU CUGAUGAG X CGAA IUCAUCUC 



TTGTCAGC A GAGGAGAT 



GAGATGAC T ATTAAACT 



UUGGACUA CUGAUGAG X CGAA IUUUAAUA 



TATTAAAC T TAGTCCAA 



GUGGCUUU CUGAUGAG X CGAA IACUAAGU 



ACTTAGTC C AAAGCCAC 



GGUGGCUU CUGAUGAG X CGAA IGACUAAG 



CTTAGTCC A AAGCCACC 



GUUGAGGU CUGAUGAG X CGAA ICUUUGGA 



TCCAAAGC C ACCTCAAC 



UGUUGAGG CUGAUGAG X CGAA IGCUUUGG 



CCAAAGCC A CCTCAACA 



GGUGUUGA CUGAUGAG X CGAA IUGGCUUU 



AAAGCCAC C TCAACACC 



AGGUGUUG CUGAUGAG X CGAA IGUGGCUU 



AAGCCACC T CAACACCT 



AAAGGUGU CUGAUGAG X CGAA IAGGUGGC 



GCCACCTC A ACACCTTT 



AAUAAAGG CUGAUGAG X CGAA IUUGAGGU 



ACCTCAAC A CCTTTATT 



AAAAUAAA CUGAUGAG X CGAA IUGUUGAG 



CTCAACAC C TTTATTTT 



GAAAAUAA CUGAUGAG X CGAA IGUGUUGA 



TCAACACC T TTATTTTC 



CAAAGCUC CUGAUGAG X CGAA IAAAAUAA 



UCCAACAA CUGAUGAG X CGAA ICUCAGAA 



TTATTTTC T GAGCTTTG 
TTCTGAGC T TTGTTGGA 



UGGUAUCA CUGAUGAG X CGAA IUUUUCCA 



TGGAAAAC A TGATACCA 
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UUAAUUCU CUGAUGAG X CGAA IUAUCAUG 
AUUAAUUC CUGAUGAG X CGAA IGUAUCAU 



CATGATAC C AGAATTAA 



ATGATACC A GAATTAAT 



CAACAUGU CUGAUGAG X CGAA ICAAAUUA 



TAATTTGC C ACATGTTG 



ACAACAUG CUGAUGAG X CGAA IGCAAAUU 



AATTTGCC A CATGTTGT 



AGACAACA CUGAUGAG X CGAA IUGGCAAA 



TTTGCCAC A TGTTGTCT 



GUUAAAAC CUGAUGAG X CGAA IACAACAU 



ATGTTGTC T GTTTTAAC 



GGGUCCAC CUGAUGAG X CGAA IUUAAAAC 



GTTTTAAC A GTGGACCC 



AUUACAUG CUGAUGAG X CGAA IUCCACUG 



CAGTGGAC C CATGTAAT 



1492 
1493 



UAUUACAU CUGAUGAG X CGAA IGUCCACU 



AGTGGACC C ATGTAATA 



GUAUUACA CUGAUGAG X CGAA IGGUCCAC 



GTGGACCC A TGTAATAC 



UGGAUAAA CUGAUGAG X CGAA IUAUUACA 



TGTAATAC T TTTATCCA 



UUAAACAU CUGAUGAG X CGAA IAUAAAAG 



CTTTTATC C ATGTTTAA 



UUUAAACA CUGAUGAG X CGAA IGAUAAAA 



TTTTATCC A TGTTTAAA 



UUUGCCUU CUGAUGAG X CGAA IUCCAAAU 



ATTTGGAC A AAGGCAAA 



AGACGGUU CUGAUGAG X CGAA ICCUUUGU 



ACAAAGGC A AACCGTCT 



CAUUAGAC CUGAUGAG X CGAA IUUUGCCU 



AGGCAAAC C GTCTAATG 



AUUACAUU CUGAUGAG X CGAA IACGGUUU 



AAACCGTC T AATGTAAT 



UUUUCGUU CUGAUGAG X CGAA IUUAAUUA 



TAATTAAC C AACGAAAA 



UUUUUCGU CUGAUGAG X ( 



AATTAACC A ACGAAAAA 



GUCCGGAA CUGAUGAG X CGAA ICUUUUUC 



GAAAAAGC T TTCCGGAC 



AAAAGUCC CUGAUGAG X CGAA IAAAGCUU 



AAGCTTTC C GGACTTTT 



CAUUUAAA CUGAUGAG X CGAA IUCCGGAA 



TTCCGGAC T TTTAAATG 



1505 
1506 



AAACAGUU CUGAUGAG X CGAA ICAUUUAA 



TTAAATGC T AACTGTTT 



GGAAAAAC CUGAUGAG X CGAA IUUAGCAU 



ATGCTAAC T GTTTTTCC 



CAGGAAGG CUGAUGAG X CGAA IAAAAACA 



: C CCTTCCTG 



ACAGGAAG CUGAUGAG X CGAA IGAAAAAC 
GACAGGAA CUGAUGAG X CGAA IGGAAAAA 



GTTTTTCC C CTTCCTGT 



TTTTTCCC C TTCCTGTC 



AGACAGGA CUGAUGAG X CGAA IGGGAAAA 



TTTTCCCC T TCCTGTCT 



CCUAGACA CUGAUGAG X CGAA IAAGGGGA 



TCCCCTTC C TGTCTAGG 



UCCUAGAC CUGAUGAG X CGAA IGAAGGGG 



CCCCTTCC T GTCTAGGA 



AUUUUCCU CUGAUGAG X CGAA IACAGGAA 



AGCUUUAU CUGAUGAG X CGAA ICAUUUUC 
CUAAUUUG 



: T AGGAAAAT 
: T ATAAAGCT 



3 CUGAUGAG X CGAA ICUUUAUA 



TATAAAGC T CAAATTAG 



AACUAAUU CUGAUGAG X CGAA IAGCUUUA 



TAAAGCTC A AATTAGTT 



AACGUAUA CUGAUGAG X CGAA IUCAUUCC 



AUCUCUUA CUGAUGAG X CGAA IUAUUCAA 



UAUCUCUU CUGAUGAG X CGAA IGUAUUCA 



AUCCAAAA CUGAUGAG X CGAA IUAUCUCU 



GGAATGAC T TATACGTT 



TTGAATAC C TAAGAGAT 



TGAATACC T AAGAGATA 



AGAGATAC T TTTTGGAT 



AAGAAUAU CUGAUGAG ) 



TATATTGC C ATATTCTT 



UAAGAAUA CUGAUGAG X CGAA IGCAAUAU 



ATATTGCC A TATTCTTA 



V CUGAUGAG X CGAA IAAUAUGG 



UUCAAGUA ( 
AGCAUUCA CUGAUGAG X CGAA IUAAGAAU 



CCATATTC T TACTTGAA 



UCAUUCAA CUGAUGAG X CGAA ICAUUCAA 



ATTCTTAC T TGAATGCT 
TTGAATGC T TTGAATGA 



CUGGAUGU CUGAUGAG X CGAA IUCAUUCA 



TGAATGAC T ACATCCAG 



1528 
1529 



GAACUGGA CUGAUGAG X CGAA IUAGUCAU 



ATGACTAC A TCCAGTTC 



GCAGAACU CUGAUGAG X CGAA IAUGUAGU 



ACTACATC C AGTTCTGC 



UGCAGAAC CUGAUGAG X CGAA IGAUGUAG 



CTACATCC A GTTCTGCA 
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AUAGGUGC CUGAUGAG X CGAA IAACUGGA 
GGUAUAGG CUGAUGAG X CGAA ICAGAACU 
AGGGUAUA CUGAUGAG X CGAA IUGCAGAA 
GAGGGUAU CUGAUGAG X CGAA IGUGCAGA 
CACCAGAG CUGAUGAG X CGAA fuAUAGGU 
ACACCAGA CUGAUGAG X CGAA IGUAUAGG 
AACACCAG CUGAUGAG X CGAA IGGUAUAG 
GCAACACC CUGAUGAG X CGAA IAGGGUAU 
GGUUAAAA CUGAUGAG X CGAA ICAACACC 
UCCAGGAA CUGAUGAG X CGAA IUUAAAAA 
UUCCAGGA CUGAUGAG X CGAA IGUUAAAA 
GGAUUCCA CUGAUGAG X CGAA IAAGGUUA 
UGGAUUCC CUGAUGAG X CGAA IGAAGGUU 
UAGAAAAU CUGAUGAG X CGAA IAUUCCAG 
UUAGAAAA CUGAUGAG X CGAA IGAUUCCA 
UAUUUUUU CUGAUGAG X CGAA IAAAAUGG 
GAAGAAUG CUGAUGAG X CGAA IUCUUUAU 
GAGAAGAA CUGAUGAG X CGAA IUGUCUUU 
UGCUGAGA CUGAUGAG X CGAA IAAUGUGU 
UGGUGCUG CUGAUGAG X CGAA IAAGAAUG 
UGUGGUGC CUGAUGAG X CGAA IAGAAGAA 
UUGUGUGG CUGAUGAG X CGAA I CUGAGAA 
UGUUGUGU CUGAUGAG X CGAA IUGCUGAG 
GUGUUGUG CUGAUGAG X CGAA IGUGCUGA 
AGGUGUUG CUGAUGAG X CGAA IUGGUGCU 
AUAGGUGU CUGAUGAG X CGAA IUGUGGUG 
GGAAUAGG CUGAUGAG X CGAA IUUGUGUG 
UUGGAAUA CUGAUGAG X CGAA IUGUUGUG 
UUUGGAAU CUGAUGAG X CGAA IGUGUUGU 
UCGAUUUU CUGAUGAG X CGAA IAAUAGGU 
GUCGAUUU CUGAUGAG X CGAA IGAAUAGG 
AAAUAUGU CUGAUGAG X CGAA IUCGAUUU 
CAAAUAUG CUGAUGAG X CGAA IGUCGAUU 
UCCAAAUA CUGAUGAG X CGAA IUGGUCGA 
CUGAGGAG CUGAUGAG X CGAA ICUUUACU 
UGCUGAGG CUGAUGAG X CGAA IAGCUUUA 
UUUGCUGA CUGAUGAG X CGAA IAGAGCUU 
AUUUGCUG CUGAUGAG X CGAA IGAGAGCU 
ACAUUUGC CUGAUGAG X CGAA IAGGAGAG 
UUUACAUU CUGAUGAG X CGAA I CUGAGGA 
AUAAUUUC CUGAUGAG X CGAA IUUCUUUU 
AGACAGUU CUGAUGAG X CGAA IUUAUAAU 
UGAGAGAC CUGAUGAG X CGAA IUUUGUUA 
GGUCUGAG CUGAUGAG X CGAA IACAGUUU 
GUGGUCUG CUGAUGAG X CGAA IAGACAGU 
CUGUGGUC CUGAUGAG X CGAA IAGAGACA 
UAUACUGU CUGAUGAG X CGAA IUCUGAGA 



TCCAGTTC T GCACCTAT 
AGTTCTGC A CCTATAC~ 
TTCTGCAC C TATACCCT ~ 
TCTGCACC T ATACCCTC ~ 
ACCTATAC C CTCTGGTG _ 
CCTATACC C TCTGGTGT ~ 
CTATACCC T CTGGTGTT 
ATACCCTC T GGTGTTGC 
GGTGTTGC T TTTTAACC ~ 
TTTTTAAC C TTCCTGGA 
TTTTAACC T TCCTGGAA 
TAACCTTC C TGGAATCC 
AACCTTCC T GGAATCCA 
CTGGAATC C ATTTTCTA 
TGGAATCC A TTTTCTAA 
CCATTTTC T AAAAAATA 
ATAAAGAC A CATTCTTC 
AAAGACAC A TTCTTCTC 
ACACATTC T TCTCAGCA 
CATTCTTC T CAGCACCA 
TTCTTCTC A GCACCACA 
TTCTCAGC A CCACACAA 
CTCAGCAC C ACACAACA 
TCAGCACC A CACAACAC 
AGCACCAC A CAACACCT 
CACCACAC A ACACCTAT 
CACACAAC A CCTATTCC 
CACAACAC C TATTCCAA 
ACAACACC T ATTCCAAA 
ACCTATTC C AAAATCGA 
CCTATTCC A AAATCGAC 
AAATCGAC C ACATATTT 
A CATATTTG 
TCGACCAC A TATTTGGA 
AGTAAAGC T CTCCTCAG 
TAAAGCTC T CCTCAGCA 
AAGCTCTC C TCAGCAAA 
AGCTCTCC T CAGCAAAT 
CTCTCCTC A GCAAATGT 
TCCTCAGC A AATGTAAA 
AAAAGAAC A GAAATTAT 
ATTATAAC A AACTGTCT 
TAACAAAC T GTCTCTCA 
AAACTGTC T CTCAGACC 
ACTGTCTC T CAGACCAC 
TGTCTCTC A GACCACAG 
TCTCAGAC C ACAGTATA 
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UUAUACUG CUGAUGAG X CGAA IGUCUGAG 



CTCAGACC A CAGTATAA 



GGUUAUAC CUGAUGAG X CGAA IUGGUCUG 



CAGACCAC A GTATAACC 



UCUAGUUU CUGAUGAG X CGAA IUUAUACU 



AGTATAAC C AAACTAGA 



UUCUAGUU CUGAUGAG X CGAA IGUUAUAC 



GTATAACC A AACTAGAA 



UGAGUUCU CUGAUGAG X CGAA IUUUGGUU 



AACCAAAC T AGAACTCA 



UAAUCCUG CUGAUGAG X CGAA IUUCUAGU 



ACTAGAAC T CAGGATTA 



CUUAAUCC CUGAUGAG X CGAA IAGUUCUA 



TAGAACTC A GGATTAAG 



UUUGAGUG CUGAUGAG X CGAA IUUUCUUA 



TAAGAAAC T CACTCAAA 



GUUUUGAG CUGAUGAG X CGAA IAGUUUCU 



AGAAACTC A CTCAAAAC 



UGGUUUUG CUGAUGAG X CGAA IUGAGUUU 



AAACTCAC T CAAAACCA 



UGUGGUUU CUGAUGAG X CGAA IAGUGAGU 



ACTCACTC A AAACCACA 



AGUUGUGU CUGAUGAG X CGAA IUUUUGAG 



CTCAAAAC C ACACAACT 



UAGUUGUG CUGAUGAG X CGAA IGUUUUGA 



TCAAAACC A CACAACTA 



UGUAGUUG CUGAUGAG X CGAA IUGGUUUU 



AAAACCAC A CAACTACA 



CAUGUAGU CUGAUGAG X CGAA . IUGUGGUU 



AACCACAC A ACTACATG 



UUCCAUGU CUGAUGAG X CGAA IUUGUGUG 



CACACAAC T ACATGGAA 



AGUUUCCA CUGAUGAG X CGAA IUAGUUGU 



ACAACTAC A TGGAAACT 



GGUUGUUC CUGAUGAG X CGAA IUUUCCAU 



ATGGAAAC T GAACAACC 



GAGCAGGU CUGAUGAG X CGAA IUUCAGUU 



AACTGAAC A ACCTGCTC 



CAGGAGCA CUGAUGAG X CGAA IUUGUUCA 



TGAACAAC C TGCTCCTG 



UCAGGAGC CUGAUGAG X CGAA IGUUGUUC 



GAACAACC T GCTCCTGA 



CAUUCAGG CUGAUGAG X CGAA ICAGGUUG 



CAACCTGC T CCTGAATG 



GUCAUUCA CUGAUGAG X CGAA IAGCAGGU 



ACCTGCTC C TGAATGAC 



AGUCAUUC CUGAUGAG X CGAA IGAGCAGG 



CCTGCTCC T GAATGACT 



UAUCCAGU CUGAUGAG X CGAA IUCAUUCA 



TGAATGAC T ACTGGATA 



AUGUAUCC CUGAUGAG X CGAA IUAGUCAU 



ATGACTAC T GGATACAT 



UUUUGUUA CUGAUGAG X CGAA IUAUCCAG 



CTGGATAC A TAACAAAA 



CUUCAUUU CUGAUGAG X CGAA IUUAUGUA 



TACATAAC A AAATGAAG 



UUUAUUUC CUGAUGAG X CGAA ICCUUCAU 



ATGAAGGC A GAAATAAA 



GGUUUUAA CUGAUGAG X CGAA IAACAUCU 



AGATGTTC T TTAAAACC 



UUCUCAUU CUGAUGAG X CGAA IUUUUAAA 



TTTAAAAC C AATGAGAA 



GUUCUCAU CUGAUGAG X CGAA IGUUUUAA 



TTAAAACC A ATGAGAAC 



UGUGUCUU CUGAUGAG X CGAA IUUCUCAU 



ATGAGAAC A AAGACACA 



GUAUGUUG CUGAUGAG X CGAA IUCUUUGU 



ACAAAGAC A CAACATAC 



UGGUAUGU CUGAUGAG X CGAA IUGUCUUU 



AAAGACAC A ACATACCA 



UUCUGGUA CUGAUGAG X CGAA IUUGUGUC 



GACACAAC A TACCAGAA 



GAGAUUCU CUGAUGAG X CGAA IUAUGUUG 



CAACATAC C AGAATCTC 



AGAGAUUC CUGAUGAG X CGAA IGUAUGUU 



AACATACC A GAATCTCT 



UGUCCCAG CUGAUGAG X CGAA IAUUCUGG 



CCAGAATC T CTGGGACA 



UGUGUCCC CUGAUGAG X CGAA IAGAUUCU 



AGAATCTC T GGGACACA 



UUUGAAUG CUGAUGAG X CGAA IUCCCAGA 



TCTGGGAC A CATTCAAA 



GCUUUGAA CUGAUGAG X CGAA IUGUCCCA 



TGGGACAC A TTCAAAGC 



CACUGCUU CUGAUGAG X CGAA IAAUGUGU 



ACACATTC A AAGCAGTG 



CUACACAC CUGAUGAG X CGAA ICUUUGAA 



TTCAAAGC A GTGTGTAG 



GCAUUUAG CUGAUGAG X CGAA ICUAUAAA 



TTTATAGC A CTAAATGC 



GGGCAUUU CUGAUGAG X CGAA IUGCUAUA 



TATAGCAC T AAATGCCC 



CUCUUGUG CUGAUGAG X CGAA ICAUUUAG 



CTAAATGC C CACAAGAG 
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UCUCUUGU CUGAUGAG X CGAA IGCAUUUA 



TAAATGCC C ACAAGAGA 



2225 
2227 



UUCUCUUG CUGAUGAG X CGAA IGGCAUUU 



AAATGCCC A CAAGAGAA 



CUUUCUCU CUGAUGAG X CGAA IUGGGCAU 



ATGCCCAC A AGAGAAAG 



AUAUUUCC CUGAUGAG X CGAA ICUUUCUC 



GAGAAAGC A GGAAATAT 



UCAAUUUU CUGAUGAG X CGAA IAUAUUUC 



GAAATATC T AAAATTGA 



UGUUAGGG CUGAUGAG X CGAA IUCAAUUU 



AAATTGAC A CCCTAACA 



GAUGUUAG CUGAUGAG X CGAA IUGUCAAU 



ATTGACAC C CTAACATC 



UGAUGUUA CUGAUGAG X CGAA IGUGUCAA 



TTGACACC C TAACATCA 



GUGAUGUU CUGAUGAG X CGAA IGGUGUCA 



TGACACCC T AACATCAC 



AAUUGUGA CUGAUGAG X CGAA IUUAGGGU 



ACCCTAAC A TCACAATT 



UUUAAUUG CUGAUGAG X CGAA IAUGUUAG 



CTAACATC A CAATTAAA 



CUUUUAAU CUGAUGAG X CGAA IUGAUGUU 



AACATCAC A ATTAAAAG 



GCUUCUCU CUGAUGAG X CGAA IUUCUUUU 



AAAAGAAC T AGAGAAGC 



UUUGCUCU CUGAUGAG X CGAA ICUUCUCU 



AGAGAAGC A AGAGCAAA 



AAUGUGUU CUGAUGAG X CGAA ICUCUUGC 



GCAAGAGC A AACACATT 



UUUCAAUG CUGAUGAG X CGAA IUUUGCUC 



GAGCAAAC A CATTGAAA 



CUUUUCAA CUGAUGAG X CGAA IUGUUUGC 



GCAAACAC A TTGAAAAG 



CUUCUCUU CUGAUGAG X CGAA ICUUUUCA 



TGAAAAGC T AAGAGAAG 



UUAUUUCU CUGAUGAG X CGAA ICCUUCUC 



GAGAAGGC A AGAAATAA 



CUGAUCUU CUGAUGAG X CGAA IUUAUUUC 



GAAATAAC T AAGATCAG 



UUCUGCUC CUGAUGAG X CGAA IAUCUUAG 



CTAAGATC A GAGCAGAA 



UUCAGUUC CUGAUGAG X CGAA ICUCUGAU 



ATCAGAGC A GAACTGAA 



UUUCCUUC CUGAUGAG X CGAA IUUCUGCU 



GUUUUUUG CUGAUGAG X CGAA IUCUCUAU 



AGCAGAAC T GAAGGAAA 
ATAGAGAC A CAAAAAAC 



GAGUUUUU CUGAUGAG X CGAA IUGUCUCU 



AGAGACAC A AAAAACTC 



UUUUGAAG CUGAUGAG X CGAA IUUUUUUG 



CAAAAAAC T CTTCAAAA 



UUUUUUGA CUGAUGAG X CGAA IAGUUUUU 



AAAAACTC T TCAAAAAA 



UGAUUUUU CUGAUGAG X CGAA IAAGAGUU 



AACTCTTC A AAAAATCA 



GGAUUCAU CUGAUGAG X CGAA IAUUUUUU 



AAAAAATC A ATGAATCC 



CAGCUCCU CUGAUGAG X ( 



AATGAATC C AGGAGCTG 



CCAGCUCC CUGAUGAG X CGAA IGAUUCAU 



ATGAATCC A GGAGCTGG 



AAAAAACC CUGAUGAG X CGAA ICUCCUGG 



CCAGGAGC T GGTTTTTT 



AAUUUUGU CUGAUGAG X CGAA IAUCGUUU 



AAACGATC A ACAAAATT 



AUCAAUUU CUGAUGAG X CGAA IUUGAUCG 



CGATCAAC A AAATTGAT 



CUUGCUAG CUGAUGAG X CGAA IUCUAUCA 



TGATAGAC A CTAGCAAG 



GUCUUGCU CUGAUGAG X CGAA IUGUCUAU 



AUUAGUCU CUGAUGAG X CGAA ICUAGUGU 



ATAGACAC T AGCAAGAC 



ACACTAGC A AGACTAAT 



UCUUUAUU CUGAUGAG X CGAA IUCUUGCU 



AGCAAGAC T AATAAAGA 



CUUCUAUU CUGAUGAG X CGAA IAUUCUUC 



GAAGAATC A AATAGAAG 



UUUUUUAU CUGAUGAG X CGAA ICUUCUAU 



ATAGAAGC A ATAAAAAA 



AUUGGUGG CUGAUGAG X CGAA IAUAUCCC 



GGGATATC A CCACCAAT 



GGAUUGGU CUGAUGAG X CGAA IUGAUAUC 



GATATCAC C ACCAATCC 



GGGAUUGG CUGAUGAG X CGAA IGUGAUAU 



GUGGGAUU CUGAUGAG X CGAA IUGGUGAU 



ATATCACC A CCAATCCC 
ATCACCAC C AATCCCAC 



UGUGGGAU CUGAUGAG X CGAA IGUGGUGA 



TCACCACC A ATCCCACA 



UUUCUGUG CUGAUGAG X CGAA IAUUGGUG 



CACCAATC C CACAGAAA 



AUUUCUGU CUGAUGAG X CGAA IGAUUGGU 



ACCAATCC C ACAGAAAT 
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Table 10 



2523 


UAUUUCUG CUGAUGAG X CGAA IGGAUUGG 


1243 


CCAATCCC A CAGAAATA 


1673 




2525 


UUUAUUUC CUGAUGAG X CGAA IUG3GAUU 


1244 


AATCCCAC A GAAATAAA 


~ 1674 


2535 


CUGAUGGU CUGAUGAG X CGAA IUtFJAUUU 


1245 


AAATAAAC C ACCATCAG 


1675 


2536 


UCUGAUGG CUGAUGAG X CGAA IGUUUAUU 


124S 


AATAAACC A C CATC AG A 




2538 


UCUCUGAU CUGAUGAG X CGAA IUGGUUUA 


1247 


TAAACCAC C ATCAGAGA 


1677 


2539 


UUCUCUGA CUGAUGAG X CGAA IGUGGUUU 


1248 


AAACCACC A TCAGAGAA 




2542 


GUAUUCUC CUGAUGAG X CGAA IAUGGUGG 


1249 


CCACCATC A GAGAATAC 


1679 


2551 


GUGUUUGU CUGAUGAG X CGAA IUAUUCUC 


1250 


GAGAATAC T ACAAACAC 


1680 


2554 


GAGGUGUU CUGAUGAG X CGAA IUAGUAUU 


1251 


AATACTAC A AACACCTC 


1681 


2558 


CGUAGAGG CUGAUGAG X CGAA IUUUGUAG 


1252 


CTACAAAC A CCTCTACG 


1582 


2560 


UGC3UAGA CUGAUGAG X CGAA IUGUUUGU 


1253 


ACAAACAC C TCTACGCA 


1683 


2561 


UUGCGUAG CUGAUGAG X CGAA IGUGUUUG 


1254 


CAAACACC T CTACGCAA 


1684 






Input Sequence = HSU29607. Cut Site = CH/. ~j ' " |~ j 




Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGLFUAGGC or other stem II) ~ 




Scq 1 HSU29607 (Human methionine aminopeptidase mRNA, complete cds., 2569 bp) 
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1895 ! 




1898 
1899 


1900 
1901 




1905 




1907 




1910 ; 


19X1 


1912 


1913 




1916 


1917 
1918 






1921 


1922 




GGCCU UGAUGGCAUGCACUAUGCGCG ACCAACAUCA 1 


i I 
1 1 

B h 

s g 
§ b 

3 f 


GGUUU UGAUGGCAUGCACUAUGCGCG ACUUGAUAUG j 


si! 

P a < 
3 6 c 

< < ti 

888 

p o c 
sue 

III 

g 6 g 

ope 

§ 8 
B B g 

< 8 s 
8 B g 


UUUCA UGAUGGCAUGCACUAUGCGCG AAUCGGCACU 
CCUUU UGAUGGCAUGCACUAUGCGCG ACAAUCGGCA 


AAUUG UGAUGGCAUGCACUAUGCGCG AUAUACUUCU 
GGUUU UGAUGGCAUGCACUAUGCGCG AAUUGCAUAU 


AACAA UGAUGGCAUGCACUAUGCGCG ACCUUUUCCU 
AUGAA UGAUGGCAUGCACUAUGCGCG AACACCUUUU 


AUCAU UGAUGGCAUGCACUAUGCGCG AUGAACAACA 


CAUAU UGAUGGCAUGCACUAUGCGCG AUCAUGAACA j 


If 

u P 
P o 

1 § 

8 8 
§§ 
11 

< ft 

B h 

B h 

0 g 

1 1 


AACAU UGAUGGCAUGCACUAUGCGCG AAAAUUUUUC 


UCCAA UGAUGGCAUGCACUAUGCGCG AUCAAAAUUU 


UGGCA UGAUGGCAUGCACUAUGCGCG AUGUCCAACA 


AUUGG UGAUGGCAUGCACUAUGCGCG ACAUGUCCAA 


UUUAA UGAUGGCAUGCACUAUGCGCG AAGUGUUUUG 


GAUGA UGAUGGCAUGCACUAUGCGCG AUUUAACAAG 
GUUUU UGAUGGCAUGCACUAUGCGCG AUUGAUGACA 


GAAGG UGAUGGCAUGCACUAUGCGCG AAGGGUUCCA 
UGCGG UGAUGGCAUGCACUAUGCGCG AGAAGGCAAG 


AUCUG UGAUGGCAUGCACUAUGCGCG GGCAGAAGGC 


CCAAG UGAUGGCAUGCACUAUGCGCG GAUCCAGCCA 


( GCCAU UGAUGGCAUGCACUAUGCGCG AAGUAUUUAC 


UUCUU UGAUGGCAUGCACUAUGCGCG AGAGCCAUCA 


| UCACA UGAUGGCAUGCACUAUGCGCG AGAUUCUUCA 








1748 
1749 


1751 
1752 




1755 
1756 






1759 


1761 


1752 










1769 












UGAUGUUGGU G AGGCC 1 


GGAGUCCUAU G AAGUU 


CAUAUCAAGU G AAACC 


UAGAAUACAU G CUGGA 
GAAAAACAGU G CCGAU 


AGUGCCGAUU G UGAAA 
UGCCGAUUGU G AAAGG 


AGAAGUAUAU G CAAUU 
AUAUGCAAUU G AAACC 


AGGAAAAGGU G UUGUU 
AAAAGGUGUU G UUCAU 


UGUUGUUCAU G AUGAU 


UGUUCAUGAU G AUAUG 


GAUAUGGAAU G UUCAC 


GAAAAAUUUU G AUGUU 


AAAUUUUGAU G UUGGA 


UGUUGGACAU G UGCCA 


UUGGACAUGU G CCAAU 


CAAAACACUU G UUAAA 


CUUGUUAAAU G UCAUC 
UGUCAUCAAU G AAAAC 


UGGAACCCUU G CCUUC 
CUUGCCUUCU G CCGCA 


GCCUUCUGCC G CAGAU 


UGGCUGGAUC G CUUGG 


GUAAAUACUU G AUGGC 


UGAUGGCUCU G AAGAA 


< 

a 

3 
1 








1040 
1054 


1061 
1063 


1106 
1112 


1139 
1142 




rnirl 


1161 


1184 


1187 




1198 


1228 




1262 
1269 


1272 




1309 


1318 


| 1327 
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1924 1 










1930 








1936 








1942 








1950 
1951 






| 1954 j 


AGUCA UGAUGGCAUGCACUAUGCGCG ACAGAUUCUU j 


CAAGU UGAUGGCAUGCACUAUGCGCG ACACAGAUUC 


AUCUA UGAUGGCAUGCACUAUGCGCG AAUGCCCAAG | 


UGUCA UGAUGGCAUGCACUAUGCGCG AUAAUGGUGG 


AAUGU UGAUGGCAUGCACUAUGCGCG ACAUAAUGGU | 


AAUUG UGAUGGCAUGCACUAUGCGCG GCUGUAUAUG I 


CGCAA UGAUGGCAUGCACUAUGCGCG AGGAUGGUAU j 


GGACG UGAUGGCAUGCACUAUGCGCG AACAGGAUGG 1 


GCUGA UGAUGGCAUGCACUAUGCGCG AACUUCUUUA 


AUAGU UGAUGGCAUGCACUAUGCGCG AUCUCCUCUG 


UCCAA UGAUGGCAUGCACUAUGCGCG AAAGCUCAGA 


u 

jj 

ii 

E l 

< a 

ii 

< < 
§ I 


GACAA UGAUGGCAUGCACUAUGCGCG AUGUGGCAAA j 


P - 

B t 
B a 
6 § 
3 g 

1 P 
1 s 

y t 
< 2 

1 

g e 

D C 


8 c 
< < 

8 | 
1 ^ 

u 
u 

li 

D t 


AAUUA UGAUGGCAUGCACUAUGCGCG AUUAGACGGU 
UUUUU UGAUGGCAUGCACUAUGCGCG GUUGGUUAAU 


D' P < 

ill 

I < « 
o o o 

8 § e 
||| 

3 8 t 
< < < 

CD O L 

8 8 

ill 


11 

i\ 

8 h 

1 § 
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1 I 
1 ? 

D C 

1 I 
; 1 g 


CAAAA UGAUGGCAUGCACUAUGCGCG AAAACGUAUA 


< 

8 
§ 
B 

1 


j UAUGG UGAUGGCAUGCACUAUGCGCG AAUAUAAAUA 


1775 1 


1776 






















1791 


1792 




1796 
1797 




1801 






j 1805 


AAGAAUCUGU G UGACU 1 


GAAUCUGUGU G ACUUG 


CUUGGGCAUU G UAGAU 


CCACCAUUAU G UGACA 


ACCAUUAUGU G ACAUU 


CAUAUACAGC G CAAUU 


AUACCAUCCU G UUGCG 


CCAUCCUGUU G CGUCC 


UAAAGAAGUU G UCAGC 


CAGAGGAGAU G ACUAU 


UCUGAGCUUU G UUGGA 


UGGAAAACAU G AUACC 


UUUGCCACAU G UUGUC 


GCCACAUGUU G UCUGU 


GUGGACCCAU G UAAUA 


ACCGUCUAAU G UAAUU 
AUUAACCAAC G AAAAA 


ACUUUUAAAU G CUAAC 
AAUGCUAACU G UUUUU 


CUAGGAAAAU G CUAUA 


UAUACGUUUU G UUUUG 


GUUUUGUUUU G AAUAC 


| 
CD 

| 


1329 1 


-narl 




1365 


1367 


1390 






1436 






^\ 


1539 


1542 




1624 

1637 


1662 
1669 


1696 


1737 


1742 


| 1777 
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mil n 


II III 


i 1 

5 b 1 1 1 i § 


GCCACCU C AACACCU 
AAAAGCU U UCCGGAC 
UCCAGUU C UGCACCU 
ACACCCU A ACAUCAC 
CCUCGUC U CUCUCGG 

GACCCUC C CUCAGUU 
ACCCUCC C UCAGUUC 


UCAGCAC C ACACAAC 
ACCACAC A ACACCUA 

AUCCAGUUCU G CACCU 
UGAACAACCU G CUCCU 
GCUCCUGAAU G ACUAC 


iN II ill! 


1 1 S 1 1 II 


11 III 


i 11 
1 11 


AGGUGUU CUGAUGAGGCCGUUAGGCCGAA AGGUGGC 
GUCCGGA CUGAUGAGGCCGUUAGGCCGAA AGCUUUU 
AGGUGCA CUGAUGAGGCCGUUAGGCCGAA AACUGGA 
GUGAUGU CUGAUGAGGCCGUUAGGCCGAA AGGGUGU 
CCGAGAG CUGAUGAGGCCGUUAGGCCGAA GACGAGG 

AACUGAG CUGAUGAGGCCGUUAGGCCGAA lAGGGUC 
GAACUGA CUGAUGAGGCCGUUAGGCCGAA IGAGGGU 


GUUGUGU CUGAUGAGGCCGUUAGGCCGAA lUGCUGA 
UAGGUGU CUGAUGAGGCCGUUAGGCCGAA lUGUGGU 

AGGUG UGAUGGCAUGCACUAUGCGCG AGAACUGGAU 
AGGAG UGAUGGCAUGCACUAUGCGCG AGGUUGUUCA 
GUAGU UGAUGGCAUGCACUAUGCGCG AUUCAGGAGC 


Alias 
HH 

MAP2-11 
MAP2-15 
MAP2-464 
MAP2-911 
MAP2-1290 
MAP2-1342 


minis 


MAP2-1901 
MAP2-1906 

G-Cleaver 

MAP2-1B21 
MAP2-2086 
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Table 13: Human telomerase reverse transcriptase (TERT) Hammerhead 
Ribozyme and Target Sequence 



nt. 


Ribozyme Sequence 




Seq ID 


Substrate Sequence 


Seq ID 1 


Position 


_J>Jos. 




N °'- — 


13 


CGCAGCAG CUGAUGAG X CGAA ACGCAGCG 




CGCTGCGT C CTGCTGCG 




68 


GCAGCGGG CUGAUGAG X CGAA AGCGCGCG 




CGCGCGCT C CCCGCTGC 




90 


GCAGCAGG CUGAUGAG X CGAA AGCGCACG 




CGTGCGCT C CCTGCTGC 




108 


CCUCGCGG CUGAUGAG X CGAA AGUGGCUG 




CAGCCACT A CCGCGAGG 




135 


GCCGCACG CUGAUGAG X CGAA ACGUGGCC 




GGCCACGT T CGTGCGGC 




136 


CGCCGCAC CUGAUGAG X CGAA AACGUGGC 




GCCACGTT C GTGCGGCG 





194 


CGCGCGGA CUGAUGAG X CGAA AGCCGCCG 




CGGCGGCT T TCCGCGCG 




195 


GCGCGCGG CUGAUGAG X CGAA AAGCCGCC 




GGCGGCTT T CCGCGCGC 




196 


AGCGCGCG CUGAUGAG X CGAA AAAGCCGC 




GCGGCTTT C CGCGCGCT 




264 


GGCGGAAG CUGAUGAG X CGAA AGGGGGCG 




CGCCCCCT C CTTCCGCC 




267 


CCUGGCGG CUGAUGAG X CGAA AGGAGGGG 




CCCCTCCT T CCGCCAGG 




268 


ACCUGGCG CUGAUGAG X CGAA AAGGAGGG 




CCCTCCTT C CGCCAGGT 




279 


UCAGGCAG CUGAUGAG X CGAA ACACCUGG 




CCAGGTGT C CTGCCTGA 




351 


CGAAGCCG CUGAUGAG X CGAA AGGCCAGC 




GCTGGCCT T CGGCTTCG 




352 


GCGAAGCC CUGAUGAG X CGAA AAGGCCAG 




CTGGCCTT C GGCTTCGC 





357 


GCAGCGCG CUGAUGAG X CGAA AGCCGAAG 




CTTCGGCT T CGCGCTGC 




358 


AGCAGCGC CUGAUGAG X CGAA AAGCCGAA 




TTCGGCTT C GCGCTGCT 




399 


UGGUGGUG CUGAUGAG X CGAA AGGCCUCG 




CGAGGCCT T CACCACCA 




400 


CUGGUGGU CUGAUGAG X CGAA AAGGCCUC 




GAGGCCTT C ACCACCAG 





420 


UGGGCAGG CUGAUGAG X CGAA AGCUGCGC 




GCGCAGCT A CCTGCCCA . 




505 


AGCAGGUG CUGAUGAG X CGAA ACCAGCAC 




GTGCTGGT T CACCTGCT 





506 


CAGCAGGU CUGAUGAG X CGAA AACCAGCA 




TGCTGGTT C ACCTGCTG 




529 


AGCACAAA CUGAUGAG X CGAA AGCGCGCA 




TGCGCGCT C TTTGTGCT 





531 


CCAGCACA CUGAUGAG X CGAA AGAGCGCG 




CGCGCTCT T TGTGCTGG 




532 


ACCAGCAC CUGAUGAG X CGAA AAGAGCGC 




GCGCTCTT T GTGCTGGT 




545 


GCAGCUGG CUGAUGAG X CGAA AGCCACCA 




TGGTGGCT C CCAGCTGC 




558 


ACACCUGG CUGAUGAG X CGAA AGGCGCAG 




CTGCGCCT A CCAGGTGT 




582 


CGAGCUGG CUGAUGAG X CGAA ACAGCGGC 




GCCGCTGT A CCAGCTCG 




589 


GCAGCGCC CUGAUGAG X CGAA AGCUGGUA 




TACCAGCT C GGCGCTGC 




602 


CCGGGCCU CUGAUGAG X CGAA AGUGGCAG 




CTGCCACT C AGGCCCGG 




626 


GGGUCCAC CUGAUGAG X CGAA AGCGUUUU 




CACACGCT A GTGGACCC 






GCAUCCCA CUGAUGAG X CGAA ACGCCUUC 




GAAGGCGT C TGGGATGC 




671 


CCUGACGC CUGAUGAG X CGAA AUGGUUCC 




GGAACCAT A GCGTCAGG 




676 


GCCUCCCU CUGAUGAG X CGAA ACGCUAUG 




CATAGCGT C AGGGAGGC 




691 


CCCAGGGG CUGAUGAG X CGAA ACCCCGGC 




GCCGGGGT C CCCCTGGG 




749 


CAACGGCA CUGAUGAG X CGAA ACUUCGGC 




GCCGAAGT C TGCCGTTG 




756 


UCUUGGGC CUGAUGAG X CGAA ACGGCAGA 




TCTGCCGT T GCCCAAGA 




808 


CCCUGCCC CUGAUGAG X CGAA ACGGGCGU 




ACGCCCGT T GGGCAGGG 




819 


GGGCCCAG CUGAUGAG X CGAA ACCCCUGC 




GCAGGGGT C CTGGGCCC 




863 


CACACAGA CUGAUGAG X CGAA ACCACGGU 




ACCGTGGT T TCTGTGTG 




864 


CCACACAG CUGAUGAG X CGAA AACCACGG 




CCGTGGTT T CTGTGTGG 




865 


ACCACACA CUGAUGAG X CGAA AAACCACG 




CGTGGTTT C TGTGTGGT 




876 


UGGCAGGU CUGAUGAG X CGAA ACACCACA 




TGTGGTGT C ACCTGCCA 
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906 


CCUCCAAA CUGAUGAG X CGAA AGGUGGCU 




AGCCACCT C TTTGGAGG 




908 


ACCCUCCA CUGAUGAG X CGAA AGAGGUGG 




CCACCTCT T TGGAGGGT 





909 


CACCCUCC CUGAUGAG X CGAA AAGAGGUU 




CACCTCTT T GGAGGGTG 




922 


GUGCCAGA CUGAUGAG X CGAA AGCGCACC 




GGTGCGCT C TCTGGCAC 




924 


GCGUGCCA CUGAUGAG X CGAA AGAGCGCA 




TGCGCTCT C TGGCACGC 




939 


AUGGGUGG CUGAUGAG X CGAA AGUGGCGC 




GCGCCACT C CCACCCAT 




94B 


GGCCCACG CUGAUGAG X CGAA AUGGGUGG 




CCACCCAT C CGTGGGCC 




981 


GCGAUGUG CUGAUGAG X CGAA AUGGGGGG 




CCCCCCAT C CACATCGC 




987 


GUGGCCGC CUGAUGAG X CGAA AUGUGGAU 




ATCCACAT C GCGGCCAC 




1001 


GUCCCAGG CUGAUGAG X CGAA ACGUGGUG 




CACCACGT C CCTGGGAC 




1016 


CGGGGGAC CUGAUGAG X CGAA AGGCGUGU 




ACACGCCT T GTCCCCCG 




1019 


CACCGGGG CUGAUGAG X CGAA ACAAGGCG 




CGCCTTGT C CCCCGGTG 




1029 


UCUCGGCG CUGAUGAG X CGAA ACACCGGG 








1047 


AGUAGAGG CUGAUGAG X CGAA AGUGCUUG 




CAAGCACT T CCTCTACT 




1048 


GAGUAGAG CUGAUGAG X CGAA AAGUGCUU 




AAGCACTT C CTCTACTC 




1051 


GAGGAGUA CUGAUGAG X CGAA AGGAAGUG 




CACTTCCT C TACTCCTC 




1053 


CUGAGGAG CUGAUGAG X CGAA AGAGGAAG 




CTTCCTCT A CTCCTCAG 




1056 


CGCCUGAG CUGAUGAG X CGAA AGUAGAGG 




CCTCTACT C CTCAGGCG 




1059 


UGUCGCCU CUGAUGAG X CGAA AGGAGUAG 




CTACTCCT C AGGCGACA 




1086 


GUAGGAAG CUGAUGAG X CGAA AGGGCCGC 








1089 


UGAGUAGG CUGAUGAG X CGAA AGGAGGGC 




GCCCTCCT T CCTACTCA 




1090 


CUGAGUAG CUGAUGAG X CGAA AAGGAGGG 




CCCTCCTT C CTACTCAG 




1093 


GAGCUGAG CUGAUGAG X CGAA AGGAAGGA 




TCCTTCCT A CTCAGCTC 




1096 


AGAGAGCU CUGAUGAG X CGAA AGUAGGAA 








1101 


GCCUCAGA CUGAUGAG X CGAA AGCUGAGU 




ACTCAGCT C TCTGAGGC 




1103 


GGGCCUCA CUGAUGAG X CGAA AGAGCUGA 








1127 


GAGCCUCC CUGAUGAG X CGAA AGCGCCAG 




CTGGCGCT C GGAGGCTC 




1135 


GUCUCCAC CUGAUGAG X CGAA AGCCUCCG 




CGGAGGCT C GTGGAGAC 





1147 


CCCAGAAA CUGAUGAG X CGAA AUGGUCUC 




GAGACCAT C TTTCTGGG 




1149 


AACCCAGA CUGAUGAG X CGAA AGAUGGUC 




GACCATCT T TCTGGGTT 




1150 


GAACCCAG CUGAUGAG X CGAA AAGAUGGU 




ACCATCTT T CTGGGTTC 





1151 


GGAACCCA CUGAUGAG X CGAA AAAGAUGG 




CCATCTTT C TGGGTTCC 




1157 


GGGCCUGG CUGAUGAG X CGAA ACCCAGAA 




TTCTGGGT T CCAGGCCC 




1158 


AGGGCCUG CUGAUGAG X CGAA AACCCAGA 




TCTGGGTT C CAGGCCCT 




1181 


CCUGCGGG CUGAUGAG X CGAA AGUCCCUG 




CAGGGACT C CCCGCAGG 




1191 


GGCGGGGC CUGAUGAG X CGAA ACCUGCGG 




CCGCAGGT T GCCCCGCC 




1212 


UUUGCCAG CUGAUGAG X CGAA AGCGCUGG 




CCAGCGCT A CTGGCAAA 




1233 


GCUCCAGA CUGAUGAG X CGAA ACAGGGGC 




GCCCCTGT T TCTGGAGC 






AGCUCCAG CUGAUGAG X CGAA AACAGGGG 




CCCCTGTT T CTGGAGCT 




1235 


CAGCUCCA CUGAUGAG X CGAA AAACAGGG 




CCCTGTTT C TGGAGCTG 




1245 


UGGUUCCC CUGAUGAG X CGAA AGCAGCUC 




GAGCTGCT T GGGAACCA 




1269 


GCACCCCG CUGAUGAG X CGAA AGGGGCAC 




GTGCCCCT A CGGGGTGC 




1279 


GUCUUGAG CUGAUGAG X CGAA AGCACCCC 




GGGGTGCT C CTCAAGAC 




1282 


UGCGUCUU CUGAUGAG X CGAA AGGAGCAC 




GTGCTCCT C AAGACGCA 




1312 


GCUGGGGU CUGAUGAG X CGAA ACCGCAGC 




GCTGCGGT C ACCCCAGC 




1330 


CGGGCACA CUGAUGAG X CGAA ACACCGGC 




GCCGGTGT C TGTGCCCG 




1356 


CCGCCACA CUGAUGAG X CGAA AGCCCUGG 




CCAGGGCT C TGTGGCGG 
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1394 


CACCAGGC CUGAUGAG X CGAA ACGGGGGU 




ACCCCCGT C GCCTGGTG ^ 




1411 


UGCUGGCG CUGAUGAG X CGAA AGCAGCUG 




CAGCTGCT C CGCCAGCA 




1440 


CGAAGCCG CUGAUGAG X CGAA ACACCUGC 




GCAGGTGT A CGGCTTCG 




1446 


CCCGCACG CUGAUGAG X CGAA AGCCGUAC 




GTACGGCT T CGTGCGGG 




1447 


GCCCGCAC CUGAUGAG X CGAA AAGCCGUA 




TACGGCTT C GTGCG GC 




1486 


GAGCCCCA CUGAUGAG X CGAA AGGCCUGG 




CCAGGCCT C TGGGGCTC 




1494 


UGUGCCUG CUGAUGAG X CGAA AGCCCCAG 




CTGGGGCT C CAGGCACA 




1515 


UCCUGAGG CUGAUGAG X CGAA AGCGGCGU 




ACGCCGCT T CCTCAGGA 




1516 


UUCCUGAG CUGAUGAG X CGAA AAGCGGCG 




CGCCGCTT C CTCAGGAA 




1519 


GUGUUCCU CUGAUGAG X CGAA AGGAAGCG 




CGCTTCCT C AGGAACAC 




1536 


GGGAGAUG CUGAUGAG X CGAA ACUUCUUG 




CAAGAAGT T CATCTCCC 




1537 


AGGGAGAU CUGAUGAG X CGAA AACUUCUU 




AAGAAGTT C ATCTCCCT 




1540 


CCCAGGGA CUGAUGAG X CGAA AUGAACUU 




AAGTTCAT C TCCCTGGG 




1542 


UCCCCAGG CUGAUGAG X CGAA AGAUGAAC 




GTTCATCT C CCTGGGGA 




1564 


OGCAGCGA CUGAUGAG X CGAA AGCUUGGC 




GCCAAGCT C TCGCTGCA 




1566 


CCUGCAGC CUGAUGAG X CGAA AGAGCUUG 




CAAGCTCT C GCTGCAGG 




1610 


GCGCAGCC CUGAUGAG X CGAA AGCGCAGU 




ACTGCGCT T GGCTGCGC 


■ 


1633 


ACACAGCC CUGAUGAG X CGAA ACCCCUGG 




CCAGGGGT T GGCTGTGT 




1642 


GCGGCCGG CUGAUGAG X CGAA ACACAGCC 




GGCTGTGT T CCGGCCGC 




1643 


UGCGGCCG CUGAUGAG X CGAA AACACAGC 




GCTGTGTT C CGGCCGCA 




1661 


CUCACGCA CUGAUGAG X CGAA ACGGUGCU 




AGCACCGT C TGCGTGAG 




1675 


UUGGCCAG CUGAUGAG X CGAA AUCUCCUC 




GAGGAGAT C CTGGCCAA 




1686 


AGUGCAGG CUGAUGAG X CGAA ACUUGGCC 




GGCCAAGT T CCTGCACT 




1687 


CAGUGCAG CUGAUGAG X CGAA AACUUGGC 




GCCAAGTT C CTGCACTG 




1710 


CGACGACG CUGAUGAG X CGAA ACACACUC 




GAGTGTGT A CGTCGTCG 




1714 


AGCUCGAC CUGAUGAG X CGAA ACGUACAC 




GTGTACGT C GTCGAGCT 




1717 


AGCAGCUC CUGAUGAG X CGAA ACGACGUA 




TACGTCGT C GAGCTGCT 





1726 


AAAGACCU CUGAUGAG X CGAA AGCAGCUC 




GAGCTGCT C AGGTCTTT 




1731 


AAAAGAAA CUGAUGAG X CGAA ACCUGAGC 




GCTCAGGT C TTTCTTTT 




1733 


AUAAAAGA CUGAUGAG X CGAA AGACCUGA 




TCAGGTCT T TCTTTTAT 




1734 


CAUAAAAG CUGAUGAG X CGAA AAGACCUG 




CAGGTCTT T CTTTTATG 




1735 


ACAUAAAA CUGAUGAG X CGAA AAAGACCU 




AGGTCTTT C TTTTATGT 




1737 


UGACAUAA CUGAUGAG X CGAA AGAAAGAC 




GTCTTTCT T TTATGTCA 




1738 


GUGACAUA CUGAUGAG X CGAA AAGAAAGA 




TCTTTCTT T TATGTCAC 




1739 


CGUGACAU CUGAUGAG X CGAA AAAGAAAG 




CTTTCTTT T ATGTCACG 




1740 


CCGUGACA CUGAUGAG X CGAA AAAAGAAA 




TTTCTTTT A TGTCACGG 




1744 


GUCUCCGU CUGAUGAG X CGAA ACAUAAAA 




TTTTATGT C ACGGAGAC 




1758 


UCUUUUGA CUGAUGAG X CGAA ACGUGGUC 




GACCACGT T TCAAAAGA 






UUCUUUUG CUGAUGAG X CGAA AACGUGGU 




ACCACGTT T CAAAAGAA 




1760 


GUUCUUUU CUGAUGAG X CGAA AAACGUGG 




CCACGTTT C AAAAGAAC 




1774 


UAGAAAAA CUGAUGAG X CGAA AGCCUGUU 




AACAGGCT C TTTTTCTA 




1776 


GGUAGAAA CUGAUGAG X CGAA AGAGCCUG 




CAGGCTCT T TTTCTACC 




1777 


CGGUAGAA CUGAUGAG X CGAA AAGAGCCU 




AGGCTCTT T TTCTACCG 




1778 


CCGGUAGA CUGAUGAG X CGAA AAAGAGCC 




GGCTCTTT T TCTACCGG 




1779 


UCCGGUAG CUGAUGAG X CGAA AAAAGAGC 




GCTCTTTT T CTACCGGA 




1780 


UUCCGGUA CUGAUGAG X CGAA AAAAAGAG 




CTCTTTTT C TACCGGAA 




1782 


UCUUCCGG CUGAUGAG X CGAA AGAAAAAG 




CTTTTTCT A CCGGAAGA 
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1795 


UUGCUCCA CUGAUGAG X CGAA ACACUCUU 




AAGAGTGT C TGGAGCAA 




1806 


UGCUUUGC CUGAUGAG X CGAA ACUUGCUC 




GAGCAAGT T GCAAAGCA 




1816 


CUGAUUCC CUGAUGAG X CGAA AUGCUUUG 




CAAAGCAT T GGAATCAG 




1822 


UGCUGUCU CUGAUGAG X CGAA AUUCCAAU 




ATTGGAAT C AGACAGCA 




1833 


CCCUCUUC CUGAUGAG X CGAA AGUGCUGU 




ACAGCACT T GAAGAGGG 




1860 


CUGCUUCC CUGAUGAG X CGAA ACAGCUCC 








1873 


UGCUGCCU CUGAUGAG X CGAA ACCUCUGC 




GCAGAGGT C AGGCAGCA 




1883 


GGCUUCCC CUGAUGAG X CGAA AUGCUGCC 




GGCAGCAT C GGGAAGCC 




1911 


GGAGUCUG CUGAUGAG X CGAA ACGUCAGC 




GCTvjACGT A A 




1918 


AUGAAGCG CUGAUGAG X CGAA AGUCUGGA 




TCCAGACT C CGCTT AT 




1923 


UGGGGAUG CUGAUGAG X CGAA AGCGGAGU 




ACTCCGCT t catcccca 




1924 


UUGGGGAU CUGAUGAG X CGAA AAGCGGAG 




CTCCGCTT C ATCCCCAA 




1927 


GGCUUGGG CUGAUGAG X CGAA AUGAAGCG 




CGCTTCAT C CCCAAGCC 




1954 


AUGUUCAC CUGAUGAG X CGAA AUCGGCCG 




CGGCCGAT T GTGAA AT 




1968 


CCACGACG CUGAUGAG X CGAA AGUCCAUG 




CATGGACT A CGTCGTGG 




1972 


GCUCCCAC CUGAUGAG X CGAA ACGUAGUC 




GACTACGT C GTGGGAGC 




1989 


CUCUGCGG CUGAUGAG X CGAA ACGUUCUG 




CAGAACGT T CCGCAGAG 




1990 


UCUCUGCG CUGAUGAG X CGAA AACGUUCU 




AGAACGTT C CGCAGAGA 




2015 


CGAGGUGA CUGAUGAG X CGAA ACGCUCGG 




- CCGAGCGT C TCACCTCG 




2017 


CUCGAGGU CUGAUGAG X CGAA AGACGCUC 




GAGCGTCT C ACCTCGAG 




2022 


UCACCCUC CUGAUGAG X CGAA AGGUGAGA 




TCTCACCT C GAGGGTGA 




2040 


GCACGCUG CUGAUGAG X CGAA ACAGUGCC 




GGCACTGT T CAGCGTGC 




2041 


AGCACGCU CUGAUGAG X CGAA AACAGUGC 




GCACTGTT C AGCGTGCT 




2050 


UCGUAGUU CUGAUGAG X CGAA AGCACGCU 




AGCGTGCT C AACTACGA 




2055 


CCCGCUCG CUGAUGAG X CGAA AGUUGAGC 




GCTCAACT A CGAGCGGG 




2080 


GCGCCCAG CUGAUGAG X CGAA AGGCCGGG 




CCCGGCCT C CTGGGCGC^ 




2091 


CCAGCACA CUGAUGAG X CGAA AGGCGCCC 




GGGCGCCT C TGTGCTGG 




2111 


CCUGUGGA CUGAUGAG X CGAA AUCGUCCA 




TGGACGAT A TCCACAGG 




2113 


GCCCUGUG CUGAUGAG X CGAA AUAUCGUC 








2133 


GCAGCACG CUGAUGAG X CGAA AGGUGCGC 




GCGCACCT T CGTGCTGC 




2134 


CGCAGCAC CUGAUGAG X CGAA AAGGUGCG 




CGCACCTT C GTG T 




2175 


UGACAAAG CUGAUGAG X CGAA ACAGCUCA 




TGAGCTGT A CTTTGTCA 




2178 


CCUUGACA CUGAUGAG X CGAA AGUACAGC 




GCTGTACT T TGTCAAGG 




2179 


ACCUUGAC CUGAUGAG X CGAA AAGUACAG 




CTGTACTT T GTCAAGGT 




2182 


UCCACCUU CUGAUGAG X CGAA ACAAAGUA 




TACTTTGT C AAG 




2205 


UGGUGUCG CUGAUGAG X CGAA ACGCGCCC 




GGGCGCGT A CGACACCA 




2215 


UCCUGGGG CUGAUGAG X CGAA AUGGUGUC 




GA A 




2230 


ACCUCCGU CUGAUGAG X CGAA AGCCUGUC 










CUGGCGAU CUGAUGAG X CGAA ACCUCCGU 




ACGGAGGT C ATCGCCAG 




2242 


AUGCUGGC CUGAUGAG X CGAA AUGACCUC 




GAGGTCAT C GCCAGCAT 




2251 


GGUUUGAU CUGAUGAG X CGAA AUGCUGGC 




GCCAGCAT C ATCAAACC 




2254 


UGGGGUUU CUGAUGAG X CGAA AUGAUGCU 




AGCATCAT C AAACCCCA 




2271 


GCACGCAG CUGAUGAG X CGAA ACGUGUUC 




GAACACGT A CTGCGTGC 




2282 


GGCAUACC CUGAUGAG X CGAA ACGCACGC 




GCGTGCGT C GGTATGCC 




2286 


CCACGGCA CUGAUGAG X CGAA ACCGACGC 




GCGTCGGT A TGCCGTGG 




2296 


GCCUUCUG CUGAUGAG X CGAA ACCACGGC 




GCCGTGGT C CAGA^GGC 




2320 


GCCUUGCG CUGAUGAG X CGAA ACGUGCCC 




GGGCACGT C CGCAAGGC 
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GGCUCUUG CUGAUGAG X CGAA AGGCCUUG 



CAAGGCCT T CAAGAGCC 



UGGCUCUU CUGAUGAG X CGAA AAGGCCUU 



AAGGCCTT C AAGAGCCA 



AAGGUAGA CUGAUGAG X CGAA ACGUGGCU 



AGCCACGT C TCTACCTT 



UCAAGGUA CUGAUGAG X CGAA AGACGUGG 



CCACGTCT C TACCTTGA 



UGUCAAGG CUGAUGAG X CGAA AGAGACGU 



ACGTCTCT A CCTTGACA 



GGUCUGUC CUGAUGAG X CGAA AGGUAGAG 



CTCTACCT T GACAGACC 



UACGGCUG CUGAUGAG X CGAA AGGUCUGU 



ACAGACCT C CAGCCGTA 



GUCGCAUG CUGAUGAG X CGAA ACGGCUGG 



CCAGCCGT A CATGCGAC 



GAGCCACG CUGAUGAG X CGAA ACUGUCGC 



GCGACAGT T CGTGGCTC 



UGAGCCAC CUGAUGAG X CGAA AACUGUCG 



CGACAGTT C GTGGCTCA 



CUGCAGGU CUGAUGAG X CGAA AGCCACGA 



TCGTGGCT C ACCTGCAG 



UCGAUGAC CUGAUGAG X CGAA ACGGCAUC 



GATGCCGT C GTCATCGA 



UGCUCGAU CUGAUGAG X CGAA ACGACGGC 



GCCGTCGT C ATCGAGCA 



CUCUGCUC CUGAUGAG X CGAA AUGACGAC 



GTCGTCAT C GAGCAGAG 



UCAGGGAG CUGAUGAG X CGAA AGCUCUGC 



GCAGAGCT C CTCCCTGA 



CAUUCAGG CUGAUGAG X CGAA AGGAGCUC 



GAGCTCCT C CCTGAATG 



ACGUCGAA CUGAUGAG X CGAA AGGCCACU 



AGTGGCCT C TTCGACGT 



AGACGUCG CUGAUGAG X CGAA AGAGGCCA 



TGGCCTCT T CGACGTCT 



AAGACGUC CUGAUGAG X CGAA AAGAGGCC 



GGCCTCTT C GACGTCTT 



CGUAGGAA CUGAUGAG X CGAA ACGUCGAA 



TTCGACGT C TTCCTACG 



AGCGUAGG CUGAUGAG X CGAA AGACGUCG 



CGACGTCT T CCTACGCT 



AAGCGUAG CUGAUGAG X CGAA AAGACGUC 



GACGTCTT C CTACGCTT 



AUGAAGCG CUGAUGAG X CGAA AGGAAGAC 



GTCTTCCT A CGCTTCAT 



GGCACAUG CUGAUGAG X CGAA AGCGUAGG 



CCTACGCT T CATGTGCC 



UGGCACAU CUGAUGAG X CGAA AAGCGUAG 



CTACGCTT C ATGTGCCA 



UUGCCCCU CUGAUGAG X CGAA AUGCGCAC 



GTGCGCAT C AGGGGCAA 



GGACGUAG CUGAUGAG X CGAA ACUUGCCC 



GGGCAAGT C CTACGTCC 



ACUGGACG CUGAUGAG X CGAA AGGACUUG 



CAAGTCCT A CGTCCAGT 



UGGCACUG CUGAUGAG X CGAA ACGUAGGA 



TCCTACGT C CAGTGCCA 



CCCUGCGG CUGAUGAG X CGAA AUCCCCUG 



CAGGGGAT C CCGCAGGG 



AGAGGAUG CUGAUGAG X CGAA AGCCCUGC 



GCAGGGCT C CATCCTCT 



GUGGAGAG CUGAUGAG X CGAA AUGGAGCC 



GGCTCCAT C CTCTCCAC 



AGCGUGGA CUGAUGAG X ( 



TCCATCCT C TCCACGCT 



GCAGCGUG CUGAUGAG X CGAA AGAGGAUG 



CATCCTCT C CACGCTGC 



AGGCUGCA CUGAUGAG X CGAA AGCAGCGU 



ACGCTGCT C TGCAGCCT 



UGUCGCCG CUGAUGAG X CGAA AGCACAGG 



CCTGTGCT A CGGCGACA 



UCCCCGCA CUGAUGAG X CGAA ACAGCUUG 



CAAGCTGT T TGCGGGGA 



AUCCCCGC CUGAUGAG X CGAA AACAGCUU 



UCCCGCCG CUGAUGAG X CGAA AUCCCCGC 



: T GCGGGGAT 
: T CGGCGGGA 



GUCCCGCC CUGAUGAG X CGAA AAUCCCCG 



CGGGGATT C GGCGGGAC 



AAACGCAG CUGAUGAG X CGAA AGCAGCCC 



GGGCTGCT C CTGCGTTT 



AUCCACCA CUGAUGAG X CGAA ACGCAGGA 



TCCTGCGT T TGGTGGAT 



CAUCCACC CUGAUGAG X CGAA * 



CAACAAGA CUGAUGAG X CGAA P 



CCTGCGTT T GGTGGATG 



TGGATGAT T TCTTGTTG 



CCAACAAG CUGAUGAG X CGAA AAUCAUCC 



GGATGATT T CTTGTTGG 



ACCAACAA CUGAUGAG X CGAA AAAUCAUC 



GATGATTT C TTGTTGGT 



UCACCAAC CUGAUGAG X CGAA AGAAAUCA 



TGATTTCT T GTTGGTGA 
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GUGUCACC CUGAUGAG X CGAA ACAAGAAA 



TTTCTTGT T GGTGACAC 



GGUGAGGU CUGAUGAG X CGAA AGGUGUCA 



TGACACCT C ACCTCACC 



GCGUGGGU CUGAUGAG X CGAA AGGUGAGG 



CCTCACCT C ACCCACGC 



UCCUGAGG CUGAUGAG X CGAA AGGUUUUC 



GAAAACCT T CCTCAGGA 



GUCCUGAG CUGAUGAG X CGAA AAGGUUUU 



AAAACCTT C CTCAGGAC 



AGGGUCCU CUGAUGAG X CGAA AGGAAGGU 



ACCTTCCT C AGGACCCT 



ACACCUCG CUGAUGAG X CGAA ACCAGGGU 



ACCCTGGT C CGAGGTGT 



UACUCAGG CUGAUGAG X CGAA ACACCUCG 



CGAGGTGT C CCTGAGTA 



CGCAGCCA CUGAUGAG X CGAA ACUCAGGG 



CCCTGAGT A TGGCTGCG 



UCUUCCGC CUGAUGAG X CGAA AGUUCACC 



GGTGAACT T GCGGAAGA 



CUACAGGG CUGAUGAG X CGAA AGUUCACC 



GGTGAACT T CCCTGTAG 



UCUACAGG CUGAUGAG X CGAA AAGUUCAC 



GTGAACTT C CCTGTAGA 



UCGUCUUC CUGAUGAG X CGAA ACAGGGAA 



CUGAACAA CUGAUGAG X CGAA AGCCGUGC 



GCACGGCT 1 



UCUGAACA CUGAUGAG X CGAA AAGCCGUG 



CACGGCTT T TGTTCAGA 



AUCUGAAC CUGAUGAG X CGAA AAAGCCGU 



ACGGCTTT T GTTCAGAT 



GGCAUCUG CUGAUGAG X CGAA ACAAAAGC 



GCTTTTGT T CAGATGCC 



CGGCAUCU CUGAUGAG X CGAA AACAAAAG 



CTTTTGTT C AGATGCCG 



CAGGGGAA CUGAUGAG X CGAA AGGCCGUG 



CACGGCCT A TTCCCCTG 



ACCAGGGG CUGAUGAG X CGAA AUAGGCCG 



CGGCCTAT T CCCCTGGT 



CACCAGGG CUGAUGAG X CGAA AAUAGGCC 



GGCCTATT C CCCTGGTG 



GGUCCGGG CUGAUGAG X CGAA AUCCAGCA 



TGCTGGAT A CCCGGACC 



AGCUGGAG CUGAUGAG X CGAA AGUCGCUC 



GAGCGACT A CTCCAGCT 



CAUAGCUG CUGAUGAG X CGAA AGUAGUCG 



CGACTACT C CAGCTATG 



UCCGGGCA CUGAUGAG X CGAA AGCUGGAG 



CTCCAGCT A TGCCCGGA 



CUCUGAUG CUGAUGAG X CGAA AGGUCCGG 



CCGGACCT C CATCAGAG 



CUGGCUCU CUGAUGAG X CGAA AUGGAGGU 



ACCTCCAT C AGAGCCAG 



GAAGGUGA CUGAUGAG X C 



GAGCCAGT C TCACCTTC 



UUGAAGGU CUGAUGAG X CGAA AGACUGGC 



GCCAGTCT C ACCTTCAA 



CGCGGUUG CUGAUGAG X CGAA AGGUGAGA 



TCTCACCT T CAACCGCG 



CCGCGGUU CUGAUGAG X CGAA AAGGUGAG 



7 C AACCGCGG 



CAGCCUUG CUGAUGAG X CGAA AGCCGCGG 



CCGCGGCT T CAAGGCTG 



CCAGCCUU CUGAUGAG X CGAA AAGCCGCG 



CGCGGCTT C AAGGCTGG 



GAGUUUGC CUGAUGAG X CGAA ACGCAUGU 



ACATGCGT C GCAAACTC 



ACCCCAAA CUGAUGAG X CGAA AGUUUGCG 



CGCAAACT C TTTGGGGT 



AGACCCCA CUGAUGAG X CGAA AGAGUUUG 



CAAACTCT T TGGGGTCT 



AAGACCCC CUGAUGAG X CGAA AAGAGUUU 



AAACTCTT T GGGGTCTT 



AGCCGCAA CUGAUGAG X CGAA ACCCCAAA 



TTTGGGGT C TTGCGGCT 



UCAGCCGC CUGAUGAG X CGAA AGACCCCA 



TGGGGTCT T GCGGCTGA 



CAGGCUGU CUGAUGAG X CGAA ACACUUCA 



TGAAGTGT C ACAGCCTG 



AAUCCAGA CUGAUGAG X CGAA ACAGGCUG 



CAGCCTGT T TCTGGATT 



AAAUCCAG CUGAUGAG X CGAA AACAGGCU 



AGCCTGTT T CTGGATTT 



CAAAUCCA CUGAUGAG X CGAA AAACAGGC 



GCCTGTTT C TGGATTTG 



CACCUGCA CUGAUGAG X CGAA AUCCAGAA. 



TTCTGGAT T TGCAGGTG 



UCACCUGC CUGAUGAG X CGAA AAUCCAGA 



TCTGGATT T GCAGGTGA 



ACCGUCUG CUGAUGAG X CGAA AGGCUGUU 



AACAGCCT C CAGACGGT 



AUCUUGUA CUGAUGAG X CGAA AUGUUGGU 



ACCAACAT C TACAAGAT 
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GGAUCUUG CUGAUGAG X CGAA AGAUGUUG 



CAACATCT A CAAGATCC 



AGCAGGAG CUGAUGAG X CGAA AUCUUGUA 



TACAAGAT C CTCCTGCT 



UGCAGCAG CUGAUGAG X CGAA AGGAUCUU 



AAGATCCT C CTGCTGCA 



GAAACCUG CUGAUGAG X CGAA ACGCCUGC 



GCAGGCGT A CAGGTTTC 



AUGCGUGA CUGAUGAG X CGAA ACCUGUAC 



GTACAGGT T TCACGCAT 



CAUGCGUG CUGAUGAG X CGAA AACCUGUA 



TACAGGTT T CACGCATG 



ACAUGCGU CUGAUGAG X CGAA AAACCUGU 



ACAGGTTT C ACGCATGT 



UGAAAUGG CUGAUGAG X CGAA AGCUGCAG 



GCUGAUGA CUGAUGAG X CGAA AUGGGAGC 



GCTCCCAT T TCATCAGC 



UGCUGAUG CUGAUGAG X CGAA AAUGGGAG 



CTCCCATT T CATCAGCA 



UUGCUGAU CUGAUGAG X CGAA AAAUGGGA 



TCCCATTT C ATCAGCAA 



AACUUGCU CUGAUGAG X CGAA AUGAAAUG 



CATTTCAT C AGCAAGTT 



UUCUUCCA CUGAUGAG X CGAA ACUUGCUG 



CAGCAAGT T TGGAAGAA 



GUUCUUCC CUGAUGAG X CGAA AACUUGCU 



AGCAAGTT T GGAAGAAC 



GCAGGAAA CUGAUGAG X CGAA AUGUGGGG 



CCCCACAT T TTTCCTGC 



CGCAGGAA CUGAUGAG X CGAA AAUGUGGG 



CCCACATT T TTCCTGCG 



GCGCAGGA CUGAUGAG X CGAA AAAUGUGG 



CCACATTT T TCCTGCGC 



CGCGCAGG CUGAUGAG X CGAA AAAAUGUG 



CACATTTT T CCTGCGCG 



ACGCGCAG CUGAUGAG X CGAA AAAAAUGU 



ACATTTTT C CTGCGCGT 



UCAGAGAU CUGAUGAG X CGAA ACGCGCAG 



CTGCGCGT C ATCTCTGA 



GUGUCAGA CUGAUGAG X CGAA AUGACGCG 



CGCGTCAT C TCTGACAC 



CCGUGUCA CUGAUGAG X CGAA AGAUGACG 



CGTCATCT C TGACACGG 



AGCAGAGG CUGAUGAG X CGAA AGGCCGUG 



CACGGCCT C CCTCTGCT 



GAGUAGCA CUGAUGAG X CGAA AGGGAGGC 



GCCTCCCT C TGCTACTC 



GGAUGGAG CUGAUGAG X CGAA AGCAGAGG 



UCAGGAUG CUGAUGAG X CGAA AGUAGCAG 



CTGCTACT C CATCCTGA 



GCUUUCAG CUGAUGAG X CGAA AUGGAGUA 



TACTCCAT C CTGAAAGC 



CCCCCAGC CUGAUGAG X CGAA ACAUCCCU 



AGGGATGT C C 



GGAGGGCA CUGAUGAG X C 



i AGGGCCGG 



CCGGCCCT C TGCCCTCC 



CGGCCUCG CUGAUGAG X CGAA AGGGCAGA 



TCTGCCCT C CGAGGCCG 



UGAGCAGG CUGAUGAG X CGAA AUGCUUGG 



CCAAGCAT T CCTGCTCA 



UUGAGCAG CUGAUGAG X CGAA AAUGCUUG 



CAAGCATT C CTGCTCAA 



GUCAGCUU CUGAUGAG X CGAA AGCAGGAA 



TTCCTGCT C AAGCTGAC 



ACGGUGUC CUGAUGAG X CGAA AGUCAGCU 



AGCTGACT C GACACCGT 



ACGUAGGU CUGAUGAG X CGAA ACACGGUG 



CACCGTGT C ACCTACGT 



GUGGCACG CUGAUGAG X CGAA AGGUGACA 



GACCCCAG CUGAUGAG X CGAA AGUGGCAC 



TGTCACCT 
GTGCCACT 



A CGTGCCAC 
C CTGGGGTC 



UCCUGAGU CUGAUGAG X CGAA ACCCCAGG 



CCTGGGGT C ACTCAGGA 



GCUGUCCU CUGAUGAG X CGAA AGUGACCC 



GGGTCACT C AGGACAGC 



GAGCUUCC CUGAUGAG X CGAA ACUCAGCU 



AGCTGAGT C GGAAGCTC 



GUCCCCGG CUGAUGAG X CGAA AGCUUCCG 



CGGAAGCT C CCGGGGAC 



UGAAGUCU CUGAUGAG X CGAA AGGGCAGU 



ACTGCCCT C AGACTTCA 



UGGUCUUG CUGAUGAG X CGAA AGUCUGAG 



CTCAGACT T CAAGACCA 



AUGGUCUU CUGAUGAG X CGAA AAGUCUGA 



TCAGACTT C AAGACCAT 



CAGUCCAG CUGAUGAG X CGAA AUGGUCUU 



AAGACCAT C CTGGACTG 



CCCGGCGU CUGAUGAG X CGAA ACAGGGCU 



AGCCCTGT C ACGCCGGG 



GGGACGUA CUGAUGAG X CGAA AGCCCGGC 



GCCGGGCT C TACGTCCC 
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CUGGGACG CUGAUGAG X CGAA AGAGCCCG 



CGGGCTCT A CGTCCCAG 



CUCCCUGG CUGAUGAG X CGAA ACGUAGAG 



CTCTACGT C CCAGGGAG 



AGGCCUCA CUGAUGAG X CGAA ACUCCCAG 



CTGGGAGT C TGAGGCCT 



CUCGGCCA CUGAUGAG X CGAA ACACUCAC 



GTGAGTGT T TGGCCGAG 



CCUCGGCC CUGAUGAG X CGAA AACACUCA 



TGAGTGTT T GGCCGAGG 



UUCAGCCG CUGAUGAG ) 



CTGCATGT C CGGCTGAA 



CUCAGCCG CUGAUGAG X CGAA ACACUCAG 



CTGAGTGT C CGGCTGAG 



CUUGGCUG CUGAUGAG X CGAA ACACUCGC 



GCGAGTGT C CAGCCAAG 



GUGUGCUG CUGAUGAG X CGAA ACACUCAG 



CTGAGTGT C CAGCACAC 



GAAGUGAA CUGAUGAG X CGAA ACGGCAGG 



CCTGCCGT C TTCACTTC 



GGGAAGUG CUGAUGAG X CGAA AGACGGCA 



TGCCGTCT T CACTTCCC 



GGGGAAGU CUGAUGAG X CGAA AAGACGGC 



GCCGTCTT C ACTTCCCC 



CUGUGGGG CUGAUGAG X CGAA AGUGAAGA 



TCTTCACT T CCCCACAG 



CCUGUGGG CUGAUGAG X CGAA AAGUGAAG 



CTTCACTT C CCCACAGG 



GUGGAGCC CUGAUGAG X CGAA AGCGCCAG 



CTGGCGCT C GGCTCCAC 



CUGGGGUG CUGAUGAG X CGAA AGCCGAGC 



GCTCGGCT C CACCCCAG 



GUGAGGAA CUGAUGAG X CGAA AGCUGGCC 



GGCCAGCT T TTCCTCAC 



GGUGAGGA CUGAUGAG X CGAA AAGCUGGC 



GCCAGCTT T TCCTCACC 



UGGUGAGG CUGAUGAG X CGAA AAAGCUGG 



CCAGCTTT T CCTCACCA 



CUGGUGAG CUGAUGAG X CGAA AAAAGCUG 



CAGCTTTT C CTCACCAG 



CUCCUGGU CUGAUGAG X CGAA AGGAAAAG 



CTTTTCCT C ACCAGGAG 



GGGAGUGG CUGAUGAG X CGAA AGCCGGGC 



GCCCGGCT T CCACTCCC 



GGGGAGUG CUGAUGAG X CGAA AAGCCGGG 



CCCGGCTT C CACTCCCC 



UAUGUGGG CUGAUGAG X CGAA AGUGGAAG 



CTTCCACT C CCCACATA 



ACUAUUCC CUGAUGAG X CGAA AUGUGGGG 



CCCCACAT A GGAATAGT 



GGAUGGAC CUGAUGAG X CGAA AUUCCUAU 



ATAGGAAT A GTCCATCC 



UGGGGAUG CUGAUGAG X CGAA ACUAUUCC 



GGAATAGT C CATCCCCA 



AAUCUGGG .CUGAUGAG X CGAA AUGGACUA 



TAGTCCAT C CCCAGATT 



CAAUGGCG CUGAUGAG X CGAA AUCUGGGG 



CCCCAGAT T CGCCATTG 



ACAAUGGC CUGAUGAG X CGAA AAUCUGGG 



CCCAGATT C GCCATTGT 



GGGUGAAC CUGAUGAG X CGAA AOGGCGAA 



TTCGCCAT T GTTCACCC 



GAGGGGUG CUGAUGAG X CGAA ACAAUGGC 



GCCATTGT T CACCCCTC 



CGAGGGGU CUGAUGAG X CGAA AACAAUGG 



CCATTGTT C ACCCCTCG 



GGCAGGGC CUGAUGAG X CGAA AGGGGUGA 



TCACCCCT C GCCCTGCC 



AGGCAAAG CUGAUGAG X CGAA AGGGCAGG 



CCTGCCCT C CTTTGCCT 



GGAAGGCA CUGAUGAG X CGAA AGGAGGGC 



GCCCTCCT T TGCCTTCC 



UGGAAGGC CUGAUGAG X CGAA AAGGAGGG 



CCCTCCTT T GCCTTCCA 



GGGGGUGG CUGAUGAG X CGAA AGGCAAAG 



CTTTGCCT T CCACCCCC 



UGGGGGUG CUGAUGAG X CGAA AAGGCAAA 



TTTGCCTT C CACCCCCA 



UCCACCUG CUGAUGAG X CGAA AUGGUGGG 



CCCACCAT C CAGGTGGA 



AAUUCCCA CUGAUGAG X CGAA AGCUCCCA 



TGGGAGCT C TGGGAATT 



UCACUCCA CUGAUGAG X CGAA AUUCCCAG 



CTGGGAAT T TGGAGTGA 



GUCACUCC CUGAUGAG X CGAA AAUUCCCA 



TGGGAATT T GGAGTGAC 



CGCCUGUG CUGAUGAG X CGAA ACAGGGCA 



TGCCCTGT A CACAGGCG 



CCCACAGG CUGAUGAG X CGAA ACCCCCAU 



ATGGGGGT C CCTGTGGG 



CCCAAUUU CUGAUGAG X CGAA ACCCACAG 



CTGTGGGT C AAATTGGG 



CUCCCCCC CUGAUGAG X CGAA AUUUGACC 



GGTCAAAT T GGGGGGAG 
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CAGUAUUU CUGAUGAG X CGAA ACUCCCAC 



GTGGGAGT A AAATACTG 



AUAUUCAG CUGAUGAG X CGAA AUUUUACU 



AGTAAAAT A CTGAATAT 



AACUCAUA CUGAUGAG X CGAA AUUCAGUA 



TACTGAAT A TATGAGTT 



AAAACUCA CUGAUGAG X CGAA AUAUUCAG 



CTGAATAT A TGAGTTTT 



AACUGAAA CUGAUGAG X CGAA ACUCAUAU 



ATATGAGT T TTTCAGTT 



AAACUGAA CUGAUGAG X CGAA AACUCAUA 



TATGAGTT T TTCAGTTT 



AAAACUGA CUGAUGAG X CGAA AAACUCAU 



ATGAGTTT T TCAGTTTT 



CAAAACUG CUGAUGAG X CGAA AAAACUCA 



TGAGTTTT T CAGTTTTG 



UCAAAACU CUGAUGAG X CGAA AAAAACUC 



GAGTTTTT C AGTTTTGA 



UUUUUCAA CUGAUGAG X CGAA ACUGAAAA 



TTTTCAGT T TTGAAAAA 



UUUUUUCA CUGAUGAG X CGAA AACUGAAA 



TTTCAGTT T TGAAAAAA 



UUUUUUUC CUGAUGAG X CGAA AAACUGAA 



TTCAGTTT T GAAAAAAA 



Stem Length = 8 - Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II 
sequence and length (greater than or equal to 2 base-pairs)) 

Seql = TERT (Homo sapiens telomerase reverse transcriptase (TERT) mRNA, 4015 bp); Nakamura et 
al., Science 277 (5328), 955-959 (1997) 
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Table 14: Human telomerase reverse transcriptase (TERT) NCH Ribozyme and 
Target Sequence 



nt 

Position 


Ribozyme Sequence 


Seq ID 

Nos 


Substrate Sequence 


Seq ID 

Nos 


14 


GCGCAGCA CUGAUGAG X CGAA IACGCAGC 




GCTGCGTC C TGCTGCGC 




15 


UGCGCAGC CUGAUGAG X CGAA IGACGCAG 




CTGCGTCC T GCTGCGCA 




18 


ACGUGCGC CUGAUGAG X CGAA ICAGGACG 




CGTCCTGC T GCGCACGT 




23 


UUCCCACG CUGAUGAG X CGAA ICGCAGCA 




TGCTGCGC A CGTGGGAA 




34 


GGGGCCAG CUGAUGAG X CGAA ICUUCCCA 




TGGGAAGC C CTGGCCCC 




35 


CGGGGCCA CUGAUGAG X CGAA IGCUUCCC 




GGGAAGCC C TGGCCCCG 




36 


CCGGGGCC CUGAUGAG X CGAA IGGCUUCC 




GGAAGCCC T GGCCCCGG 




40 


GUGGCCGG CUGAUGAG X CGAA ICCAGGGC 




GCCCTGGC C CCGGCCAC 




41 


GGUGGCCG CUGAUGAG X CGAA IGCCAGGG 




CCCTGGCC C CGGCCACC 




42 


GGGUGGCC CUGAUGAG X CGAA IGGCCAGG 




CCTGGCCC C GGCCACCC 




46 


GCGGGGGU CUGAUGAG X CGAA ICCGGGGC 




GCCCCGGC C ACCCCCGC 




47 


CGCGGGGG CUGAUGAG X CGAA IGCCGGGG 




CCCCGGCC A CCCCCGCG 




49 


AUCGCGGG CUGAUGAG X CGAA IUGGCCGG 




CCGGCCAC C CCCGCGAT 




50 


CAUCGCGG CUGAUGAG X CGAA IGUGGCCG 




CGGCCACC C CCGCGATG 




51 


GCAUCGCG CUGAUGAG X CGAA IGGUGGCC 




GGCCACCC C CGCGATGC 




52 


GGCAUCGC CUGAUGAG X CGAA IGGGUGGC 




GCCACCCC C GCGATGCC 




SO 


GAGCGCGC CUGAUGAG X CGAA ICAUCGCG 




CGCGATGC C GCGCGCTC 




67 


CAGCGGGG CUGAUGAG X CGAA ICGCGCGG 




CCGCGCGC T CCCCGCTG 




69 


GGCAGCGG CUGAUGAG X CGAA IAGCGCGC 




GCGCGCTC C CCGCTGCC 




70 


CGGCAGCG CUGAUGAG X CGAA IGAGCGCG 




CGCGCTCC C CGCTGCCG 




71 


UCGGCAGC CUGAUGAG X CGAA IGGAGCGC 




GCGCTCCC C GCTGCCGA 




74 


GGCUCGGC CUGAUGAG X CGAA I CGGGGAG 




CTCCCCGC T GCCGAGCC 




77 


CACGGCUC CUGAUGAG X CGAA I CAGCGGG 




CCCGCTGC C GAGCCGTG 




82 


GAGCGCAC CUGAUGAG X CGAA ICUCGGCA 




TGCCGAGC C GTGCGCTC 




89 


CAGCAGGG CUGAUGAG X CGAA ICGCACGG 




CCGTGCGC T CCCTGCTG 




91 


CGCAGCAG CUGAUGAG X CGAA IAGCGCAC 




GTGCGCTC C CTGCTGCG 




92 


GCGCAGCA CUGAUGAG X CGAA I GAGCGCA 




TGCGCTCC C TGCTGCGC 




93 


UGCGCAGC CUGAUGAG X CGAA IGGAGCGC 




GCGCTCCC T GCTGCGCA 




96 


GGCUGCGC CUGAUGAG X CGAA I CAGGGAG 




CTCCCTGC T GCGCAGCC 




101 


GUAGUGGC CUGAUGAG X CGAA ICGCAGCA 




TGCTGCGC A GCCACTAC 




104 


GCGGUAGU CUGAUGAG X CGAA ICUGCGCA 




TGCGCAGC C ACTACCGC 




105 


CGCGGUAG CUGAUGAG X CGAA IGCUGCGC 




GCGCAGCC A CTACCGCG 




107 


CUCGCGGU CUGAUGAG X CGAA IUGGCUGC 




GCAGCCAC T ACCGCGAG 




110 


CACCUCGC CUGAUGAG X CGAA IUAGUGGC 




GCCACTAC C GCGAGGTG 




120 


CCAGCGGC CUGAUGAG X CGAA ICACCUCG 




CGAGGTGC T GCCGCTGG 




123 


UGGCCAGC CUGAUGAG X CGAA ICAGCACC 




GGTGCTGC C GCTGGCCA 




126 


ACGUGGCC CUGAUGAG X CGAA ICGGCAGC 




GCTGCCGC T GGCCACGT 




130 


ACGAACGU CUGAUGAG X CGAA ICCAGCGG 




CCGCTGGC C ACGTTCGT 




131 


CACGAACG CUGAUGAG X CGAA IGCCAGCG 




CGCTGGCC A CGTTCGTG 




146 


GGGCCCCA CUGAUGAG X CGAA ICGCCGCA 




TGCGGCGC C TGGGGCCC 




147 


GGGGCCCC CUGAUGAG X CGAA IGCGCCGC 




GCGGCGCC T GGGGCCCC 




153 


AGCCCUGG CUGAUGAG X CGAA ICCCCAGG 




CCTGGGGC C CCAGGGCT 




154 


CAGCCCUG CUGAUGAG X CGAA IGCCCCAG 




CTGGGGCC C CAGGGCTG 
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CCAGCCCU CUGAUGAG X CGAA IGGCCCCA 



GCCAGCCC CUGAUGAG X CGAA IGGGCCCC 



CAGCCGCC CUGAUGAG X CGAA ICCCUGGG 



GCUGCACC CUGAUGAG X CGAA ICCGCCAG 



CCCCGCGC CUGAUGAG X CGAA ICACCAGC" 



AGCCGCCG CUGAUGAG X CGAA IUCCCCC-C 



AAGCCGCC CUGAUGAG X CGAA IGUCCCCG 



GCGCGGAA CUGAUGAG X CGAA ICCGCCGG 



CAGCGCGC CUGAUGAG X CGAA IAAAGCCG 



GGGCCACC CUGAUGAG X CGAA ICGCGCGG~ 



AGGCACUG CUGAUGAG X CGAA ICCACCAG 



CAGGCACU CUGAUGAG X CGAA IGCCACCA 



CCAGGCAC CUGAUGAG X CGAA IGGCCACC 



GCACACCA CUGAUGAG X CGAA ICACUGGG 



CGCACACC CUGAUGAG X CGAA IGCACUGG 



CGUCCCAG CUGAUGAG X CGAA ICACGCAC 



GCGUCCCA CUGAUGAG X CGAA IGCACGCA 



UGCGUCCC CUGAUGAG X CGAA IGGCACGC 



GGCGGCCG CUGAUGAG X CGAA ICGUCCCA 



CGGGGGGC CUGAUGAG X CGAA ICCGUGCG 



CGGCGGGG CUGAUGAG X CGAA ICGGCCGU 



GCGGCGGG CUGAUGAG X CGAA IGCGGCCG 



GGCGGCGG CUGAUGAG X CGAA IGGCGGCC 



GGGCGGCG CUGAUGAG X CGAA IGGGCGGC 



GGGGCGGC CUGAUGAG X CGAA IGGGGCGG 



GAGGGGGC CUGAUGAG X CGAA I CGGGGGG 



AAGGAGGG CUGAUGAG X CGAA I CGGCGGG 



GAAGGAGG CUGAUGAG X CGAA IGCGGCGG 



GGAAGGAG CUGAUGAG X CGAA IGGCGGCG 



CGGAAGGA CUGAUGAG X CGAA IGGGCGGC 



GCGGAAGG CUGAUGAG X CGAA IGGGGCGG 



UGGCGGAA CUGAUGAG X CGAA IAGGGGGC 



CUGGCGGA CUGAUGAG. X CGAA IGAGGGGG 



CACCUGGC CUGAUGAG X CGAA IAAGGAGG 



GGACACCU CUGAUGAG X CGAA I CGGAAGG 



AGGACACC CUGAUGAG X CGAA IGCGGAAG 



UUCAGGCA CUGAUGAG X CGAA IACACCUG 



CUUCAGGC CUGAUGAG X CGAA IGACACCU 



CUCCUUCA CUGAUGAG X CGAA ICAGGACA 



GCUCCUUC CUGAUGAG X CGAA IGCAGGAC" 



GGGCCACC CUGAUGAG X CGAA ICUCCUUC 



AGCACUCG CUGAUGAG X CGAA" ICCACCAG 



CAGCACUC CUGAUGAG X CGAA IGCCACCA 



GCCUCUGC CUGAUGAG X CGAA ICACUCGG 



ACAGCCUC CUGAUGAG X CGAA ICAGCACU 



GCUCGCAC CUGAUGAG X CGAA ICCUCUGC 



CGAAGGCC CUGAUGAG X CGAA ICACGUUC 



TGGGGCCC C AGGGCTGG 



GGGGCCCC A GGGCTGGC 



CCCAGGGC T GGCGGCTG 



CTGGCGGC T GGTGCAGC 



GCTGGTGC A GCGCGGGG 



GCGGGGAC C CGGCGGCT 



CGGGGACC C GGCGGCTT 



CCGGCGGC T TTCCGCGC 



CGGCTTTC C GCGCGCTG 



CCGCGCGC T GGTGGCCC 



CTGGTGGC C CAGTGCCT 



TGGTGGCC C AGTGCCTG 
GGTGGCCC A GTGCCTGG 



CCCAGTGC C TGGTGTGC 



CCAGTGCC T GGTGTGCG 



GTGCGTGC C CTGGGACG 



TGCGTGCC C TGGGACGC 



GCGTGCCC T GGGACGCA 



TGGGACGC A CGGCCGCC 



CGCACGGC C GCCCCCCG 



ACGGCCGC C CCCCGCCG 



CGGCCGCC C CCCGCCGC 



GGCCGCCC C CCGCCGCC 



GCCGCCCC C CGCCGCCC 



CCGCCCCC C GCCGCCCC 



CCCCCCGC C GCCCCCTC 



CCCGCCGC C CCCTCCTT 



CCGCCGCC C CCTCCTTC 



CGCCGCCC C CTCCTTCC 



GCCGCCCC C TCCTTCCG 



CCGCCCCC T CCTTCCGC 



GCCCCCTC C TTCCGCCA 



CCCCCTCC T TCCGCCAG 



CCTCCTTC C GCCAGGTG 



CCTTCCGC C AGGTGTCC 



CTTCCGCC A GGTGTCCT 



CAGGTGTC C TGCCTGAA 



AGGTGTCC T GCCTGAAG 



TGTCCTGC C TGAAGGAG 



GTCCTGCC T GAAGGAGC 



GAAGGAGC T GGTGGCCC 
CTGGTGGC C CGAGTGCT 
TGGTGGCC C GAGTGCTG 



CCGAGTGC T GCAGAGGC 



AGTGCTGC A GAGGCTGT 



GCAGAGGC T GTGCGAGC 



GAACGTGC T GGCCTTCG 
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Table 14 





AAGCCGAA CUGAUGAG X CGAA ICCAGCAC 




GTGCTGGC C TTCGGCTT 




350 


GAAGCCGA CUGAUGAG X CGAA IGCCAGCA 




TGCTGGCC T TCGGCTTC 
CCTTCGGC T TCGCGCTG 




363 


CAGCGCGA CUGAUGAG X CGAA ICCGAAGG 
CGUCCAGC CUGAUGAG X CGAA ICGCGAAG 




CTTCGCGC T GCTGGACG 







CCCCGUCC CUGAUGAG X CGAA ICAGCGCG 




CGCGCTGC T GGACGGGG 




376 


CCCCCGCG CUGAUGAG X CGAA ICCCCGUC 




GACGGGGC C CGCGGGGG 
ACGGGGCC C GCGGGGGC 




377 
386 


"GCCCCCGC CUGAUGAG X CGAA IGCCCCGU 
CUCGGGGG CUGAUGAG X CGAA ICCCCCGC 




GCGGGGGC C CCCCCGAG 


— 


387 


CCUCGGGG CUGAUGAG X CGAA IGCCCCCG 




CGGGGGCC C CCCCGAGG 
GGGGGCCC C CCCGAGGC 




388 

389 


GCCUCGGG CUGAUGAG X CGAA IGGCCCCC 
GGCCUCGG CUGAUGAG X CGAA IGGGCCCC 




" GGGGCCCC C CCGAGGCC 
GGGCCCCC C CGAGGCCT 




390 


AGGCCUCG CUGAUGAG X CGAA IGGGGCCC 
AAGGCCUC CUGAUGAG X CGAA IGGGGGCC 




GGCCCCCC C GAGGCCTT 
CCCGAGGC C TTCACCAC 





397 
398 


GUGGUGAA CUGAUGAG X CGAA ICCUCGGG 
GGUGGUGA CUGAUGAG X CGAA IGCCUCGG 




CCGAGGCC T TCACCACC 




401 


GCUGGUGG CUGAUGAG X CGAA IAAGGCCU 









403 


ACGCUGGU CUGAUGAG X CGAA IUGAAGGC 




GCCTTCAC C ACCAGCGT 





404 


CACGCUGG CUGAUGAG X CGAA IGUGAAGG 




CCTTCACC A CCAGCGTG 




406 


CGCACGCU CUGAUGAG X CGAA IUGGUGAA 




TTCACCAC ^ ^I^cgc 




407 


GCGCACGC CUGAUGAG X CGAA IGUGGUGA 




TCACCACC A ^ CTACCTQ 




416 


CAGGUAGC CUGAUGAG X CGAA ICGCACGC 




TGCGCAGC T ACCTGCCC 




419 
422 


GGGCAGGU CUGAUGAG X CGAA ICUGCGCA 
GUUGGGCA CUGAUGAG X CGAA IUAGCUGC 




GCAGCTAC C TGCCCAAC 




423 


UGUUGGGC CUGAUGAG X CGAA IGUAGCUG 








426 


CCGUGUUG CUGAUGAG X CGAA ICAGGUAG 




CTACCTGC C CAACACGG 




427 


' ACCGUGUU CUGAUGAG X CGAA IGCAGGUA 




TACCTGCC C AACACGGT 






CACCGUGU CUGAUGAG X CGAA IGGCAGGU 




ACCTGCCC A ACACGGTG 




431 


' GGUCACCG CUGAUGAG X CGAA IUUGGGCA 




TGCCCAAC A CGGTGACC 




439 


AGUGCGUC CUGAUGAG X CGAA IUCACCGU 




ACGGTGAC C GACGCACT 




445 


CCCCGCAG CUGAUGAG X CGAA ICGUCGGU 




ACCGACGC A CTGCGGGG 




447 


UCCCCCGC CUGAUGAG X CGAA IUGCGUCG 
" GCAGCAGC CUGAUGAG X CGAA ICCCCCAC 




CGACGCAC T GCGGGGGA 
GTGGGGGC T GCTGCTGC 




474 
477 
482 


GGCGCAGC CUGAUGAG X CGAA ICAGCCCC 
CGCGGCGC CUGAUGAG X CGAA ICAGCAGC 
. GCCCACGC CUGAUGAG X CGAA ICGCAGCA 
GGUGAACC CUGAUGAG X CGAA ICACGUCG 




GGGGCTGC T GCTGCGCC 
GCTGCTGC T GCGCCGCG 
TGCTGCGC C GCGTGGGC 
CGACGTGC T GGTTCACC 





501 
507 
509 


" CCAGCAGG CUGAUGAG X CGAA IAACCAGC 
" UGCCAGCA CUGAUGAG X CGAA IUGAACCA 




GCTGGTTC A CCTGCTGG 
TGGTTCAC C TGCTGGCA 




513 


GUGCCAGC CUGAUGAG X CGAA IGUGAACC 
AGCGUGCC CUGAUGAG X CGAA ICAGGUGA 




GGTTCACC T GCTGGCAC 
TCACCTGC T GGCACGCT 
CTGCTGGC A CGCTGCGC 




517 
521 
528 


GCGCAGCG CUGAUGAG X CGAA ICCAGCAG 
GAGCGCGC CUGAUGAG X CGAA ICGUGCCA 
" GCACAAAG CUGAUGAG X CGAA ICGCGCAG 




TGGCACGC T GCGCGCTC 
CTGCGCGC T CTTTGTGC 




530 


CAGCACAA CUGAUGAG X CGAA IAGCGCGC 




GCGCGCTC T TTGTGCTG 




537 
544 
546 


GAGCCACC CUGAUGAG X CGAA ICACAAAG 
' CAGCUGGG CUGAUGAG X CGAA ICCACCAG 
~ CGCAGCUG CUGAUGAG X CGAA IAGCCACC 




CTTTGTGC T GGTGGCTC 
CTGGTGGC T CCCAGCTG 
GGTGGCTC C CAGCTGCG 
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"GCGCAGCU CUGAUGAG X CGAA IGAGCCAC~ 

"ggcgcagc cugaugagTc cgaa iggagcca 



GUAGGCGC CUGAUGAG X CGAA ICUGGGAG 



ACCUGGUA CUGAUGAG X C GAA ICGCAGCU 



CACCUGGU CUGAUGAG X CGAA IGCGCAGC 



GCACACCU CUGAUGAG X CGAA~~rUAGGCGC 



CGCACACC CUGAUGAG X CGAA IGUAGGCG 



ACAGCGGC CUGAUGAG X CGAA ICCCGCAC 



"GGUACAGC CUGAUGAG X CGAA ICGGCCCG 



GCUGGUAC CUGAUGAG X CGAA ICGGCGGC 



GCCGAGCU CUGAUGAG X CGAA IUACAGCG 



CGCCGAGC CUGAUGAG X CGAA IGUACAGC 



CAGCGCCG CUGAUGAG X CGAA ICUGGUAC 



UGAGUGGC CUGAUGAG X CGAA ICGCCGAG 



GCCUGAGU CUGAUGAG X CGAA ICAGCGCC 



GGCCUGAG CUGAUGAG X CGAA 



"CGGGCCUG CUGAUGAG X CGAA IUGGCAGC 



GCCGGGCC CUGAUGAG X CGAA IAGUGGCA 



GGGGGCCG CUGAUGAG X CGAA ICC UGAGU 



CGGGGGCC CUGAUGAG X CGAA IGCCUGAG 



GUGGCGGG CUGAUGAG X CGAA ' 



UGUGGCGG CUGAUGAG X CGAA IGCCGGGC 



GUGUGGCG CUGAUGAG X CGAA IGGCCGGG 



' CGUGUGGC CUGAUGAG X CGAA IGGGCCGG 



UAGCGUGU CUGAUGAG X CGAA I CGGGGGC 



"CUAGCGUG CUGAUGAG X CGAA IGCGGGGG 



CACUAGCG CUGAUGAG X CGAA IUGGCGGG 



GGUCCACU CUGAUGAG X CGAA ICGUGUGG 



GCCUUCGG CUGAUGAG X CGAA IUCCACUA 



CGCCUUCG CUGAUGAG X CGAA IGUCCACU 



"ACGCCUUC CUGAUGAG X CGAA IGGUCCAC 



CGCAUCCC CUGAUGAG X CGAA IACGCCUU 



UGGUUCCA CUGAUGAG X CGAA ICCCGUUC 



"AUGGUUCC CUGAUGAG X CGAA IGCCCGUU 



GACGCUAU CUGAUGAG X CGAA IUUCCAGG 



"UGACGCUA CUGAUGAG X CGAA IGUUCCAG 



GGCCUCCC CUGAUGAG X CGAA IACGCUAU 



GGGACCCC CUGAUGAG X CGAA ICCUCCCU 



GCCCAGGG CUGAUGAG X CGAA IACCCCGG~ 



GGCCCAGG CUGAUGAG X CGAA IGACCCCG 



AGGCCCAG CUGAUGAG X CGAA IGGACCCC 



CAGGCCCA CUGAUGAG X CGAA IGGGACCC 



GCAGGCCC CUGAUGAG X CGAA IGGGGACC 



GGCUGGCA CUGAUGAG X CGAA ICCCAGGG 



GGGCUGGC CUGAUGAG X CGAA IGCCCAGG 



CCGGGGCU CUGAUGAG X CGAA ICAGGCCC 



CCCGGGGC CUGAUGAG X CGAA IGCAGGCC 



GTGGCTCC C AGCTGCGC 



TGGCTCCC A GCTGCGCC 



CTCCCAGC T GCGCCTAC^ 



AGCTGCGC C TACCAGGT 



GCTGCGCC T ACCAGGTG 



GCGCCTAC C AGGTGTGC 



CGCCTACC A GGTGTGCG 



GTGCGGGC C GCCGCTGT 



CGGGCCGC C GCTGTACC 



GCCGCCGC T GTACCAGC 



CGCTGTAC C AGCTCGGC 



GCTGTACC A GCTCGGCG 



GTACCAGC T CGGCGCTG 



CTCGGCGC T GCCACTCA 



GGCGCTGC C ACTCAGGC 



GCGCTGCC A CTCAGGCC 



GCTGCCAC T CAGGCCCG 



TGCCACTC A GGCCCGGC 



ACTCAGGC C CGGCCCCC 



CTCAGGCC C GGCCCCCG 



GGCCCGGC C CCCGCCAC 



GCCCGGCC C CCGCCACA 



CCCGGCCC C CGCCACAC 



CCGGCCCC C GCCACACG 



GCCCCCGC C ACACGCTA 



CCCCCGCC A CACGCTAG 



CCCGCCAC A CGCTAGTG 



CCACACGC T AGTGGACC 



TAGTGGAC C CCGAAGGC 



AGTGGACC C CGAAGGCG 



GTGGACCC C GAAGGCGT 



AAGGCGTC T GGGATGCG 



GAACGGGC C TGGAACCA 



AACGGGCC T GGAACCAT 



CCTGGAAC C ATAGCGTC 



CTGGAACC A TAGCGTCA 



ATAGCGTC A GGGAGGCC 



AGGGAGGC C GGGGTCCC 



CCGGGGTC C CCCTGGGC 



CGGGGTCC C CCTGGGCC 



GGGGTCCC C CTGGGCCT 



GGGTCCCC C TGGGCCTG 



GGTCCCCC T GGGCCTGC 



CCCTGGGC C TGCCAGCC 



CCTGGGCC T GCCAGCCC 



GGGCCTGC C AGCCCCGG 



GGCCTGCC A GCCCCGGG 
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GCACCCGG CUGAUGAG X CGAA ICUGGCAG 



CGCACCCG" CUGAUGAG X CGAA IGCUGGCA 



UCGCACCC CUGAUGAG X CGAA IGGCUGGC 



GCUGGCAC CUGAUGAG X CGAA ICCCCCGC 



CUUCGGCU CUGAUGAG X CGAA ICACUGCC 



ACUUCGGC CUGAUGAG X CGAA IGCACUGC 



CAGACUUC CUGAUGAG X CGAA ICUGGCAC 



GCAACGGC CUGAUGAG X CGAA IACUUCGG 



UGGGCAAC CUGAUGAG X CGAA ICAGACUU 



GCCUCUUG CUGAUGAG X CGAA ICAACGGC 



GGCCUCUU CUGAUGAG X CGAA IGCAACGG 



GGGCCUCU CUGAUGAG X CGAA IGGCAACG 



CACGCCUG CUGAUGAG X CGAA ICCUCUUG 



CCACGCCU CUGAUGAG X CGAA IGCCUCUU 



"GCCACGCC CUGAUGAG X CGAA IGGCCUCU 



UCAGGGGC CUGAUGAG X CGAA ICGCCACG 



GGCUCAGG CUGAUGAG X CGAA ICAGCGCC 



"CGGCUCAG CUGAUGAG X CGAA IGCAGCGC 



CCGGCUCA CUGAUGAG X CGAA IGGCAGCG 



UCCGGCUC CUGAUGAG X CGAA IGGGCAGC 



UCCGCUCC CUGAUGAG X CGAA ICUCAGGG 



GCCCAACG CUGAUGAG X CGAA ICGUCCGC 



UGCCCAAC CUGAUGAG X CGAA IGCGUCCG 



' AGGACCCC CUGAUGAG X CGAA ICCCAACG 



UGGGCCCA CUGAUGAG X CGAA IACCCCUG 



GUGGGCCC CUGAUGAG X CGAA IGACCCCU 



CCCGGGUG CUGAUGAG X CGAA ICCCAGGA 



GCCCGGGU CUGAUGAG X CGAA IGCCCAGG 



UGCCCGGG CUGAUGAG X CGAA IGGCCCAG 



CCUGCCCG CUGAUGAG X CGAA IUGGGCCC 



UCCUGCCC CUGAUGAG X CGAA IGUGGGCC 



ACGCGUCC CUGAUGAG X CGAA ICCCGGGU 



GGUCACUC CUGAUGAG X CGAA IUCCACGC 



GAAACCAC CUGAUGAG X CGAA IUCACUCG 



~CACCACAC CUGAUGAG X CGAA IAAACCAC 



CUGGCAGG CUGAUGAG X CGAA IACACCAC 



GUCUGGCA CUGAUGAG X CGAA IUGACACC" 



GGUCUGGC CUGAUGAG X CGAA IGUGACAC 



GCGGGUCU CUGAUGAG X CGAA ICAGGUGA 



GGCGGGUC CUGAUGAG X CGAA IGCAGGUG 



CUUCGGCG CUGAUGAG X CGAA IUCUGGCA 



UCUUCGGC CUGAUGAG X CGAA IGUCUGGC 



GCUUCUUC CUGAUGAG X CGAA ICGGGUCU 



AAAGAGGU CUGAUGAG X. CGAA ICUUCUUC 



CAAAGAGG CUGAUGAG X CGAA IGCUUCUU 



UCCAAAGA CUGAUGAG X CGAA IUGGCUUC 



CUCCAAAG CUGAUGAG X CGAA IGUGGCUU 



CTGCCAGC C CCGGGTGC 



TGCCAGCC C CGGGTGCG 



GCCAGCCC C GGGTGCGA 



GCGGGGGC A GTGCCAGC 



GGCAGTGC C AGCCGAAG 



GCAGTGCC A GCCGAAGT 



GTGCCAGC C GAAGTCTG 



CCGAAGTC T GCCGTTGC 



AAGTCTGC C GTTGCCCA 



GCCGTTGC C CAAGAGGC 



CCGTTGCC C AAGAGGCC 



CGTTGCCC A AGAGGCCC 



CAAGAGGC C CAGGCGTG 



AAGAGGCC C AGGCGTGG 



AGAGGCCC A GGCGTGGC 



CGTGGCGC T GCCCCTGA 



GGCGCTGC C CCTGAGCC 



GCGCTGCC C CTGAGCCG 



CGCTGCCC C TGAGCCGG 



GCTGCCCC T GAGCCGGA 



CCCTGAGC C GGAGCGGA 



GCGGACGC C CGTTGGGC 



CGGACGCC C GTTGGGCA 



CGTTGGGC A GGGGTCCT 



CAGGGGTC C TGGGCCCA 



AGGGGTCC T GGGCCCAC 



TCCTGGGC C CACCCGGG 



CCTGGGCC C ACCCGGGC 



CTGGGCCC A CCCGGGCA 



GGGCCCAC C CGGGCAGG 



GGCCCACC C GGGCAGGA 



ACCCGGGC A GGACGCGT 



GCGTGGAC C GAGTGACc" 



CGAGTGAC C GTGGTTTC 



GTGGTTTC T GTGTGGTG 



GTGGTGTC A CCTGCCAG 



GGTGTCAC C TGCCAGAC 



GTGTCACC T GCCAGACC 



TCACCTGC C AGACCCGC 



CACCTGCC A GACCCGCC 



TGCCAGAC C CGCCGAAG 



GCCAGACC C GCCGAAGA 



AGACCCGC C GAAGAAGC 



GAAGAAGC C ACCTt 



AAGAAGCC A CCTCTTTG 



GAAGCCAC C TCTTTGGA 



AAGCCACC T CTTTGGAG 
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CCCUCCAA CUGAUGAG X CGAA IAGGUGGC 
UGCCAGAG CUGAUGAG X CG AA ICGCACCC 
CGUGCCAG CUGAUGAG X CGAA IAGCGCAC 



CGCGUGCC CUGAUGAG X CGAA IAGAGCGC 



GUGGCGCG CUGAUGAG X CGAA ICCAGAGA 



GUGGGAGU CUGAUGAG X CGAA ICGCGUGC 



GGUGGGAG CUGAUGAG X CGAA IGCGCGUG 



UGGGUGGG CUGAUGAG X CGAA IUGGCGCG 



GAUGGGUG CUGAUGAG X CGAA IAGUGGCG 



GGAUGGGU CUGAUGAG X CGAA IGAGUGGC 



CGGAUGGG CUGAUGAG X CGAA IGGAGUGG 



5 CUGAUGAG X CGAA IUGGGAGU 



CACGGAUG 

CCACGGAU CUGAUGAG X CGAA IGUGGGAG 



CCCACGGA CUGAUGAG X CGAA IGGUGGGA 



CGGCCCAC CUGAUGAG X CGAA IAUGGGUG 



GUGCUGGC CUGAUGAG X. CGAA ICCCACGG 



GUGGUGCU CUGAUGAG X CGAA ICGGCCCA 



CGUGGUGC CUGAUGAG X CGAA IGCGGCCC 



CCGCGUGG CUGAUGAG X CGAA ICUGGCGG 



GCCCGCGU CUGAUGAG X CGAA IUGCUGGC 



GGCCCGCG CUGAUGAG X CGAA IGUGCUGG 



GGAUGGGG CUGAUGAG X CGAA ICCCGCGU 



UGGAUGGG CUGAUGAG X CGAA IGCCCGCG 



5 CUGAUGAG X CGAA IGGCCCGC 
J CUGAUGAG X CGAA IGGGCCCG 



AUGUGGAU CUGAUGAG X CGAA IGGGGCCC 



GAUGUGGA CUGAUGAG X CGAA IGGGGGCC 



CGCGAUGU CUGAUGAG X CGAA IAUGGGGG 



CCGCGAUG CUGAUGAG X CGAA IGAUGGGG 



GGCCGCGA CUGAUGAG X CGAA IUGGAUGG 



GCCACCTC T TTGGAGGG 
GGGTGCGC T CTCTGGCA 
GTGCGCTC T CTGGCACG 
GCGCTCTC T GGCACGCG 



TCTCTGGC A CGCGCCAC 
~GCACGCGC C ACTCCCAC 



CACGCGCC A CTCCCACC 



CGCGCCAC T CCCACCCA 
CGCCACTC C CACCCATC 



GCCACTCC C ACCCATCC 
CCACTCCC A CCCATCCG 
ACTCCCAC C CATCCGTG 



CTCCCACC C ATCCGTGG 



TCCCACCC A TCCGTGGG 



CACCCATC C GTGGGCCG 



CCGTGGGC C GCCAGCAC 



TGGGCCGC C AGCACCAC 



GGGCCGCC A GCACCACG 



CCGCCAGC A CCACGCGG 



GCCAGCAC C ACGCGGGC 



CCAGCACC A CGCGGGCC 



ACGCGGGC C CCCCATCC 
CGCGGGCC C CCCATCCA 
GCGGGCCC C CCATCCAC 



CGGGCCCC C CATCCACA 



GGGCCCCC C ATCCACAT 



GGCCCCCC A TCCACATC 



CCCCCATC C ACATCGCG 
CCCCATCC A CATCGCGG 



CCATCCAC A TCGCGGCC 



GACGUGGU CUGAUGAG X CGAA ICCGCGAU 



GGACGUGG CUGAUGAG X CGAA IGCCGCGA 



AGGGACGU CUGAUGAG X CGAA IUGGCCGC 



CAGGGACG CUGAUGAG X CGAA IGUGGCCG 
UGUCCCAG CUGAUGAG X CGAA IACGUGGU 



GUGUCCCA CUGAUGAG X CGAA IGACGUGG 
CGUGUCCC CUGAUGAG X CGAA I GGACGUG 
ACAAGGCG CUGAUGAG X CGAA IUCCCAGG 
GGGGACAA CUGAUGAG X CGAA ICGUGUCC 
GGGGGACA CUGAUGAG X CGAA IGCGUGUC 
ACACCGGG CUGAUGAG X CGAA IACAAGGC 
UACACCGG CUGAUGAG X CGAA IGACAAGG 
GUACACCG 

CGUACACC CUGAUGAG X CGAA 
UUGGUCUC CUGAUGAG X CGAA 
AAGUGCUU CUGAUGAG X CG AA IUCUCGGC 
GAAGUGCU CUGAUGAG X CGAA IGUCUCGG 



CUGAUGAG X CGAA IGGACAAG 
IGGGACAA 



ATCGCGGC C ACCACGTC 
TCGCGGCC A CCACGTCC 



GCGGCCAC C ACGTCCCT 



CGGCCACC A CGTCCCTG 



ACCACGTC C CTGGGACA 



CCACGTCC C TGGGACAC 
CACGTCCC T GGGACACG 
CCTGGGAC A CGCCTTGT 
GGACACGC C TTGTCCCC 
GACACGCC T TGTCCCCC 
GCCTTGTC C CCCGGTGT 
CCTTGTCC C CCGGTGTA 
CTTGTCCC C CGGTGTAC 
TTGTCCCC C GGTGTACG 
GTGTACGC C GAGACCAA 
GCCGAGAC C AAGCACTT 
CCGAGACC A AGCACTTC 
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AGAGGAAG CUGAUGAG X CGAA ICUUGGUC 
GUAGAGGA CUGAUGAG X CGAA IUGCUUGG 
GGAGUAGA CUGAUGAG X CGAA IAAGUGCU 
AGGAGUAG CUGAUGAG X CGAA IGAAGUGC 
UGAGGAGU CUGAUGAG X CGAA IAGGAAGU 
GCCUGAGG CUGAUGAG X CGAA IUAGAGGA 
UCGCCUGA CUGAUGAG X CGAA IAGUAGAG 
GUCGCCUG CUGAUGAG X CGAA IGAGUAGA 
UUGUCGCC CUGAUGAG X CGAA IAGGAGUA 
CUGCUCCU CUGAUGAG X CGAA IUCGCCUG 
GCCGCAGC CUGAUGAG X CGAA ICUCCUUG 
AGGGCCGC CUGAUGAG X CGAA ICUGCUCC 
GGAAGGAG CUGAUGAG X CGAA ICCGCAGC 



AGGAAGGA CUGAUGAG X CGAA IGCCGCAG 



UAGGAAGG CUGAUGAG X CGAA IGGCCGCA 



GACCAAGC A CTTCCTCT 
CCAAGCAC T TCCTCTAC 
AGCACTTC C TCTACTCC 
GCACTTCC T CTACTCCT 



TCCTCTAC T CCTCAGGC 
CTCTACTC C TCAGGCGA 
TCTACTCC T CAGGCGAC 
TACTCCTC A GGCGACAA 
CAGGCGAC A AGGAGCAG 
CAAGGAGC A GCTGCGGC 
GGAGCAGC T GCGGCCCT 
GCTGCGGC C CTCCTTCC 



CTGCGGCC C TCCTTCCT 



TGCGGCCC T CCTTCCTA 



AGUAGGAA CUGAUGAG X CGAA IAGGGCCG 



s. CUGAUGAG X CGAA IGAGGGCC 
k CUGAUGAG X CGAA IAAGGAGG 



AGCUGAGU 
GAGAGCUG 



CUGAUGAG X CGAA IGAAGGAG 



CUGAUGAG X CGAA IUAGGAAG 



CAGAGAGC CUGAUGAG X CGAA IAGUAGGA 



CCUCAGAG CUGAUGAG X CGAA ICUGAGUA 



GGCCUCAG CUGAUGAG X CGAA IAGCUGAG 



CGGCCCTC C TTCCTACT 



GGCCCTCC T TCCTACTC 



CCTCCTTC C TACTCAGC 



CTCCTTCC T ACTCAGCT 



CTTCCTAC T CAGCTCTC 



TCCTACTC A GCTCTCTG 



TACTCAGC T CTCTGAGG 



CTCAGCTC T CTGAGGCC 



UGGGCCUC CUGAUGAG X CGAA IAGAGCUG 



CAGCTCTC T GAGGCCCA 



UCAGGCUG CUGAUGAG X CGAA ICCUCAGA 



GUCAGGCU CUGAUGAG X CGAA IGCCUCAG 



AGUCAGGC CUGAUGAG X CGAA IGGCCUCA 



GCCAGUCA CUGAUGAG X CGAA ICUGGGCC 



CGCCAGUC CUGAUGAG X CGAA IGCUGGGC 



TCTGAGGC C CAGCCTGA 



CTGAGGCC C AGCCTGAC 



TGAGGCCC A GCCTGACT 



GGCCCAGC C TGACTGGC 



GCCCAGCC T GACTGGCG 



CGAGCGCC CUGAUGAG X CGAA IUCAGGCU 



AGCCTGAC T GGCGCTCG 



AGCCUCCG CUGAUGAG X CGAA ICGCCAGU 
UCUCCACG CUGAUGAG X CGAA ICCUCCGA 
AGAAAGAU CUGAUGAG X CGAA IUCUCCAC 
CAGAAAGA CUGAUGAG X CGAA IGUCUCCA 
ACCCAGAA CUGAUGAG X CGAA IAUGGUCU 
UGGAACCC CUGAUGAG X CGAA IAAAGAUG 
CAGGGCCU CUGAUGAG X CGAA IAACCCAG 
CCAGGGCC CUGAUGAG X CGAA IGAACCCA 
GCAUCCAG CUGAUGAG X CGAA ICCUGGAA 
GGCAUCCA CUGAUGAG X CGAA IGCCUGGA 
UGGCAUCC CUGAUGAG X CGAA IGGCCUGG 
GAGUCCCU CUGAUGAG X CGAA ICAUCCAG 
GGAGUCCC CUGAUGAG X CGAA IGCAUCCA 
CUGCGGGG CUGAUGAG X CGAA IUCCCUGG 
ACCUGCGG CUGAUGAG X CGAA IAGUCCCU 
AACCUGCG CUGAUGAG X CGAA IGAGUCCC 
CAACCUGC CUGAUGAG X CGAA IGGAGUCC 



T CGGAGGCT 
TCGGAGGC T CGTGGAGA 
GTGGAGAC C ATCTTTCT 
TGGAGACC A TCTTTCTG 
AGACCATC T TTCTGGGT 
CATCTTTC T GGGTTCCA 
CTGGGTTC C AGGCCCTG 
TGGGTTCC A GGCCCTGG 
TTCCAGGC C CTGGATGC 
TCCAGGCC C TGGATGCC 
CCAGGCCC T GGATGCCA 
CTGGATGC C AGGGACTC 
TGGATGCC A GGGACTCC 
CCAGGGAC T CCCCGCAG 
AGGGACTC C CCGCAGGT 
~ GGGACTCC C CGCAGGTT 
GGACTCCC C GCAGGTTG 
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GGGCAACC CUGAUGAG X CGAA I CGGGGAG 
GCAGGCGG CUGAUGAG X CGAA ICAACCUG 



CTCCCCGC A GGTTGCCC 
CAGGTTGC C CCGCCTGC 



1236 
1242 



GGCAGGCG CUGAUGAG X CGAA IGCAACCU 



GGGCAGGC CUGAUGAG X CGAA IGGCAACC 



CUGGGGCA CUGAUGAG X CGAA ICGGGGCA 



GCUGGGGC CUGAUGAG X CGAA IGCGGGGC 



AGCGCUGG CUGAUGAG X CGAA I CAGGCGG 



UAGCGCUG CUGAUGAG X CGAA IGCAGGCG 
GUAGCGCU CUGAUGAG X CGAA I GGCAGGC 
AGUAGCGC CUGAUGAG X CGA A IGGGCAGG 
UUGCCAGU CUGAUGAG X CGAA ICGCUGGG 
CAUUUGCC CUGAUGAG X CGAA IUAGCGCU 
GCCGCAUU CUGAUGAG X CGAA ICCAGUAG 
GAAACAGG CUGAUGAG X CGAA ICCGCAUU 
AGAAACAG CUGAUGAG X CGAA IGCCGCAU 
CAGAAACA CUGAUGAG X CGAA IGGCCGCA 
CCAGAAAC CUGAUGAG X CGA A IGGGCCGC 
GCAGCUCC CUGAUGAG X CGAA IAAACAGG 
UCCCAAGC CUGAUGAG X CGAA ICUCCAGA 
GGUUCCCA CUGAUGAG X CGAA ICAGCUCC 
CUGCGCGU CUGAUGAG X CGAA IUUCCCAA 
ACUGCGCG CUGAUGAG X C GAA IGUUCCCA 
AGGGGCAC CUGAUGAG X CGAA ICGCGUGG 
CCCGUAGG CUGAUGAG X CGAA ICACUGCG 
CCCCGUAG CUGAUGAG X CGAA IGCACUGC 
ACCCCGUA CUGAUGAG X CGAA IGGCACUG 
CACCCCGU CUGAUGAG X CGAA IGGGCACU 
UCUUGAGG CUGAUGAG X CGAA ICACCCCG 
CGUCUUGA CUGAUGAG X CGAA IAGCACCC 
GCGUCUUG CUGAUGAG X CGAA IGAGCACC 
GUGCGUCU CUGAUGAG X CGAA IAGGAGCA 
GCGGGCAG CUGAUGAG X CGAA ICGUCUUG 
CAGCGGGC CUGAUGAG X CGAA IUGCGUCU 
UCGCAGCG CUGAUGAG X CGAA ICAGUGCG 
CUCGCAGC CUGAUGAG X CGAA IGCAGUGC 
CAGCUCGC CUGAUGAG X CGAA ICGGGCAG 
GUGACCGC CUGAUGAG X CGAA ICUCGCAG 
UGCUGGGG CUGAUGAG X CGAA IACCGCAG 
GCUGCUGG CUGAUGAG X CGAA IUGACCGC 
GGCUGCUG CUGAUGAG X CGAA IGUGACCG 
CGGCUGCU CUGAUGAG X CGAA IGGUGACC 
CCGGCUGC CUGAUGAG X CGAA IGGGUGAC 
ACACCGGC CUGAUGAG X CGAA ICUGGGGU 
CAGACACC CUGAUGAG X CGAA ICUGCUGG 
CCGGGCAC CUGAUGAG X CGAA IACACCGG 
UUCUCCCG CUGAUGAG X CGAA ICACAGAC 
CUUCUCCC CUGAUGAG X CGAA IGCACAGA 



AGGTTGCC C CGCCTGCC 



GGTTGCCC C GCCTGCCC 



TGCCCCGC C TGCCCCAG 



GCCCCGCC T GCCCCAGC 



CCGCCTGC C CCAGCGCT 



CGCCTGCC C CAGCGCTA 
GCCTGCCC C AGCGCTAC 
CCTGCCCC A GCGCTACT 
CCCAGCGC T ACTGGCAA 
AGCGCTAC T GGCAAATG 
CTACTGGC A AATGCGGC 
AATGCGGC C CCTGTTTC 
ATGCGGCC C CTGTTTCT 
TGCGGCCC C TGTTTCTG 
GCGGCCCC T GTTTCTGG 
CCTGTTTC T GGAGCTGC 
TCTGGAGC T GCTTGGGA 
GGAGCTGC T TGGGAACC 
TTGGGAAC C ACGCGCAG 
TGGGAACC A CGCGCAGT 
CCACGCGC A GTGCCCCT 
CGCAGTGC C CCTACGGG 
GCAGTGCC C CTACGGGG 
CAGTGCCC C TACGGGGT 
AGTGCCCC T ACGGGGTG 
CGGGGTGC T CCTCAAGA 
GGGTGCTC C TCAAGACG 
GGTGCTCC T CAAGACGC 
TGCTCCTC A AGACGCAC 
CAAGACGC A CTGCCCGC 
AGACGCAC T GCCCGCTG 
CGCACTGC C CGCTGCGA 
GCACTGCC C GCTGCGAG 
CTGCCCGC T GCGAGCTG 
CTGCGAGC T GCGGTCAC 
CTGCGGTC A CCCCAGCA 
GCGGTCAC C CCAGCAGC 
CGGTCACC C CAGCAGCC 
GGTCACCC C AGCAGCCG 
GTCACCCC A GCAGCCGG 
ACCCCAGC A GCCGGTGT 
CCAGCAGC C GGTGTCTG 
CCGGTGTC T GTGCCCGG 
GTCTGTGC C CGGGAGAA 
TCTGTGCC C GGGAGAAG 
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AGCCCUGG CUGAUGAG X CGAA ICUUCUCC' 



GAGCCCUG CUGAUGAG X CGAA IGCUUCUC 



AGAGCCCU CUGAUGAG X CGAA IGGCUUCU 



CAGAGCCC CUGAUGAG X CGAA IGGGCUUC 



CGCCACAG CUGAUGAG 



GCCGCCAC 
UCCUCGGG 
CUCCUCGG 



X CGAA I 
Y~CGAA^1 



CUGAUGAG X CGAA ICCGCCAC 



CUGAUGAG X CGAA IGCCGCCA 



CCUCCUCG CUGAUGAG X CGAA IGGCCGCC 



UCCUCCUC 
GGGGUCUG 



CUGAUGAG X CGAA IGGGCCGC 



CUGAUGAG X CGAA IUCCUCCU 



CGGGGGUC CUGAUGAG X CGAA IUGUCCUC 



GCGACGGG CUGAUGAG X CGAA 



GGCGACGG CUGAUGAG X CGAA 
AGGCGACG CUGAUGAG X CGAA 



IUCUGUGU 
IGUCUGUG 



CAGGCGAC CUGAUGAG X CGAA IGGGUCUG 
CUGCACCA CUGAUGAG X CGAA I CGACGGG 



GCUGCACC CUGAUGAG X CGAA 



GGAGCAGC CUGAUGAG X CGAA 



IGCGACGG 
ICACCAGG 



GGCGGAGC CUGAUGAG X CGAA ICUGCACC 



GCUGGCGG CUGAUGAG X CGAA ICAGCUGC 



GUGCUGGC CUGAUGAG X CGAA IAGCAGCU 



GCUGUGCU CUGAUGAG X CGAA ICGGAGCA 



UGCUGUGC CUGAUGAG X CGAA IGCGGAGC 



GGCUGCUG CUGAUGAG X CGAA ICUGGCGG 



GGGGCUGC CUGAUGAG X CGAA IUGCUGGC 



CCAGGGGC CUGAUGAG X CGAA ICUGUGCU 



CUGCCAGG CUGAUGAG X CGAA ICUGCUGU 



CCUGCCAG CUGAUGAG X CGAA IGCUGCUG 
ACCUGCCA CUGAUGAG X CGA A IGGCUGCU 
CACCUGCC CUGAUGAG X CGAA IGGGCUGC 
CGUACACC CUGAUGAG X CGAA I CCAGGGG 
CCGCACGA CUGAUGAG X CGAA ICCGUACA 
CGCAGGCA CUGAUGAG X C GAA ICCCGCAC 
GCGGAGGC CUGAUGAG X CGAA IGCCCGCA 
CCGGCGCA CUGAUGAG X CGAA ICAGGCCC 
GCCGGCGC CUGAUGAG X CGAA IGCAGGCC 
CACCAGCC CUGAUGAG X CGAA ICGCAGGC 
GGGGCACC CUGAUGAG X CGAA ICCGGCGC 
GGCCUGGG CUGAUGAG X CGAA ICACCAGC 
AGGCCUGG CUGAUGAG X CGAA IGCACCAG 
GAGGCCUG CUGAUGAG X CGAA IGGCACCA 
AGAGGCCU CUGAUGAG X CGAA IGGGCACC 
CAGAGGCC CUGAUGAG X CGAA IGGGGCAC 
GCCCCAGA CUGAUGAG X CGAA ICCUGGGG 
AGCCCCAG CUGAUGAG X CGAA IGCCUGGG 
GGAGCCCC CUGAUGAG X CGAA IAGGCCUG 



GGAGAAGC C CCAGGGCT 



GAGAAGCC C CAGGGCTC 



AGAAGCCC C AGGGCTCT 



GAAGCCCC A GGGCTCTG 



CCCAGGGC T CTGTGGCG 



CAGGGCTC T GTGGCGGC 



GTGGCGGC C CCCGAGGA 



"TGGCGGCC C CCGAGGAG 
GGCGGCCC C CGAGGAGG 



GCGGCCCC C GAGGAGGA 



AGGAGGAC A CAGACCCC 



GAGGACAC A GACCCCCG 
ACACAGAC C CCCGTCGC 



CACAGACC C CCGTCGCC 



ACAGACCC C CGTCGCCT 
CAGACCCC C GTCGCCTG 



CCCGTCGC C TGGTGCAG 



CCGTCGCC T GGTGCAGC 



CCTGGTGC A GCTGCTCC 



GGTGCAGC T GCTCCGCC 



GCAGCTGC T CCGCCAGC 



AGCTGCTC C GCCAGCAC 



TGCTCCGC C AGCACAGC 



GCTCCGCC A GCACAGCA 



CCGCCAGC A C 
GCCAGCAC A < 



AGCACAGC A GCCCCTGG 



ACAGCAGC C CCTGGCAG 
CAGCAGCC C CTGGCAGG 
AGCAGCCC C TGGCAGGT 
GCAGCCCC T GGCAGGTG 
CCCCTGGC A GGTGTACG 
TGTACGGC T TCGTGCGG 
GTGCGGGC C TGCCTGCG 
TGCGGGCC T GCCTGCGC 
GGGCCTGC C TGCGCCGG 
GGCCTGCC T GCGCCGGC 
GCCTGCGC C GGCTGGTG 
GCGCCGGC T GGTGCCCC 
GCTGGTGC C CCCAGGCC 
CTGGTGCC C CCAGGCCT 
TGGTGCCC C CAGGCCTC 
GGTGCCCC C AGGCCTCT 
GTGCCCCC A GGCCTCTG 
CCCCAGGC C TCTGGGGC 
CCCAGGCC T CTGGGGCT 
CAGGCCTC T GGGGCTCC 
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GUGCCUGG CUGAUGAG X CGAA ICCCCAGA 



UUGUGCCU CUGAUGAG X CGAA IAGCCCCA 



GUUGUGCC CUGAUGAG 



1 CGAA IGAGCCCC 



TCTGGGGC T CCAGGCAC 



TGGGGCTC C AGGCACAA 



i GGCACAAC 



GUUCGUUG CUGAUGAG X CGAA ICCUGGAG 



GCGUUCGU CUGAUGAG X CGAA IUGCCUGG 



GAGGAAGC CUGAUGAG X CGAA ICGUUCGU 



CCUGAGGA CUGAUGAG X CGAA ICGGCGUU 



CTCCAGGC A CAACGAAC 



CCAGGCAC A ACGAACGC 



ACGAACGC C GCTTCCTC 



AACGCCGC T TCCTCAGG 



GUUCCUGA CUGAUGAG X CGAA IAAGCGGC 



UGUUCCUG CUGAUGAG X CGAA I GAAGCGG 



GGUGUUCC CUGAUGAG X CGAA IAGGAAGC 



CUUCUUGG CUGAUGAG X CGAA IUUCCUGA 



AACUUCUU CUGAUGAG X CGAA IUGUUCCU 



GAACUUCU CUGAUGAG X CGAA IGUGUUCC 



CAGGGAGA CUGAUGAG X CGAA IAACUUCU 



CCCCAGGG CUGAUGAG X CGAA IAUGAACU 



UUCCCCAG CUGAUGAG X CGAA I AG AUG AA 



GCCGCTTC C TCAGGAAC 



CCGCTTCC T CAGGAACA 



GCTTCCTC A GGAACACC 



TCAGGAAC A CCAAGAAG 



AGGAACAC C AAGAAGTT 



GGAACACC A AGAAGTTC 



AGAAGTTC A TCTCCCTG 



AGTTCATC T CCCTGGGG 



TTCATCTC C CTGGGGAA 



CUUCCCCA CUGAUGAG X CGAA IGAGAUGA 



GCUUCCCC CUGAUGAG X CGAA IGGAGAUG 



GCUUGGCA CUGAUGAG X CGAA ICUUCCCC 



GAGAGCUU CUGAUGAG X CGAA ICAUGCUU 



CGAGAGCU CUGAUGAG X CGAA IGCAUGCU 



GCAGCGAG CUGAUGAG X CGAA ICUUGGCA 



CUGCAGCG CUGAUGAG X CGAA IAGCUUGG 



GCUCCUGC CUGAUGAG X CGAA ICGAGAC-C 



UCAGCUCC CUGAUGAG X CGAA ICAGCGAG 



UCCACGUC CUGAUGAG X CGAA ICUCCUGC 



CCAAGCGC CUGAUGAG X CGAA IUCCCGCA 
CGCAGCCA CUGAUGAG X CGAA ICGCAGUC 
UCCUGCGC CUGAUGAG X CGAA ICCAAGCG 



UGGGCUCC CUGAUGAG X CGAA ICGCAGCC 



AACCCCUG CUGAUGAG X CGAA ICUCCUGC 



TCATCTCC C TGGGGAAG 



CATCTCCC T GGGGAAGC 



GGGGAAGC A TGCCAAGC 



AAGCATGC C AAGCTCTC 



AGCATGCC A AGCTCTCG 



TGCCAAGC T CTCGCTGC 



CCAAGCTC T CGCTGCAG 



GCTCTCGC T GCAGGAGC 



CTCGCTGC A GGAGCTGA 



GCAGGAGC T GACGTGGA 



TGCGGGAC T GCGCTTGG 



GACTGCGC T TGGCTGCG 



CGCTTGGC T GCGCAGGA 



GGCTGCGC A GGAGCCCA 



GCAGGAGC C CAGGGGTT 



CAACCCCU CUGAUGAG X CGAA IGCUCCUG 



CCAACCCC CUGAUGAG X CGAA IGGCUCCU 



CGGAACAC CUGAUGAG X CGAA I 
CUGCGGCC CUGAUGAG X CGAA 5 



UGCUCUGC CUGAUGAG X CGAA ICCGGAAC 



1 CUGAUGAG X CGAA ICGGCCGG 
J CUGAUGAG 



CGAA ICUCUGCG 
ACGCAGAC CUGAUGAG - X~CGAA IUGCUCUG 
CCUCACGC CUGAUGAG X CGAA 



, IACGGUGC 



CAGGAGCC C AGGGGTTG 



AGGAGCCC A GGGGTTGG 



GGGTTGGC T GTGTTCCG 



CTGTGTTC C GGCCGCAG 



GTTCCGGC C GCAGAGCA 



CCGGCCGC A GAGCACCG 



CGCAGAGC A CCGTCTGC 



CAGAGCAC C GTCTGCGT 



GCACCGTC T GCGTGAGG 



1689 
1692 



ACUUGGCC 



CUGAUGAG X CGAA IAUCUCCU 
CUGAUGAG X CGAA IGAUCUCC 



AGGAACUU CUGAUGAG X CGAA ICCAGGAU 



CAGGAACU CUGAUGAG X CGAA I GCCAGGA 



CCAGUGCA CUGAUGAG X CGAA 1 
GCCAGUGC CUGAUGAG X CGAA ] 



UCAGCCAG CUGAUGAG X CGAA ICAGGAAC 



AGGAGATC C TGGCCAAG 



GGAGATCC T GGCCAAGT 



ATCCTGGC C AAGTTCCT 



TCCTGGCC A AGTTCCTG 



CCAAGTTC C TGCACTGG 



: T GCACTGGC 
: A CTGGCTGA 
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CAUCAGCC CUGAUGAG X CGAA IUGCAGGA 



CACUCAUC CUGAUGAG X CGAA ICCAGUGc" 



ACCUGAGC CUGAUGAG X CGAA ICUCGAClT 



AAGACCUG CUGAUGAG X CGAA ICAGCUCG 



GAAAGACC CUGAUGAG X CGAA IAGCAGCu" 



UAAAAGAA CUGAUGAG X CGAA IACCUGAG^ 



"CACAUAAA CUGAUGAG X CGAA IAAAGACC 



GGUCUCCG CUGAUGAG X CGAA IACAUAAA 



" UGAAACGU CUGAUGAG X CGAA IUCUCCGU 



UUGAAACG CUGAUGAG X CGAA IGUCUCCG 



~UGUUCUUU CUGAUGAG X CGAA IAAACGUG 



'AAAGAGCC CUGAUGAG X CGAA IUUCUUUU~ 
AGAAAAAG CUGAUGAG X CGAA ICCUGUUC 



1814 
1823 



"GUAGAAAA CUGAUGAG X CGAA IAGCCUGU 



"CUUCCGGU CUGAUGAG X CGAA IAAAAAGA 



"aCUCUUCC CUGAUGAG X CGAA IUAGAAAA 



"CUUGCUCC CUGAUGAG X CGAA IACACUCU 



UUGCAACU CUGAUGAG X CGAA ICUCCAGA 



CAAUGCUU CUGAUG AG X CGAA ICAACUUG 
"GAUUCCAA CUGAUGAG X CGAA ICUUUGCA 



GUGCUGUC CUGAUGAG X CGAA IAUUCCAA 



UCAAGUGC CUGAUGAG X CGAA IUCUGAUU" 



UCUUCAAG CUGAUGAG X CGAA ICUGUCUG 



CCUCUUCA CUGAUGAG X CGAA lUGCUGUC" 



CCCGCAGC CUGAUGAG X CGAA ICACCCUC 



GCUCCCGC CUGAUGAG X CGAA ICUGCACC 



CUUCCGAC CUGAUGAG X CGAA ICUCCCGC 



~CUGACCUC CUGAUGAG X CGAA ICUUCCGA 



AUGCUGCC CUGAUGAG X CGAA IACCUCUG 



"CCCGAUGC CUGAUGAG X CGAA ICCUGACC 



CUUCCCGA CUGAUGAG X CGAA ICUGCCUG 



GCGGGCCU CUGAUGA G X CGAA ICUUCCCG 
GGCGGGCC CUGAUGAG X CGAA IGCUUCCC 



"GCAGGGCG CUGAUGAG X CGAA I CCUGGCU 



AGCAGGGC CUGAUGAG X CGAA IGCCUGGC 



"gUCAGCAG CUGAUGAG X CGAA ICGGGCCU 



CGUCAGCA CUGAUGAG X CGAA IGCGGGCC 



ACGUCAGC CUGAUGAG X CGAA IGGCGGGC 



UGGACGUC CUGAUGAG X CGAA ICAGGGCg" 



"CGGAGUCU CUGAUGAG X CGAA IACGUCAG 



GCGGAGUC CUGAUGAG X CGAA IGACGUCA 



UGAAGCGG CUGAUGAG X CGAA IUCUGGAC 



GAUGAAGC CUGAUGAG X CGAA IAGUCUGG~ 



"GGGGAUGA CUGAUGAG X CGAA ICGGAGUCT 



"CUUGGGGA CUGAUGAG X CGAA IAAGCGGa" 



" AGGCUUGG CUGAUGAG X CGAA IAUGAAGC 



CAGGCUUG CUGAUGAG X CGAA IGAUGAAG 



TCCTGCAC T GGCTGATG 



GCACTGGC T GATGAGTG 



CGTCGAGC T GCTCAGGT 



CGAGCTGC T CAGGTCTT 



AGCTGCTC A GGTCTTTC 



CTCAGGTC T TTCTTTTA 



GGTCTTTC T TTTATGTC 



TTTATGTC A CGGAGACC 



ACGGAGAC C ACGTTTCA 



CGGAGACC A CGTTTCAA 



CACGTTTC A AAAGAACA 



AAAAGAAC A GGCTCTT 



GAACAGGC T CTTTTTCT 



ACAGGCTC T TTTTCTAC 



TCTTTTTC T ACCGGAAG 



TTTTCTAC C GGAAGAGT 



AGAGTGTC T GGAGCAAG 



TCTGGAGC A AGTTGCAA 



CAAGTTGC A AAGCATTG 



TGCAAAGC A TTGGAATC 



TTGGAATC A GACAGCAC 



AATCAGAC A GCACTTGA 



CAGACAGC A CTTGAAGA 



GACAGCAC T TGAAGAGG 



GAGGGTGC A GCTGCGGG 



GGTGCAGC T. GCGGGAGC 



GCGGGAGC T GTCGGAAG 



TCGGAAGC A GAGGTCAG 



CAGAGGTC A GGCAGCAT 



GGTCAGGC A G CATCGGG 
CAGGCAGC A TCGGGAAG 



CGGGAAGC C AGGCCCGC 



GGGAAGCC A GGCCCGCC 



AGCCAGGC C CGCCCTGC 



GCCAGGCC C GCCCTGCT 



AGGCCCGC C CTGCTGAC 



GGCCCGCC C TGCTGACG 



GCCCGCCC T GCTGACGT 



CGCCCTGC T GACGTCCA 



CTGACGTC C AGACTCCG 



TGACGTCC A GACTCCGC 



' GTCCAGAC T CCGCTTCA 



CCAGACTC C GCTTCATC 
GACTCCGC T TCATCCCC 



TCCGCTTC A TCCCCAAG 



GCTTCATC C CCAAGCCT 



CTTCATCC C CAAGCCTG 
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UCAGGCUU CUGAUGAG X CGAA IGGAUGAA 



TTCATCCC C AAGCCTGA 



GUCAGGCU CUGAUGAG X CGAA IGGGAUGA 



GCCCGUCA CUGAUGAG X CGAA ICUUGGGG 



TCATCCCC A AGCCTGAC 



CCCCAAGC C TGACGGGC 



AGCCCGUC CUGAUGAG X CGAA IGCUUGGG 



UCGGCCGC CUGAUGAG X CGAA ICCCGUCA 



CCCAAGCC T GACGGGCT 



TGACGGGC T GCGGCCGA 



UCACAAUC CUGAUGAG X CGAA ICCGCAGC 



GUAGUCCA CUGAUGAG X CGAA IUUCACAA 



CACGACGU CUGAUGAG X CGAA IUCCAUGU 



GCTGCGGC C GATTGTGA 



TTGTGAAC A TGGACTAC 



ACATGGAC T ACGTCGTG 



AACGUUCU CUGAUGAG X CGAA ICUCCCAC 



GTGGGAGC C AGAACGTT 



GAACGUUC CUGAUGAG X CGAA IGCUCCCA 



UUCUCUGC CUGAUGAG X CGAA IAACGUUC 



TGGGAGCC A GAACGTTC 



GAACGTTC C GCAGAGAA 



CUUUUCUC CUGAUGAG X CGAA ICGGAACG 



GTTCCGC A GAGAAAAG 



AGACGCUC CUGAUGAG X CGAA ICCCUCUU 



UCGAGGUG CUGAUGAG X CGAA IACGCUCG 



AAGAGGGC C GAGCGTCT 



CGAGCGTC T CACCTCGA 



CCUCGAGG CUGAUGAG X CGAA IAGACGCU 



AGCGTCTC A CCTCGAGG 



ACCCUCGA CUGAUGAG X CGAA IUGAGACG 



CACCCUCG CUGAUGAG X CGAA I GUGAGAC 



CGTCTCAC C TCGAGGGT 



GTCTCACC T CGAGGGTG 



5 CUGAUGAG X CGAA ICCUUCAC 



CUGAACAG 

CGCUGAAC CUGAUGAG X CGAA IUGCCUUC 



GTGAAGGC A CTGTTCAG 



GAAGGCAC T GTTCAGCG 



GAGCACGC CUGAUGAG X CGAA IAACAGUG 



CGUAGUUG CUGAUGAG X CGAA ICACGCUG 



CUCGUAGU CUGAUGAG X CGAA IAGCACGC 



CCGCUCGU CUGAUGAG X CGAA IUUGAGCA 



CACTGTTC A GCGTGCTC 



CAGCGTGC T CAACTACG 



GCGTGCTC A ACTACGAG 



TGCTCAAC T ACGAGCGG 



GAGGCCGG CUGAUGAG X CGAA ICGCCGCG 



GGAGGCCG CUGAUGAG X CGAA IGCGCCGC 



AGGAGGCC CUGAUGAG X CGAA IGGCGCCG 



CGCGGCGC C CCGGCCTC 



GCGGCGCC C CGGCCTCC 



CGGCGCCC C GGCCTCCT 



GCCCAGGA CUGAUGAG X CGAA ICCGGGGC 



GCCCCGGC C TCCTGGGC 



CGCCCAGG CUGAUGAG X CGAA 1 
GGCGCCCA CUGAUGAG X CGAA ] 



AGGCGCCC CUGAUGAG X CGAA I GAGGCCG 



AGCACAGA CUGAUGAG X CGAA ICGCCCAG 



CAGCACAG CUGAUGAG X CGAA IGCGCCCA 



CCGGCCTC C TGGGCGCC 



CGGCCTCC T GGGCGCCT 



CTGGGCGC C TCTGTGCT 



TGGGCGCC T CTGTGCTG 



CCCAGCAC CUGAUGAG X CGAA IAGGCGCC 



CCAGGCCC CUGAUGAG X CGAA ICACAGAG 



AUCGUCCA CUGAUGAG X CGAA ICCCAGCA 



GGCGCCTC T GTGCTGGG 



CTCTGTGC T GGGCCTGG 



TGCTGGGC C TGGACGAT 



UAUCGUCC CUGAUGAG X CGAA IGCCCAGC 



GCTGGGCC T GGACGATA 



GGCCCUGU CUGAUGAG X CGAA IAUAUCGU 



ACGATATC C ACAGGGCC 



AGGCCCUG CUGAUGAG X CGAA IGAUAUCG 



CCAGGCCC CUGAUGAG X CGAA IUGGAUAU 



CGATATCC A CAGGGCCT 



ATATCCAC A GGGCCTGG 



2122 
2123 



GUGCGCCA CUGAUGAG X CGAA ICCCUGUG 



GGUGCGCC CUGAUGAG X CGAA IGCCCUGU 



CACGAAGG CUGAUGAG X CGAA ICGCCAGG 



CACAGGGC C TGGCGCAC 



ACAGGGCC T GGCGCACC 



CCTGGCGC A CCTTCGTG 



AGCACGAA CUGAUGAG X CGAA 1 
CAGCACGA CUGAUGAG X CGAA 5 



GCACACGC CUGAUGAG X CGAA I CACGAAG 



GGGUCCUG CUGAUGAG X CGAA ICCCGCAC 



TGGCGCAC C TTCGTGCT 



GGCGCACC T TCGTGCTG 
CTTCGTGC l 



: T GCGTGTGC 
: C CAGGACCC 



CGGGUCCU CUGAUGAG X CGAA IGCCCGCA 



TGCGGGCC C AGGACCCG 
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GCGGGUCC CUGAUGAG X CGAA IGGCCCGC 
AGGCGGCG CUGAUGAG X CGAA IUCCUGGG 



CAGGCGGC CUGAUGAG 2 



X CGAA 
GCUCAGGC CUGAUGAG X CGAA 
ACAGCUCA CUGAUGAG X CGAA 



UACAGCUC CUGAUGAG X CGAA 



I CGGCGGG 
I GCGGCGG 



X CGAA 
CUUGACAA CUGAUGAG X CGAA 



CAAAGUAC CUGAUGAG 2 



AUCCACCU CUGAUGAG X CGAA 



IUACAGCU 
IACAAAGU 



GGGGAUGG CUGAUGAG X CGAA IUCGUACG 



UGGGGGAU CUGAUGAG X CGAA IUGUCGUA 



CUGGGGGA CUGAUGAG X CGAA IGUGUCGU 



GUCCUGGG CUGAUGAG X CGAA IAUGGUGU 



UGUCCUGG CUGAUGAG X CGAA IGAUGGUG 



CUGUCCUG CUGAUGAG X CGAA I GGAUGGU 



CCUGUCCU CUGAUGAG X CGAA I GGGAUGG 



GCCUGUCC CUGAUGAG X CGAA I GGGGAUG 



CGUGAGCC CUGAUGAG X CGAA IUCCUGGG 



CCUCCGUG CUGAUGAG X CGAA ICCUGUCC 



GACCUCCG CUGAUGAG X CGAA IAGCCUGU 



GCUGGCGA CUGAUGAG X CGAA IACCUCCG 



AUGAUGCU CUGAUGAG X CGAA ICGAUGAC 



GAUGAUGC CUGAUGAG X CGAA IGCGAUGA 



UUUGAUGA CUGAUGAG X CGAA ICUGGCGA 



GGGUUUGA CUGAUGAG X CGAA IAUGCUGG 



CUGGGGUU CUGAUGAG X CGAA IAUGAUGC 



UGUUCUGG CUGAUGAG X CGAA IUUUGAUG 



GUGUUCUG CUGAUGAG X CGAA IGUUUGAU 



CGUGUUCU CUGAUGAG X CGAA IGGUUUGA 



ACGUGUUC CUGAUGAG X CGAA IGGGUUUG 



GCAGUACG CUGAUGAG X CGAA IUUCUGGG 



ACGCACGC CUGAUGAG X CGAA IUACGUGU 



UGGACCAC CUGAUGAG X CGAA ICAUACCG 



GGCCUUCU CUGAUGAG X CGAA IACCACGG 



CGGCCUUC CUGAUGAG X CGAA IGACCACG 



CCAUGGGC CUGAUGAG X CGAA ICCUUCUG 



UGCCCAUG CUGAUGAG X CGAA ICGGCCUU 



GUGCCCAU CUGAUGAG X CGAA IGCGGCCU 



CGUGCCCA CUGAUGAG X CGAA IGGCGGCC 



UGCGGACG CUGAUGAG X CGAA ICCCAUGG 



GGCCUUGC CUGAUGAG X CGAA IACGUGCC 



GAAGGCCU CUGAUGAG X CGAA ICGGACGU 



CUCUUGAA CUGAUGAG X CGAA ICCUUGCG 



GCUCUUGA CUGAUGAG X CGAA IGCCUUGC 



GUGGCUCU CUGAUGAG X CGAA IAAGGCCU 



AGAGACGU CUGAUGAG X CGAA ICUCUUGA 



UAGAGACG CUGAUGAG X CGAA IGCUCUUG 



GCGGGCCC A GGACCCGC 
CCCAGGAC C CGCCGCCT 



CCAGGACC C GCCGCCTG 



GGACCCGC C GCCTGAGC 



CCCGCCGC C TGAGCTGT 



CCGCCGCC T GAGCTGTA 
GCCTGAGC T GTACTTTG 



AGCTGTAC T TTGTCAAG 



ACTTTGTC A AGGTGGAT 



CGTACGAC A CCATCCCC 



TACGACAC C ATCCCCCA 



ACGACACC A TCCCCCAG 



ACACCATC C CCCAGGAC 



CACCATCC C CCAGGACA 



ACCATCCC C CAGGACAG 
CCATCCCC C AGGACAGG 



CATCCCCC A GGACAGGC 



CCCAGGAC A GGCTCACG 



GGACAGGC T CACGGAGG 



ACAGGCTC A CGGAGGTC 



CGGAGGTC A TCGCCAGC 



GTCATCGC C AGCATCAT 



TCATCGCC A GCATCATC 



TCGCCAGC A TCATCAAA 



CCAGCATC A TCAAACCC 
GCATCATC A AACCCCAG 



CATCAAAC C CCAGAACA 



ATCAAACC C CAGAACAC 
TCAAACCC C AGAACACG 



CAAACCCC A GAACACGT 



CCCAGAAC A CGTACTGC 



ACACGTAC T GCGTGCGT 



CGGTATGC C GTGGTCCA 



CCGTGGTC C AGAAGGCC 



CGTGGTCC A GAAGGCCG 



CAGAAGGC C GCCCATGG 



AAGGCCGC C CATGGGCA 



AGGCCGCC C ATGGGCAC 



GGCCGCCC A TGGGCACG 



CCATGGGC A CGTCCGCA 



GGCACGTC C GCAAGGCC 



ACGTCCGC A AGGCCTTC 



CGCAAGGC C TTCAAGAG 



GCAAGGCC T TCAAGAGC 



AGGCCTTC A AGAGCCAC 



TCAAGAGC C ACGTCTCT 



CAAGAGCC A CGTCTCTA 
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Table 14 



2345 


CAAGGUAG CUGAUGAG X CGAA IACGUGGC 




GCCACGTC T CTACCTTG 




2347 


GUCAAGGU CUGAUGAG X CGAA IAGACGUG 




CACGTCTC T ACCTTGAC 




2350 


UCUGUCAA CUGAUGAG X CGAA IUAGAGAC 




GTCTCTAC C TTGACAGA 




2351 


GUCUGUCA CUGAUGAG X CGAA IGUAGAGA 




TCTCTACC T TGACAGAC 





2356 


UGGAGGUC CUGAUGAG X CGAA IUCAAGGU 




ACCTTGAC A GACCTCCA 




2360 


CGGCUGGA CUGAUGAG X CGAA IUCUGUCA 




TGACAGAC C TCCAGCCG 




2361 


ACGGCUGG CUGAUGAG X CGAA IGUCUGUC 




GACAGACC T CCAGCCGT 




2363 


GUACGGCU CUGAUGAG X CGAA IAGGUCUG 




CAGACCTC C AGCCGTAC 




2364 


UGUACGGC CUGAUGAG X CGAA IGAGGUCU 








2367 


GCAUGUAC CUGAUGAG X CGAA ICUGGAGG 




CCTCCAGC C GTACATGC 




2372 


CUGUCGCA CUGAUGAG X CGAA IUACGGCU 




AGCCGTAC A TGCGACAG 




2379 


CCACGAAC CUGAUGAG X CGAA IUCGCAUG 




^TCGTGGC T CACCTGCA 




2389 


UGCAGGUG CUGAUGAG X CGAA ICCACGAA 









2391 


CCUGCAGG CUGAUGAG X CGAA IAGCCACG 




CGTGGCTC A CCTGCAGG 




2393- 


CUCCUGCA CUGAUGAG X CGAA IUGAGCCA 




TGGCTCAC C TGCAGGAG 




2394 


UCUCCUGC CUGAUGAG X CGAA IGUGAGCC 




GGCTCACC T GCAGGAGA 




2397 


UGGUCUCC CUGAUGAG X CGAA ICAGGUGA 




TCACCTGC A GGAGACCA 




2404 


AGCGGGCU CUGAUGAG X CGAA IUCUCCUG 




_ CAGGAG ^ G g^CCGCTG 




2405 


CAGCGGGC CUGAUGAG X CGAA IGUCUCCU 




AGGAGACC A GCC ™^ 




2408 


CCUCAGCG CUGAUGAG X CGAA ICUGGUCU 




AGACCAGC C 





2409 


CCCUCAGC CUGAUGAG X CGAA IGCUGGUC 




GACCAGCC C _______ 





2412 


CAUCCCUC CUGAUGAG X CGAA ICGGGCUG 




CAGCCCGC T GA _ 





2422 


AUGACGAC CUGAUGAG X CGAA ICAUCCCU 




AGGGATGC C GTCGTCA'l 





2429 


CUGCUCGA CUGAUGAG X CGAA IACGACGG 




CCGTCGTC A 





2436 


AGGAGCUC CUGAUGAG X CGAA ICUCGAUG 




CATCGAGC A 




2441 


CAGGGAGG CUGAUGAG X CGAA ICUCUGCU 




AGCAGAGC T 




2443 


UUCAGGGA CUGAUGAG X CGAA IAGCUCUG 








2444 


AUUCAGGG CUGAUGAG X CGAA IGAGCUCU 




AGAGCTCC T CCCTGAAT 




2446 


UCAUUCAG CUGAUGAG X CGAA IAGGAGCU 




^TccTCC C TGAATGAG 




2447 


' CUCAUUCA CUGAUGAG X CGAA IGAGGAGC 




^^cCC T GAATGAGG 




2448 


CCUCAUUC CUGAUGAG X CGAA IGGAGGAG 








2458 


CCACUGCU CUGAUGAG X CGAA ICCUCAUU 




AATGAGGC C AGCAGTGG 




2459 


GCCACUGC CUGAUGAG X CGAA IGCCUCAU 




ATGAGGCC A GCAGTGGC 




2462 


GAGGCCAC CUGAUGAG X CGAA ICUGGCCU 








2468 


GUCGAAGA CUGAUGAG X CGAA ICCACUGC 




GCAGTGGC c T CTTCGAC 




2469 


CGUCGAAG CUGAUGAG X CGAA IGCCACUG 




CAGTGGCC T CTTCGACG 




2471 


GACGUCGA CUGAUGAG X CGAA IAGGCCAC 




GTGGCCTC T TCGACGTC 




2480 


GCGUAGGA CUGAUGAG X CGAA IACGUCGA 




TCGACGTC T TCCTACGC 






GAAGCGUA CUGAUGAG X CGAA IAAGACGU 




ACGTCTTC C TACGCTTC 




2484 


UGAAGCGU CUGAUGAG X CGAA IGAAGACG 




CGTCTTCC T ACGCTTCA 




2489 


GCACAUGA CUGAUGAG X CGAA ICGUAGGA 




TCCTACGC T TCATGTGC 




2492 


GUGGCACA CUGAUGAG X CGAA IAAGCGUA 




TACGCTTC A TGTGCCAC 




2498 


GGCGUGGU CUGAUGAG X CGAA ICACAUGA 




TCATGTGC C ACCACGCC 




2499 


CGGCGUGG CUGAUGAG X CGAA IGCACAUG 




CATGTGCC A CCACGCCG 




2501 


CACGGCGU CUGAUGAG X CGAA IUGGCACA 




TGTGCCAC C ACGCCGTG 




2502 


GCACGGCG CUGAUGAG X CGAA IGUGGCAC 




GTGCCACC A CGCCGTGC 




2506 


AUGCGCAC CUGAUGAG X CGAA ICGUGGUG 




CACCACGC C GTGCGCAT 





WO 01/16312 PCT/US00/23998 



GCCCCUGA CUGAUGAG X CGAA ICGCACGG 



2516 
2522 



CUUGCCCC CUGAUGAG X CGAA IAUGCGCA 



GUAGGACU CUGAUGAG X CGAA ICCCCUGA 



UGGACGUA CUGAUGAG X CGAA IACUUGCC 



2528 
2S34 

2535 



CUGGACGU CUGAUGAG X CGAA IGACUUGC 



CUGGCACU CUGAUGAG X CGAA IACGUAGG 



CCUGGCAC CUGAUGAG X CGAA IGACGUAG 



GAUCCCCU CUGAUGAG X CGAA ICACUGGA 



GGAUCCCC CUGAUGAG X CGAA IGCACUGG 



CCGTGCGC A TCAGGGGC 



TGCGCATC A GGGGCAAG 



TCAGGGGC A AGTCCTAC 



GGCAAGTC C TACGTCCA 



GCAAGTCC T ACGTCCAG 



CCTACGTC C AGTGCCAG 



CTACGTCC A GTGCCAGG 



TCCAGTGC C AGGGGATC 



CCAGTGCC A GGGGATCC 



GCCCUGCG CUGAUGAG X CGAA IAUCCCCU 



AGCCCUGC CUGAUGAG X CGAA IGAUCCCC 



AGGGGATC C CGCAGGGC 



GGGGATCC C GCAGGGCT 



UGGAGCCC CUGAUGAG X CGAA ICGGGAUC 



GAGGAUGG CUGAUGAG X CGAA ICCCUGCG 



GAGAGGAU CUGAUGAG X CGAA IAGCCCUG 



GGAGAGGA CUGAUGAG X CGAA IGAGCCCU 



CGUGGAGA CUGAUGAG X CGAA IAUGGAGC 



GCGUGGAG CUGAUGAG X CGAA IGAUGGAG 
CAGCGUGG CUGAUGAG X CGAA IAGGAUGG 
^GCAGCGU CUGAUGAG X CGAA IAGAGGAU 
GAGCAGCG CUGAUGAG X CGAA I GAGAGGA 
UGCAGAGC CUGAUGAG X CGAA I CGUGGAG 
GGCUGCAG CUGAUGAG X CGAA ICAGCGUG 
CAGGCUGC CUGAUGAG X CGAA IAGCAGCG 
GCACAGGC CUGAUGAG X CGAA ICAGAGCA 



GUAGCACA 
CGUAGCAC 



CUGAUGAG X CGAA ICUGCAGA 
CUGAUGAG X CGAA IGCUGCAG 
GUCGCCGU CUGAUGAG X C GAA ICACAGGC 
GUUCUCCA CUGAUGAG X CGAA IUCGCCGU 
AAACAGCU CUGAUGAG X CGAA IUUCUCCA 
CCGCAAAC ( 
GCAGGAGC CUGAUGAG X CGAA ICCCGUCC 
AACGCAGG CUGAUGAG X CGAA ICAGCCCG 
CAAACGCA CUGAUGAG X CGAA IAGCAGCC 
CCAAACGC CUGAUGAG X CGAA I GAGCAGC 
CACCAACA CUGAUGAG X CG AA IAAAUCAU 
AGGUGAGG CUGAUGAG X CGAA IUCACCAA 
UGAGGUGA 
GUGAGGUG 
GGGUGAGG 



CUGAUGAG X CGAA IUGUCACC 
CUGAUGAG X CGAA IGUGUCAC 
CUGAUGAG X CGAA IAGGUGUC 
GUGGGUGA CUGAUGAG X CGAA IUGAGGUG 
CGUGGGUG CUGAUGAG X CGAA IGUGAGGU 
CGCGUGGG CUGAUGAG X CGAA IAGGUGAG 
UUCGCGUG CUGAUGAG X CGAA IUGAGGUG 
UUUCGCGU CUGAUGAG X CGAA IGUGAGGU 
UUUUCGCG CUGAUGAG X CGAA IGGUGAGG ' 
CUGAGGAA CUGAUGAG X CGAA IUUUUCGC 
CCUGAGGA CUGAUGAG X CGAA IGUUUUCG 



GATCCCGC A GGGCTCCA 



CGCAGGGC T CCATCCTC 



CAGGGCTC C ATCCTCTC 



AGGGCTCC A TCCTCTCC 



GCTCCATC C TCTCCACG 



CTCCATCC T CTCCACGC 
CCATCCTC T CCACGCTG 
ATCCTCTC C ACGCTGCT 
TCCTCTCC A CGCTGCTC 
CTCCACGC T GCTCTGCA 
CACGCTGC T CTGCAGCC 
CGCTGCTC T GCAGCCTG 
TGCTCTGC A GCCTGTGC 
TCTGCAGC C TGTGCTAC 
CTGCAGCC T GTGCTACG 
GCCTGTGC T ACGGCGAC 
ACGGCGAC A TGGAGAAC 
TGGAGAAC A AGCTGTTT 
GAACAAGC T GTTTGCGG 
GGACGGGC T GCTCCTGC 
CGGGCTGC T CCTGCGTT 
GGCTGCTC C TGCGTTTG 
GCTGCTCC T GCGTTTGG 
ATGATTTC T TGTTGGTG 
TTGGTGAC A CCTCACCT 
GGTGACAC C TCACCTCA 
GTGACACC T CACCTCAC 
GACACCTC A CCTCACCC 
CACCTCAC C TCACCCAC 
ACCTCACC T CACCCACG 
CTCACCTC A CCCACGCG 
CACCTCAC C CACGCGAA 
ACCTCACC C ACGCGAAA 
CCTCACCC A CGCGAAAA 
GCGAAAAC C TTCCTCAG 
CGAAAACC T TCCTCAGG 
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2705 


GGUCCUGA CUGAUGAG X CGAA IAAGGUUU 




AAACCTTC C _____ 




2706 


GGGUCCUG CUGAUGAG X CGAA IGAAGGUU 




^ AACCTTCC T CAGGA 




2708 


CAGGGUCC CUGAUGAG X CGAA IAGGAAGG 




_ CC,TTCCTC _ ctggtccg 




2713 


CGGACCAG CUGAUGAG X CGAA IUCCUGAG 




CTCAGGAC C CT 




2714 


UCGGACCA CUGAUGAG X CGAA IGUCCUGA 




TCAGGACC C TGGTCCGA 




2715 


CUCGGACC CUGAUGAG X CGAA IGGUCCUG 




CAGGACCC T GGTCCGAG 




2720 


GACACCUC CUGAUGAG X CGAA IACCAGGG 




CCCTGGTC C GAGGTGTC 


; 


2729 


AUACUCAG CUGAUGAG X CGAA IACACCUC 




GAGGTGTC C CTGAGTAT 




2730 


CAUACUCA CUGAUGAG X CGAA" IGACACCU 




AGGTGTCC C TGAGTATG 




2731 


CCAUACUC CUGAUGAG X CGAA IGGACACC 




GGTGTCCC T GAGTATGG 




2741 


CACCACGC CUGAUGAG X CGAA ICCAUACU 




AGTATGGC T GCGTGGTG 




2753 


CUUCCGCA CUGAUGAG X CGAA IUUCACCA 




TGGTGAAC T TGCGGAAG 




2764 


UUCACCAC CUGAUGAG X CGAA IUCUUCCG 




CGGAAGAC A GTGGTGAA 




2774 


" UACAGGGA CUGAUGAG X CGAA IUUCACCA 




TGGTGAAC T TCCCTGTA 





2777 


UUCUACAG CUGAUGAG X CGAA IAAGUUCA 




TGAACTTC C CTGTAGAA 




2778 


CUUCUACA CUGAUGAG X CGAA IGAAGUUC 




GAACTTCC C TGTAGAAG 




2779 


UCUUCUAC CUGAUGAG X CGAA I GGAAGUU 




AACTTCCC T GTAGAAGA 




2794 


CCACCCAG CUGAUGAG X CGAA ICCUCGUC 




GACGAGGC C CTGGGTGG 




2795 


GCCACCCA CUGAUGAG X CGAA IGCCUCGU 




ACGAGGCC C TGGGTGGC 




2796 


UGCCACCC CUGAUGAG X CGAA IGGCCUCG 




CGAGGCCC T GGGTGGCA 




2804 


AAAAGCCG CUGAUGAG X CGAA ICCACCCA 




TGGGTGGC A CGGCTTTT 




2809 


UGAACAAA CUGAUGAG X CGAA ICCGUGCC 




GGCACGGC T TTTGTTCA 




2817 


CCGGCAUC CUGAUGAG X CGAA IAACAAAA 




TTTTGTTC A GATGCCGG 




2823 


CGUGGGCC CUGAUGAG X CGAA ICAUCUGA 




TCAGATGC C GGCCCACG 






AGGCCGUG CUGAUGAG X CGAA ICCGGCAU 




ATGCCGGC C CACGGCCT 




2828 


" UAGGCCGU CUGAUGAG X CGAA IGCCGGCA 




TGCCGGCC C ACGGCCTA 




2829 


AUAGGCCG CUGAUGAG X CGAA IGGCCGGC 




GCCGGCCC A CGGCCTAT 




2834 


GGGGAAUA CUGAUGAG X CGAA ICCGUGGG 




CCCACGGC C TATTCCCC 




2835 


AGGGGAAU CUGAUGAG X CGAA IGCCGUGG 




CCACGGCC T ATTCCCCT 




2840 


GCACCAGG CUGAUGAG X CGAA IAAUAGGC 




GCCTATTC C CCTGGTGC 





2841 


CGCACCAG CUGAUGAG X CGAA I GAAUAGG 




CCTATTCC C CTGGTGCG 




2842 


"' CCGCACCA CUGAUGAG X CGAA IGGAAUAG 




CTATTCCC C TGGTGCGG 






GCCGCACC CUGAUGAG X CGAA IGGGAAUA 




TATTCCCC T GGTGCGGC 
GGTGCGGC C TGCTGCTG 




2852 


CAGCAGCA CUGAUGAG X CGAA ICCGCACC 
CCAGCAGC CUGAUGAG X CGAA IGCCGCAC 




GTGCGGCC T GCTGCTGG 





2856 


"" UAUCCAGC CUGAUGAG X CGAA ICAGGCCG 




CGGCCTGC T GCTGGATA 




2859 


GGGUAUCC CUGAUGAG X CGAA ICAGCAGG 




CCTGCTGC T GGATACCC 




2866 


AGGGUCCG CUGAUGAG X CGAA IUAUCCAG 




CTGGATAC C CGGACCCT 






CAGGGUCC CUGAUGAG X CGAA IGUAUCCA 




TGGATACC C GGACCCTG 




2872 


ACCUCCAG CUGAUGAG X CGAA IUCCGGGU 




ACCCGGAC C CTGGAGGT 




2873 


CACCUCCA CUGAUGAG X CGAA IGUCCGGG 




CCCGGACC C TGGAGGTG 
CCGGACCC T GGAGGTGC 




2874 
2883 


GCACCUCC CUGAUGAG X CGAA IGGUCCGG 
AGUCGCUC CUGAUGAG X CGAA ICACCUCC 




GGAGGTGC A GAGCGACT 
AGAGCGAC T ACTCCAGC 




2891 
2894 


GCUGGAGU CUGAUGAG X CGAA IUCGCUCU 
AUAGCUGG CUGAUGAG X CGAA IUAGUCGC 




GCGACTAC T CCAGCTAT 




2896 


" GCAUAGCU CUGAUGAG X CGAA IAGUAGUC 




GACTACTC C AGCTATGC 




2897 


GGCAUAGC CUGAUGAG X CGAA IGAGUAGU 




ACTACTCC A GCTATGCC 
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2900 


CCGGGCAU CUGAUGAG X CGAA ICUGGAGU 




ACTCCAGC T ATGCCCGG 






2905 


GAGGUCCG CUGAUGAG X CGAA ICAUAGCU 




AGCTATGC C CGGACCTC 




2906 


GGAGGUCC CUGAUGAG X CGAA IGCAUAGC 










CUGAUGGA CUGAUGAG X CGAA IUCCGGGC 




GCCCGGAC C TCCATCAG 




2912 


UCUGAUGG CUGAUGAG X CGAA IGUCCGGG 




CCCGGACC T CCATCAGA 




2914 


GCUCUGAU CUGAUGAG X CGAA IAGGUCCG 




CGGACCTC C ATCAGAGC' 
GGACCTCC A TCAGAGCC 




2915 
2918 


GGCUCUGA CUGAUGAG X CGAA IGAGGUCC 
ACUGGCUC CUGAUGAG X CGAA IAUGGAGG 




CCTCCATC A GAGCCAGT 




2923 


GUGAGACU CUGAUGAG X CGAA ICUCUGAU 








2924 


GGUGAGAC CUGAUGAG X CGAA IGCUCUGA 




TCAGAGCC A GTCTCACC 




2928 


UGAAGGUG CUGAUGAG X CGAA IACUGGCU 




AGCCAGTC T CACCTTCA 




2930 


GUUGAAGG CUGAUGAG X CGAA IAGACUGG 




CCAGTCTC A CCTTCAAC 




2932 
2933 


CGGUUGAA CUGAUGAG X CGAA IUGAGACU 
GCGGUUGA CUGAUGAG X CGAA IGUGAGAC 




GTCTCACC T TCAACCGC 




2936 


GCCGCGGU CUGAUGAG X CGAA IAAGGUGA 








2939 
2945 


GAAGCCGC CUGAUGAG X CGAA IUUGAAGG 
AGCCUUGA CUGAUGAG X CGAA ICCGCGGU 




CCTTCAAC C GCGGCTTC 
ACCGCGGC T TCAAGGCT 




2948 
2953 


CCCAGCCU CUGAUGAG X CGAA IAAGCCGC 
UUCCUCCC CUGAUGAG X CGAA ICCUUGAA 




GCGGCTTC A AGGCTGGG 

TTCAAGGC T GGGAGGAA 




2963 
2972 


GCGACGCA CUGAUGAG X CGAA IUUCCUCC 
AAAGAGUU CUGAUGAG X CGAA ICGACGCA 




GGAGGAAC A TGCGTCGC 
TGCGTCGC A AACTC1TT 




2976 
2978 


CCCCAAAG CUGAUGAG X CGAA IUUUGCGA 
GACCCCAA CUGAUGAG X CGAA IAGUUUGC 




TCGCAAAC T CTTTGGGG 
GCAAACTC T TTGGGGTC 




2987 


CAGCCGCA CUGAUGAG X CGAA IACCCCAA 








2994 


GACACUUC CUGAUGAG X CGAA ICCGCAAG 




CTTGCGGC T GAAGTGTC 






ACAGGCUG CUGAUGAG X CGAA IACACUUC 




GAAGTGTC A CAGCCTGT 




3005 


AAACAGGC CUGAUGAG X CGAA IUGACACU 




AGTGTCAC A GCCTGTTT 




3008 


CAGAAACA CUGAUGAG X CGAA ICUGUGAC 




GTCACAGC C TGTTTCTG 




3015 


CCAGAAAC CUGAUGAG X CGAA IGCUGUGA 
GCAAAUCC CUGAUGAG X CGAA IAAACAGG 




TCACAGCC T GTTTCTGG 
CCTGTTTC T GGATTTGC 




3032 


UGUUCACC CUGAUGAG X CGAA ICAAAUCC 
CUGGAGGC CUGAUGAG X CGAA IUUCACCU 




GGATTTGC A GGTGAACA 
AGGTGAAC A GCCTCCAG 






3035 
3036 


CGUCUGGA CUGAUGAG X CGAA ICUGUUCA 
CCGUCUGG CUGAUGAG X CGAA IGCUGUUC 




TGAACAGC C TCCAGACG 
GAACAGCC T CCAGACGG 
ACAGCCTC C AGACGGTG 




3039 
3050 


CACCGUCU CUGAUGAG X CGAA IAGGCUGU 
ACACCGUC CUGAUGAG X CGAA IGAGGCUG 
GAUGUUGG CUGAUGAG X CGAA ICACACCG 




CAGCCTCC A GACGGTGT 
CGGTGTGC A CCAACATC 
GTGTGCAC C AACATCTA 






3052 


UAGAUGUU CUGAUGAG X CGAA IUGCACAC 
GUAGAUGU CUGAUGAG X CGAA IGUGCACA 




TGTGCACC A ACATCTAC 
GCACCAAC A TCTACAAG 






3056 
3059 
3062 
3068 


CUUGUAGA CUGAUGAG X CGAA IUUGGUGC 
" GAUCUUGU CUGAUGAG X CGAA IAUGUUGG 
GAGGAUCU CUGAUGAG X CGAA IUAGAUGU 
CAGCAGGA CUGAUGAG X CGAA IAUCUUGU 




CCAACATC T ACAAGATC 
ACATCTAC A AGATCCTC 
ACAAGATC C TCCTGCTG 
CAAGATCC T CCTGCTGC 






3069 
3071 
3072 
3075 


GCAGCAGG CUGAUGAG X CGAA IGAUCUUG 
CUGCAGCA CUGAUGAG X CGAA IAGGAUCU 
CCUGCAGC CUGAUGAG X CGAA IGAGGAUC 
ACGCCUGC CUGAUGAG X CGAA I CAGGAGG 




AGATCCTC C TGCTGCAG 
GATCCTCC T GCTGCAGG 
CCTCCTGC T GCAGGCGT 
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UGUACGCC CUGAUGAG X CGAA ICAGCAGG 



GUGAAACC CUGAUGAG X CGAA IUACGCCU 



CACAUGCG CUGAUGAG X CGAA IAAACCUG 



AGCACACA CUGAUGAG X CGAA I CGUGAAA 



GGAGCUGC CUGAUGAG X CGAA ICACACAU 



AUGGGAGC CUGAUGAG X CGAA ICAGCACA 



GAAAUGGG CUGAUGAG X CGAA ICUGCAGC 



AUGAAAUG CUGAUGAG X CGAA IAGCUGCA 



GAUGAAAU CUGAUGAG X CGAA IGAGCUGC 



UGAUGAAA CUGAUGAG X CGAA IGGAGCUG 



CUUGCUGA CUGAUGAG X CGAA IAAAUGGG 



AAACUUGC CUGAUGAG X CGAA IAUGAAAU 



UCCAAACU CUGAUGAG X CGAA ICUGAUGA 



AAAUGUGG CUGAUGAG X CGAA IUUCUUCC 



AAAAUGUG CUGAUGAG X CGAA IGUUCUUC 



AAAAAUGU CUGAUGAG X CGAA IGGUUCUU 



GAAAAAUG CUGAUGAG X CGAA IGGGUUCU 



AGGAAAAA CUGAUGAG X CGAA IUGGGGUU 
GACGCGCA CUGAUGAG X CGAA I A AAAAUG 
UGACGCGC CUGAUGAG X CGAA IGAAAAAU 
GUCAGAGA CUGAUGAG X CGAA IACGCGCA 



! CUGAUGAG 
: CUGAUGAG 



GGAGGCCG CUGAUGAG X CGAA 
CAGAGGGA CUGAUGAG X CGAA I 
AGAGGG CUGAUGAG X CGAA I 
3CAGAG CUGAUGAG X CGAA 
GUAGCAGA CUGAUGAG X CGAA 
AGUAGCAG CUGAUGAG X CGAA 



C CGAA IAUGACGC 
C CGAA IAGAUGAC 
" IUCAGAGA 



; CUGAUGAG 
J CUGAUGAG 



IAGGCCGU 
IGAGGCCG 
IGGAGGCC 
X CGAA IAGGGAGG 
X CGAA ICAGAGGG 



; X CGAA IUAGCAGA 
UUCAGGAU CUGAUGAG X CGAA IAGUAGCA 
UUUCAGGA CUGAUGAG X C GAA IGAGUAGC 
GGCUUUCA CUGAUGAG X CGAA IAUGGAGU 
UGGCUUUC CUGAUGAG X CGAA IGAUGGAG 
ICUUUCAG 
IGCUUUCA 

GACAUCCC CUGAUGAG X CGAA ICGUUCUU 
UGGCCCCC CUGAUGAG X CGAA ICGACAUC 
GCGCCCUU CUGAUGAG X CGAA ICCCCCAG 
GGCGCCCU CUGAUGAG X CGAA IGCCCCCA 
GGGCCGGC CUGAUGAG X CGAA ICGCCCUU 
ICGGCGCC 
ICCGGCGG 



CAGGAUGG CUGAUGAG 1 



GCGUUCUU CUGAUGAG X CGAA 
UGCGUUCU CUGAUGAG X CGAA 



AGAGGGCC CUGAUGAG X CGAA 
GGGCAGAG CUGAUGAG X CGAA 
AGGGCAGA CUGAUGAG X CGAA IGCCGGCG 
GAGGGCAG CUGAUGAG X CGAA IGGCCGGC 



CCTGCTGC A GGCGTACA 



AGGCGTAC A GGTTTCAC 



: A CGCATGTG 



TTTCACGC A TGTGTGCT 



ATGTGTGC T GCAGCTCC 



TGTGCTGC A GCTCCCAT 



GCTGCAGC T CCCATTTC 



TGCAGCTC C CATTTCAT 



GCAGCTCC C ATTTCATC 



CAGCTCCC A TTTCATCA 



CCCATTTC A TCAGCAAG 



ATTTCATC A GCAAGTTT 



TCATCAGC A AGTTTGGA 
GGAAGAAC C CCACATTT 



GAAGAACC C CACATTT' 



AAGAACCC C ACATTTTT 



AGAACCCC A CATTTTTC 



AACCCCAC A TTTTTCCT 
CATTTTTC C TGCGCGTC 
ATTTTTCC T GCGCGTCA 
TGCGCGTC A TCTCTGAC 
GCGTCATC T CTGACACG 
GTCATCTC T GACACGGC 
TCTCTGAC A CGGCCTCC 
GACACGGC C TCCCTCTG 



ACGGCCTC C CTCTGCTA 
CGGCCTCC C TCTGCTAC 
GGCCTCCC T CTGCTACT 
CCTCCCTC T GCTACTCC 
CCCTCTGC T ACTCCATC 
TCTGCTAC T CCATCCTG 
TGCTACTC C ATCCTGAA 
GCTACTCC A TCCTGAAA 
ACTCCATC C TGAAAGCC 
CTCCATCC T GAAAGCCA 
CTGAAAGC C AAGAACGC 
TGAAAGCC A AGAACGCA 
AAGAACGC A GGGATGTC 
GATGTCGC T GGGGGCCA 
CTGGGGGC C AAGGGCGC 
TGGGGGCC A AGGGCGCC 
AAGGGCGC C GCCGGCCC 
GGCGCCGC C GGCCCTCT 
CCGCCGGC C CTCTGCCC 
CGCCGGCC C TCTGCCCT 
GCCGGCCC T CTGCCCTC 
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3249 


CGGAGGGC CUGAUGAG X CGAA IAGGGCCG 




CGGCCCTC T GCCCTCCG 




3252 


CCUCGGAG CUGAUGAG X CGAA I CAGAGGG 




CCCTCTGC C CTCCGAGG 




3253 


GCCUCGGA CUGAUGAG X CGAA IGCAGAGG 




CCTCTGCC C TCCGAGGC 




3254 


GGCCUCGG CUGAUGAG X CGAA IGGCAGAG 




CTCTGCCC T CCGAGGCC 




3256 


ACGGCCUC CUGAUGAG X CGAA IAGGGCAG 




CTGCCCTC C GAGGCCGT 




3262 


CACUGCAC CUGAUGAG X CGAA ICCUCGGA 




TCCGAGGC C GTGCAGTG 




3267 


ACAGCCAC CUGAUGAG X CGAA ICACGGCC 




GGCCGTGC A GTGGCTGT 




3273 


GGUGGCAC CUGAUGAG X CGAA ICCACUGC 




GCAGTGGC T GTGCCACC 




3278 


UGCUUGGU CUGAUGAG X CGAA ICACAGCC 




GGCTGTGC C ACCAAGCA 






"AUGCUUGG CUGAUGAG X CGAA IGCACAGC 




GCTGTGCC A CCAAGCAT 




3281 


GAAUGCUU CUGAUGAG X CGAA IUGGCACA 




TGTGCCAC C AAGCATTC 




3282 


GGAAUGCU CUGAUGAG X CGAA IGUGGCAC 




GTGCCACC A AGCATTCC 




3286 


AGCAGGAA CUGAUGAG X CGAA ICUUGGUG 




CACCAAGC A TTCCTGCT 




3290 


CUUGAGCA CUGAUGAG X CGAA IAAUGCUU 




AAGCATTC C TGCTCAAG 




3291 


GCUUGAGC CUGAUGAG X CGAA IGAAUGCU 




AGCATTCC T GCTCAAGC 




3294 


UCAGCUUG CUGAUGAG X CGAA I CAGGAAU 




ATTCCTGC T CAAGCTGA 




3296 


AGUCAGCU CUGAUGAG X CGAA IAGCAGGA 




TCCTGCTC A AGCTGACT 




3300 


GUCGAGUC CUGAUGAG X CGAA ICUUGAGC 




GCTCAAGC T GACTCGAC 
AAGCTGAC T CGACACCG 




3304 
3309 


"' CGGUGUCG CUGAUGAG X CGAA IUCAGCUU 
" UGACACGG CUGAUGAG X CGAA IUCGAGUC 




GACTCGAC A CCGTGTCA 





3311 


GGUGACAC CUGAUGAG X CGAA IUGUCGAG 




CTCGACAC C GTGTCACC 




3317 


CACGUAGG CUGAUGAG X CGAA IACACGGU 




ACCGTGTC A CCTACGTG 





3319 


GGCACGUA CUGAUGAG X CGAA IUGACACG 




CGTGTCAC C TACGTGCC 




3320 


UGGCACGU CUGAUGAG X CGAA IGUGACAC 




GTGTCACC T ACGTGCCA 




3327 


CCAGGAGU CUGAUGAG X CGAA ICACGUAG 




CTACGTGC C ACTCCTGG 




3328 


CCCAGGAG CUGAUGAG X CGAA IGCACGUA 




TACGTGCC A CTCCTGGG 




3330 


ACCCCAGG CUGAUGAG X CGAA IUGGCACG 




CGTGCCAC T CCTGGGGT 




3332 


UGACCCCA CUGAUGAG X CGAA IAGUGGCA 




TGCCACTC C TGGGGTCA 




3333 


GUGACCCC CUGAUGAG X CGAA IGAGUGGC 




GCCACTCC T GGGGTCAC 




3340 


GUCCUGAG CUGAUGAG X CGAA IACCCCAG 




CTGGGGTC A CTCAGGAC 





3342 


CUGUCCUG CUGAUGAG X CGAA IUGACCCC 




GGGGTCAC T CAGGACAG 






GGCUGUCC CUGAUGAG X CGAA IAGUGACC 




GGTCACTC A GGACAGCC 




3349 


" GUCUGGGC CUGAUGAG X CGAA IUCCUGAG 




CTCAGGAC A GCCCAGAC 




3352 


UGCGUCUG CUGAUGAG X CGAA ICUGUCCU 




AGGACAGC C CAGACGCA 




3353 


CUGCGUCU CUGAUGAG X CGAA IGCUGUCC 




GGACAGCC C AGACGCAG 




3354 


GCUGCGUC CUGAUGAG X CGAA IGGCUGUC 




GACAGCCC A GACGCAGC 




3360 


GACUCAGC CUGAUGAG X CGAA ICGUCUGG 




CCAGACGC A GCTGAGT C 




3363 


UCCGACUC CUGAUGAG X CGAA ICUGCGUC 




GACGCAGC T GAGTCGGA 






UCCCCGGG CUGAUGAG X CGAA ICUUCCGA 




TCGGAAGC T CCCGGGGA 




3377 


" CGUCCCCG CUGAUGAG X CGAA IAGCUUCC 




GGAAGCTC C CGGGGACG 
GAAGCTCC C GGGGACGA 




3378 
3390 


UCGUCCCC CUGAUGAG X CGAA IGAGCUUC 
GGGCAGUC CUGAUGAG X CGAA ICGUCGUC 




GACGACGC T GACTGCCC 




3394 


UCCAGGGC CUGAUGAG X CGAA IUCAGCGU 




ACGCTGAC T GCCCTGGA 




3397 


GCCUCCAG CUGAUGAG X CGAA ICAGUCAG 




CTGACTGC C CTGGAGGC 




3398 


GGCCUCCA CUGAUGAG X CGAA IGCAGUCA 




TGACTGCC C TGGAGGCC 




3399 


' CGGCCUCC CUGAUGAG X CGAA IGGCAGUC 




GACTGCCC T GGAGGCCG 




3406 


" UUGGCUGC CUGAUGAG X CGAA ICCUCCAG 




CTGGAGGC C GCAGCCAA 
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3421 
3423 



GGGUUGGC CUGAUGAG X CGAA ICGGCCUC 
GCCGGGUU CUGAUGAG X CG AA ICUGCGGC 
UGCCGGGU CUGAUGAG X CGAA IGCUGCGG 
CAGUGCCG CUGAUGAG X CGAA IUUGGCUG 
GCAGUGCC CUGAUGAG X C GAA IGUUGGCU 
GAGGGCAG CUGAUS~AG~X~CGAA ICCGGGUU 
CUGAGGGC CUGAUGAG X CGA A IUGCCGGG 
AGUCUGAG CUGAUGAG X CGAA ICAGUGCC 



AAGUCUGA CUGAUGAG X CGAA IGCAGUGC 



GAAGUCUG CUGAUGAG X CGAA IGGCAGUG 



UUGAAGUC CUGAUGAG X CGAA IAGGGCAG 



GGUCUUGA CUGAUGAG X CGAA IUCUGAGG 



GAUGGUCU CUGAUGAG X CGAA IAAGUCUG 



UCCAGGAU CUGAUGAG X CGAA IUCUUGAA 



GUCCAGGA CUGAUGAG X CGAA IGUCUUGA 



UCAGUCCA CUGAUGAG X CGAA IAUGGUCU 



AUCAGUCC CUGAUGAG X CGAA IGAUGGUC 



UGGCCAUC CUGAUGAG X CGAA IUCCAGGA 



GGGCGGGU CUGAUGAG X CGAA ICCAUCAG 



UGGGCGGG CUGAUGAG X CGAA IGCCAUCA 



UGUGGGCG CUGAUGAG X CGAA IUGGCCAU 



CUGUGGGC CUGAUGAG X CGAA IGUGGCCA 



UGGCUGUG CUGAUGAG X CGAA ICGGGUGG 



CCUGGCUG 
" GGCCUGGC 



IGCGGGUG 
CUGAUGAG X CGAA IGGCGGGU 



CUGGCUGU CUGAUGAG X CGAA 3 



CUGAUGAG X CGAA IUGGGCGG 



CUCGGCCU CUGAUGAG X CGAA ICUGUGGG 



UCUCGGCC CUGAUGAG X CGAA IGCUGUGG 



CUGCUCUC CUGAUGAG X CGAA ICCUGGCU 



; CUGAUGAG X CGAA ICUCUCGG 
3 CUGAUGAG 



5 X CGAA IUCUGCUC 



GGGCUGCU CUGAUGAG X CGAA IUGUCUGC 



AGGGCUGC CUGAUGAG X CGAA IGUGUCUG 
CUGAUGAG X CGAA ICUGGUGU 



GACAGGGC 
CGUGACAG 



i X CGAA ICUGCUGG 



CUGAUGAG 

CUGAUGAG X CGAA IGCUGCUG 



GGCGUGAC CUGAUGAG X CGAA I 
GCCCGGCG CUGAUGAG X CGAA I 
X CGAA I 



.' CUGAUGAG 2 
! CUGAUGAG J 



X CGAA 

CCUCCCUG CUGAUGAG X CGAA 



UGGGACGU CUGAUGAG } 



ICCCGGCG 

"Tagcccgg 



CCCUCCCU CUGAUGAG X CGAA 
UCCCUCCC CUGAUGAG X CGAA 



IACGUAGA 
IGACGUAG 



UGGGUGUG CUGAUGAG X CGAA ICCGCCCC 



CUGGGUGU CUGAUGAG X CGAA IGCCGCCC 



CCUGGGUG CUGAUGAG X CGAA IGGCCGCC 



GAGGCCGC A GCCAACCC 
GCCGCAGC C AACCCGGC 
CCGCAGCC A ACCCGGCA 
CAGCCAAC C CGGCACTG 
AGCCAACC C GGCACTGC 
AACCCGGC A CTGCCCTC 
CCCGGCAC T GCCCTCAG 



GGCACTGC C CTCAGACT 



GCACTGCC C TCAGACTT 



CACTGCCC T CAGACTTC 



CTGCCCTC A GACTTCAA 



CCTCAGAC T TCAAGACC 
CAGACTTC A AGACCATC 



TTCAAGAC C ATCCTGGA 



TCAAGACC A TCCTGGAC 



AGACCATC C TGGACTGA 
GACCATCC T GGACTGAT 
TCCTGGAC T GATGGCCA 



CTGATGGC C ACCCGCCC 
TGATGGCC A CCCGCCCA 



ATGGCCAC C CGCCCACA 
TGGCCACC C GCCCACAG 



CCACCCGC C CACAGCCA 



CACCCGCC C ACAGCCAG 



ACCCGCCC A CAGCCAGG 



CCGCCCAC A GCCAGGCC 



CCCACAGC C AGGCCGAG 
CCACAGCC A 
AGCCAGGC C 



GGCCGAGA 
GAGAGCAG 
GACACCAG 



GAGCAGAC A CCAGCAGC 



GCAGACAC C AGCAGCCC 
CAGACACC A GCAGCCCT 



ACACCAGC A GCCCTGTC 



CCAGCAGC C CTGTCACG 



CAGCAGCC C TGTCACGC 



AGCAGCCC T GTCACGCC 



GCCCTGTC A CGCCGGGC 



TGTCACGC C GGGCTCTA 



CGCCGGGC T CTACGTCC 



CCGGGCTC T ACGTCCCA 



TCTACGTC C CAGGGAGG 



CTACGTCC C AGGGAGGG 



TACGTCCC A GGGAGGGA 



GGGGCGGC C CACACCCA 
GGGCGGCC C ACACCCAG 



GGCGGCCC A CACCCAGG 
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GGCCUGGG CUGAUGAG X C GAA IUGGGCCG 
CGGGCCUG CUGAUGAG X CGAA IUGUGGGC 
GCGGGCCU CUGAUGAG X C GAA I GUGUGGG 
UGCGGGCC CUGAUGAG X CGAA IGGUGUGG 



GCGGUGCG 
AGCGGUGC 



CUGAUGAG X CGAA ICCUGGGU 



CUGAUGAG X CGAA IGCCUGGG 



! CUGAUGAG X CGAA ICGGGCCU 



CUCCCAGC CUGAUGAG X CGAA IUGCGGGC 



AGACUCCC CUGAUGAG X CGAA ICGGUGCG 



CAGGCCUC CUGAUGAG X CGAA IACUCCCA 



CGGCCCAC A CCCAGGCC 
GCCCACAC C CAGGCCCG 
CCCACACC C AGGCCCGC 
CCACACCC A GGCCCGCA 



ACCCAGGC C CGCACCGC 



CCCAGGCC C GCACCGCT 



AGGCCCGC A CCGCTGGG 
GCCCGCAC C GCTGGGAG 



CGCACCGC T GGGAGTCT 



TGGGAGTC T GAGGCCTG 



3669 
3672 



CUCACUCA CUGAUGAG X CGAA ICCUCAGA 



TCTGAGGC C TGAGTGAG 



ACUCACUC CUGAUGAG X CGAA IGCCUCAG 



CTGAGGCC T GAGTGAGT 



CAGGCCUC CUGAUGAG X CGAA ICCAAACA 



GACAUGCA CUGAUGAG X CGAA ICCUCGGC 
IGCCUCGG 
ICAGGCCU 
IACAUGCA 



GGACAUGC CUGAUGAG X CGAA 1 



GCCGGACA CUGAUGAG X CGAA ] 



CUUCAGCC CUGAUGAG X CGAA I 



CAGCCUUC CUGAUGAG X CGAA ICCGGACA 



GGACACUC CUGAUGAG X CGAA ICCUUCAG 



CCUCAGCC CUGAUGAG X CGAA IACACUCA 



CAGGCCUC CUGAUGAG 



CUCGCUCA C 
ACUCGCUC C 



X CGAA 1 
X CGAA 1 



: CGAA IGCCUCAG 



3649 | CCUUGGCU CUGAUGAG X CGAA IACACUCG 

3650 I CCCUUGGC CUGAUGAG X CGAA IGACACUC 



CAGCCCUU CUGAUGAG X CGAA ICUGGACA 



UCAGCCCU CUGAUGAG X CGAA IGCUGGAC 



GGACACUC CUGAUGAG X CGAA ICCCUUGG 



GGUGUGCU CUGAUGAG X CGAA IACACUCA 



AGGUGUGC CUGAUGAG X CGAA IGACACUC 



GGCAGGUG CUGAUGAG X CGAA ICUGGACA 



ACGGCAGG CUGAUGAG X CGAA IUGCUGGA 



AGACGGCA CUGAUGAG X CGAA IUGUGCUG 



AAGACGGC CUGAUGAG X CGAA IGUGUGCU 
GUGAAGAC CUGAUGAG X CGAA ICAGGUGU 
GGAAGUGA CUGAUGAG X CGAA IACGGCAG 
IAAGACGG 
IUGAAGAC 
IAAGUGAA 
IGAAGUGA 



UGGGGAAG CUGAUGAG X CGAA 
UGUGGGGA CUGAUGAG X CGAA 
GCCUGUGG CUGAUGAG X CGAA 



J CUGAUGAG 
1 CUGAUGAG 



IGGAAGUG 
IGGGAAGU 

CUGAUGAG X CGAA IUGGGGAA 



CCAGCCUG CUGAUGAG X CGAA 1 



CGCCAGCC 

CGAGCGCC CUGAUGAG X CGAA ICCUGUGG 



UGGAGCCG CUGAUGAG 
UGGGGUGG CUGAUGAG 
CCUGGGGU CUGAUGAG X CGAA IAGCCGAG 



X CGAA 1 
X CGAA i 



GTTTGGC C GAGGCCTG 



GCCGAGGC C TGCATGTC 



CCGAGGCC T GCATGTCC 



AGGCCTGC.A TGTCCGGC 



TGCATGTC C GGCTGAAG 



TGTCCGGC T GAAGGCTG 



CTGAAGGC T GAGTGTCC 



TGAGTGTC C GGCTGAGG 



TGTCCGGC T GAGGCCTG 



GCTGAGGC C TGAGCGAG 



CTGAGGCC T GAGCGAGT 



CGAGTGTC C AGCCAAGG 



GAGTGTCC A GCCAAGGG 



TGTCCAGC C AAGGGCTG 



GTCCAG.CC A AGGGCTGA 



CCAAGGGC T GAGTGTCC 



TGAGTGTC C AGCACACC 



GAGTGTCC A GCACACCT 



TGTCCAGC A CACCTGCC 



TCCAGCAC A CCTGCCGT 



CAGCACAC C TGCCGTCT 



AGCACACC T GCCGTCTT 
ACACCTGC C GTCTTCAC 
CTGCCGTC T TCACTTCC 
CCGTCTTC A CTTCCCCA 
GTCTTCAC T TCCCCACA 
TTCACTTC C CCACAGGC 
TCACTTCC C CACAGGCT 
CACTTCCC C ACAGGCTG 
ACTTCCCC A CAGGCTGG 
TTCCCCAC A GGCTGGCG 
CCACAGGC T GGCGCTCG 
GCTGGCGC T CGGCTCCA 
CGCTCGGC T CCACCCCA 
CTCGGCTC C ACCCCAGG 
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CCCUGGGG CUGAUGAG X CGAA IGAGCCGA 
"GGCCCUGG CUGAUGAG X CGAA IUGGAGCC 



UGGCCCUG CUGAUGAG X CGAA IGUGGAGC 



"CUGGCCCU CUGAUGAG X CGAA IGGUGGAG~ 



GCUGGCCC CUGAUGAG X CGAA IGGGUGGA 



'CAAAAGCU CUGAUGAG X CGAA ICCCUGGC" 



GGAAAAGC CUGAU GAG X CGAA IGCCCUGG 
UGAGGAAA CUGAUGAG X CGAA ICUGGCCC 



CCUGGUGA CUGAUGAG X CGAA IAAAAGCU 



UCCUGGUG CUGAUGAG X CGAA IGAAAAGC 



"GCUCCUGG CUGAUGAG X CGAA IAGGAAAA 



3755 
3757 



"GGGCUCCU CUGAUGAG X CGAA FUGAGGAA 



CGGGCUCC CUGAUGAG X CGAA IGUGAGGA 



GGAAGCCG CUGAUGAG X CGAA ICUCCUGG 



UGGAAGCC CUGAUGAG X CGAA IGCUCCUG 



GGAGUGGA CUGAUGAG X CGAA ICCGGGCU" 



UGGGGAGU CUGAUGAG X CGAA IAAGCCGG 



GUGGGGAG CUGAUGAG X CGAA IGAAGCCG 



AUGUGGGG CUGAUG AG X CGAA I UGGAAGC 
CUAUGUGG CUGAUGAG X CGAA IAGUGGAA 



CCUAUGUG CUGAUGAG X CGAA IGAGUGGA 



UCCUAUGU CUGAUGAG X CGAA IGGAGUGG 



UUCCUAUG CUGAUGAG X CGAA IGGGAGUG 



UAUUCCUA CUGAUGAG X CGAA IUGGGGAG 



CUGGGGAU CUGAUGAG X CGAA IACUAUUC 



UCUGGGGA CUGAUGAG X CGAA IGACUAUU 



GAAUCUGG CUGAUGAG X CGAA IAUGGACU 



CGAAUCUG" CUGAUGAG X CGAA IGAUGGAC 



GCGAAUCU CU GAUGAG X CGAA IGGAUGGA 
GGCGAAUC CUGAUGAG X CGAA IGGGAUGc" 



UGAACAAU CUGA UGAG X CGAA ICGAAUCU 
GUGAACAA CUGAUGAG X CGAA IGCGAAUC 



GCGAGGGG' CUGAUGAG X CGAA IAACAAUG 



GGGCGAGG CUGAUGAG X CGAA IUGAACAA 



AGGGCGAG CUGAUGAG X CGAA IGUGAACA 



CAGGGCGA CUGAUGAG X CGAA IGGUGAAC^ 



GCAGGGCG CUGAUGAG X CGAA IGGGUGAA 



GAGGGCAG CUGAUGAG X CGAA ICGAGGGG 



GGAGGGCA CUGAUGAG X CGAA I GCGAGGG 



AGGAGGGC CUGAUGAG X CGAA IGGCGAGG 



"CAAAGGAG CUGAUGAG X CGAA ICAGGGCG" 



"GCAAAGGA CUGAUGAG X CGAA IGCAGGGC" 



GGCAAAGG CUGAUGAG X CGAA IGGCAGGG 



"aAGGCAAA CUGAUGAG X CGAA IAGGGCAC" 



GAAGGCAA CUGAUGAG X CGAA IGAGGGCA 



"GGGUGGAA CUGAUGAG X CGAA I CAAAGGA 



GGGGUGGA CUGAUGAG X CGAA IGCAAAGG 



TCGGCTCC A CCCCAGGG 



GGCTCCAC C CCAGGGCC 



GCTCCACC C CAGGGCCA 



CTCCACCC C AGGGCCAG 



TCCACCCC A GGGCCAGC 



" CCCAGGGC C AGCTTTTC 



CCAGGGCC A GCTTTTCC 



GGGCCAGC T TTTCCTCA 



AGCTTTTC C TCACCAGG 



GCTTTTCC T CACCAGGA 



TTTTCCTC A CCAGGAGC 



TTCCTCAC C AGGAGCCC 



TCCTCACC A GGAGCCCG 



CCAGGAGC C CGGCTTCC 



CAGGAGCC C GGCTTCCA 



-AGCCCGGC T TCCACTCC 



CCGGCTTC C ACTCCCCA 



CGGCTTCC A CTCCCCAC 



GCTTCCAC T CCCCACAT 



TTCCACTC C CCACATAG 



TCCACTCC C CACATAGG 



CCACTCCC C ACATAGGA 



CACTCCCC A CATAGGAA 



CTCCCCAC A TAGGAATA . 



GAATAGTC C ATCCCCAG 



AATAGTCC A TCCCCAGA 



AGTCCATC C CCAGATTC 



" GTCCATCC C CAGATTCG ~~ 



TCCATCCC C AGATTCGC 



CCATCCCC A GATTCGCC 



AGATTCGC C ATTGTTCA ' 



GATTCGCC A TTGTTCAC 



CATTGTTC A CCCCTCGC 



TTGTTCAC C CCTCGCCC 



TGTTCACC C CTCGCCCT 



GTTCACCC C TCGCCCTG 



TTCACCCC T CGCCCTGC 



CCCCTCGC C CTGCCCTC 



CCCTCGCC C TGCCCTCC 



- CCTCGCCC T GCCCTCCT ~ 



CGCCCTGC C CTCCTTTG 



GCCCTGCC C TCCTTTGC 



CCCTGCCC T CCTTTGCC 



CTGCCCTC C TTTGCCTT 



TGCCCTCC T TTGCCTTC 



TCCTTTGC C TTCCACCC 



?TTGCC T TCCACCCC 
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GUGGGGGU CUGAUGAG X CGAA IAAGGCAA 



GGUGGGGG CUGAUGAG X CGAA IGAAGGCA 



AUGGUGGG CUGAUGAG X CGAA IUGGAAGG 



TTGCCTTC C ACCCCCAC 



TGCCTTCC A CCCCCACC 



CCTTCCAC C CCCACCAT 



GAUGGUGG CUGAUGAG X CGAA IGUGGAAG 



GGAUGGUG CUGAUGAG X CGAA IGGUGGAA 



UGGAUGGU CUGAUGAG X CGAA IGGGUGGA 



CUGGAUGG CUGAUGAG X CGAA IGGGGUGG 



CTTCCACC C CCACCATC 



TTCCACCC C CACCATCC 



TCCACCCC C ACCATCCA 



CCACCCCC A CCATCCAG 



ACCUGGAU CUGAUGAG X CGAA IUGGGGGU 



CACCUGGA CUGAUGAG X CGAA IGUGGGGG 



ACCCCCAC C ATCCAGGT 



CCCCCACC A TCCAGGTG 



CUCCACCU CUGAUGAG X CGAA IAUGGUGG 



UCUCCACC CUGAUGAG X CGAA IGAUGGUG 



CUUCUCAG CUGAUGAG X CGAA IUCUCCAC 



CCUUCUCA CUGAUGAG X CGAA IGUCUCCA 



UCCUUCUC CUGAUGAG X CGAA IGGUCUCC 



GCUCCCAG CUGAUGAG X CGAA IUCCUUCU 



CCACCATC C AGGTGGAG 



CACCATCC A GGTGGAGA 



GTG5AGAC C CTGAGAAG 



TGGAGACC C TGAGAAGG 



GGAGACCC T GAGAAGGA 



AGAAGGAC C CTGGGAGC 



IGUCCUUC 

GAGCUCCC CUGAUGAG X CGAA IGGUCCUU 



AGCUCCCA CUGAUGAG X CGAA ] 



AUUCCCAG CUGAUGAG X CGAA ICUCCCAG 



AAAUUCCC CUGAUGAG X CGAA IAGCUCCC 



ACACCUUU CUGAUGAG X CGAA IUCACUCC 



CACACCUU CUGAUGAG X CGAA IGUCACUC 



GUGUACAG CUGAUGAG X CGAA ICACACCU 



UGUGUACA CUGAUGAG X CGAA ] 
CUGUGUAC CUGAUGAG X CGAA ! 



CUCGCCUG CUGAUGAG X CGAA I UACAGGG 



UCCUCGCC CUGAUGAG X CGAA IUGUACAG 



AGGUGCAG CUGAUGAG X CGAA IUCCUCGC 



CAGGUGCA CUGAUGAG X CGAA IGUCCUCG 



CCAGGUGC CUGAUGAG X CGAA IGGUCCUC 



CAUCCAGG CUGAUGAG X CGAA ICAGGGUC 



CCCAUCCA CUGAUGAG 



: CGAA IUGCAGGG 



CCCCAUCC CUGAUGAG X CGAA IGUGCAGG 



ACCCACAG CUGAUGAG X CGAA IACCCCCA 



GACCCACA CUGAUGAG X CGAA IGACCCCC 



UGACCCAC CUGAUGAG X CGAA IGGACCCC 



GAAGGACC C TGGGAGCT 



AAGGACCC T GGGAGCTC 



CTGGGAGC T CTGGGAAT 



GGGAGCTC T GGGAATTT 



GGAGTGAC C AAAGGTGT 
GAGTGACC A AAGGTGTG 



AGGTGTGC C ( 
GGTGTGCC C 1 



GTGTGCCC T GTACACAG 



CCCTGTAC A CAGGCGAG 



CTGTACAC A GGCGAGGA 



GCGAGGAC C CTGCACCT 



CGAGGACC C TGCACCTG 



GAGGACCC T GCACCTGG 



GACCCTGC A CCTGGATG 



CCCTGCAC C TGGATGGG 



CCTGCACC T GGATGGGG 



TGGGGGTC C CTGTGGGT 



GGGGGTCC C TGTGGGTC 



GGGGTCCC T GTGGGTCA 



CCCCAAUU CUGAUGAG X CGAA IACCCACA 



ACUCCCAC CUGAUGAG X CGAA ICACCUCC 



AUAUAUUC CUGAUGAG X CGAA IUAUUUUA 



UUCAAAAC CUGAUGAG X CGAA IAAAAACU 



TGTGGGTC A AATTGGGG 



GGAGGTGC T GTGGGAGT 



TAAAATAC T GAATATAT 



AGTTTTTC A GTTTTGAA 



Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II 
sequence and length (greater than or equal to 2 base-pairs)). I = Inosine nucleotide 
Seql = TERT (Homo sapiens telomerase reverse transcriptase (TERT) mRNA, 4015 bp); Nakamura el 
a!.. Science 277 (5328), 955-959 (1997) 
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v a < u c 


CGCGG UGAUGGCAUGCACUAUGCGCG AUCGCGGGGG 
GCGCG UGAUGGCAUGCACUAUGCGCG GGCAUCGCGG 
' GAGCG UGAUGGCAUGCACUAUGCGCG GCGGCAUCGC 
GGGAG UGAUGGCAUGCACUAUGCGCG GCGCGGCAUC 
GGCAG UGAUGGCAUGCACUAUGCGCG GGGGAGCGCG 


CUCGG UGAUGGCAUGCACUAUGCGCG AGCUUUUAUU 
" CGGCU UGAUGGCAUGCACUAUGCGCG GGCAGCGGGG 


GAGCG UGAUGGCAUGCACUAUGCGCG ACGUCULbOL 
' GGGAG UGAUGGCAUGCACUAUGCGCG GCACGGCUCG 
' CGCAG UGAUGGCAUGCACUAUGCGCG AGGGAGCGCA 
CUGCG UGAUGGCAUGCACUAUGCGCG AGCAGGGAGC 


GGCUG UGAUGGCAUGCACUAUGCGCG GCAGCAGUUA 
CCUCG UGAUGGCAUGCACUAUGCGCG GGUAGUGGCU 
CACCU UGAUGGCAUGCACUAUGCGCG GCGGUAGUGG 


1 GGCAG UGAUGGCAUGCACUAUGCGCG ACCUCGCGGU 


1 AGCGG UGAUGGCAUGCACUAUGCGCG AUCACLULbL 


GCCAG UGAUGGCAUGCACUAUGCGCG GGCAGCACVU 
CGCCG UGAUGGCAUGCACUAUGCGCG ACGAACGUGG 


CCAGG UGAUGGCAUGCACUAUGCGCG GCCGCACGAA 
CGCUG UGAUGGCAUGCACUAUGCGCG ACCAGCCGCC 


6 8 

II 

8 8 
3 8 

< s 
B B 
6 t 
B £ 

o c 
g 2 

ii 


1 CAGCG UGAUGGCAUGCACUAUGCGCG GCGGAAAGCC 


Seq ID 

Nos 
























Substrate Sequence 

GCUGCGUCCU G CUGCG 
GCGUCCUGCU G CGCAC 
GUCCUGCUGC G CACGU 
GGCCACCCCC G CGAUG 


i 8 B B B 8 
\ 8 8 8 B 8 

3 O C5 O O O 

3 1 8 8 8 8 
B 8 1 B 8 3 

1 1 1 1 1 i 


GCUCCCCGCU G CCGAG 
CCCCGCUGCC G AGCCG 


GCCGAGCCGU G CGCUC 
CGAGCCGUGC G CUCCC 
UGCGCUCCCU G CUGCG 
GCUCCCUGCU G CGCAG 


UCCCUGCUGC G CAGCC 
AGCCACUACC G CGAGG 
CCACUACCGC G AGGUG 


ACCGCGAGGU G CUGCC 


GCGAGGUGCU G CCGCU 


AGGUGCUGCC G CUGGC 
CCACGUUCGU G CGGCG 


UUCGUGCGGC G CCUGG 
GGCGGCUGGU G CAGCG 


CUGGUGCAGC G CGGGG 


3 O 

3 8 

5 B 

J CD 


nt. 
Position 

16 
19 
21 
53 
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" AGAGG UGAUGGCAUGCACUAUGCGCG GCCCAGGAGG 
CAGCA UGAUGGCAUGCACUAUGCGCG AGAGGCGCCC 
CCCAG UGAUGGCAUGCACUAUGCGCG ACAGAGGCGC 
GAUAU UGAUGGCAUGCACUAUGCGCG GUCCAGGCCC 
AC-GUG UGAUGGCAUGCACUAUGCGCG GCCAGGCCCU 
CGCAG UGAUGGCAUGCACUAUGCGCG ACGAAGGUGC 
ACACG UGAUCGCAUGCACUAUGCGCG AGCACGAAGG 
CCGCA UGAUGGCAUGCACUAUGCGCG ACGCAGCACG 
GCCCG UGAUGGCAUGCACUAUGCGCG ACACGCAGCA 
GGCGG UGAUGGCAUGCACUAUGCGCG GGGUCCUGGG 
UCAGG UGAUGGCAUGCACUAUGCGCG GGCGGGUCCU 
CAGCU UGAUGGCAUGCACUAUGCGCG AGGCGGCGGG 
AAGUA UGAUGGCAUGCACUAUGCGCG AGCUCAGGCG 
CUUGA UGAUGGCAUGCACUAUGCGCG AAAGUACAGC 
CGUCA UGAUGGCAUGCACUAUGCGCG AUCCACCUUG 
CCCGU UGAUGGCAUGCACUAUGCGCG ACAUCCACCU 
GUACG UGAUGGCAUGCACUAUGCGCG GCCCGUCACA 
GGUGU UGAUGGCAUGCACUAUGCGCG GUACGCGCCC 
GCUGG UGAUGGCAUGCACUAUGCGCG GAUGACCUCC 


GCACG UGAUGGCAUGCACUAUGCGCG AGUACGUGUU 
CGACG UGAUGGCAUGCACUAUGCGCG ACGCAGUACG 
CACGG UGAUGGCAUGCACUAUGCGCG AUACCGACGC 
AUGGG UGAUGGCAUGCACUAUGCGCG GGCCUUCUGG 
CCUUG UGAUGGCAUGCACUAUGCGCG GGACGUGCCC 
UCUGU UGAUGGCAUGCACUAUGCGCG AAGGUAGAGA 


UGUCG UGAUGGCAUGCACUAUGCGCG AUGUACGGCU 
ACUGU UGAUGGCAUGCACUAUGCGCG GCAUGUACGG 
UCCUG UGAUGGCAUGCACUAUGCGCG AGGUGAGCCA 
CUCAG UGAUGGCAUGCACUAUGCGCG GGGCUGGUCU 


| UCCCU UGAUGGCAUGCACUAUGCGCG AGCGGGCUGG 


1 GACGG UGAUGGCAUGCACUAUGCGCG AUCCCUCAGC 












CCUCCUGGGC G CCUCU 
GGGCGCCUCU G UGCUG 
GCGCCUCUGU G CUGGG 
GGGCCUGGAC G AUAUC 
AGGGCCUGGC G CACCU 
GCACCUUCGU G CUGCG 
CCUUCGUGCU G CGUGU 
CGUGCUGCGU G UGCGG 
UGCUGCGUGU G CGGGC 
CCCAGGACCC G CCGCC 
AGGACCCGCC G CCUGA 
CCCGCCGCCU G AGCUG 
CGCCUGAGCU G UACUU 
GCUGUACUUU G UCAAG 
CAAGGUGGAU G UGACG 
AGGUGGAUGU G ACGGG 
UGUGACGGGC G CGUAC 
GGGCGCGUAC G ACACC 
GGAGGUCAUC G CCAGC 


AACACGUACU G CGUGC 
CGUACUGCGU G CGUCG ■ 
GCGUCGGUAU G CCGUG 
CCAGAAGGCC G CCCAU 
GGGCACGUCC G CAAGG 
UCUCUACCUU G ACAGA 


1 AGCCGUACAU G CGACA 
CCGUACAUGC G ACAGU 
UGGCUCACCU G CAGGA 
AGACCAGCCC G CUGAG 


CCAGCCCGCU G AGGGA 


| GCUGAGGGAU G CCGUC- 
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CUGCU UGAUGGCAUGCACUAUGCGCG GAUGACGACG 1 


UCAUU UGAUGGCAUGCACUAUGCGCG AGGGAUGAGC 


GGCCU UGAUGGCAUGCACUAUGCGCG AUUCAGUbAG 


GACGU UGAUGGCAUGCACUAUGCGCG GAAGAGULCA 
UGAAG UGAUGGCAUGCACUAUGCGCG GUAGGAAGAC 
UGGCA UGAUGGCAUGCACUAUGCGCG AUGAAGSGUA 
GGUGG UGAUGGCAUGCACUAUGCGCG ACAUGAAGCG 


CACGG UGAUGGCAUGCACUAUGCGCG UUUUUliliCAi. 
AUGCG UGAUGGCAUGCACUAUGCGCG ACGGCGUGGU 
UGAUG UGAUGGCAUGCACUAUGCGCG GCACGGCGUG 
CCUGG UGAUGGCAUGCACUAUGCGCG ACUGGACGUA 


CCCUG UGAUGGCAUGCACUAUGCGCG GGGAUCCCCU 


AGCAG UGAUGGCAUGCACUAUGCGCG bUUGAb/u^A 


< P 

< 8 
5 < 

3 < 

8 8 
8 8 

B c 

8 2 

1 1 

u c 
11 

1 t 


UAGCA UGAUGGCAUGCACUAUGCGCG AGGCUGCAGA 


j CGUAG UGAUGGCAUGCACUAUGCGCG ACAGGCUGCA 


| CAUGU UGAUGGCAUGCACUAUGCGCG uccuuauual 


GCAAA UGAUGGCAUGCACUAUGCGCG AUCUUUUUUU 


| CCCCG UGAUGGCAUGCACUAUGCGCG AAACrtW-UUl, 


AGGAG UGAUGGCAUGCACUAUGCGCG AGCCCGUCCC 
AAACG UGAUGGCAUGCACUAUGCGCG AGGAGCAGCC 


GAAAU UGAUGGCAUGCACUAUGCGCG AUCCACCAAA 
ACCAA UGAUGGCAUGCACUAUGCGCG AAGAAAUCAU 


| GGUGU UGAUGGCAUGCACUAUGCGCG ACCAACAAGA 
imnrr, UGAUGCCAUGCACUAUGCGCG GUGGGUGAGG 


GUUUU UGAUGGCAUGCACUAUGCGCG GCGUGGGUGA 
rarrn UGAUGGCAUGCACUAUGCGCG GGACCAGGGU 


j AGGGA UGAUGGCAUGCACUAUGCGCG ACCUCGGACC 


| AUACU UGAUGGCAUGCACUAUGCGCG AGOGAcA<_lU 


| CCACG UGAUGGCAUGCACUAUGCGCG AGCCAUACUC 










































CGUCGUCAUC G AGCAG F 


GCUCCUCCCU G AAUGA 


CUCCCUGAAU G AGGCC 


UGGCCUCUUC G ACGUC 
GUCUUCCUAC G CUUCA 
UACGCUUCAU G UGCCA 
CGCUUCAUGU G CCACC 


GUGCCACCAC G CCGUG 
ACCACGCCGU G CGCAU 
CACGCCGUGC G CAUCA 
UACGUCCAGU G CCAGG 


AGGGGAUCCC G CAGGG 


UCCUCUCCAC G CUGCU 


UCUCCACGCU G CUCUG 


J < 

i 8 

J D 

3 B 
\ \ 

\ I 
< p 


UGCAGCCUGU G CUACG 


i GUGCUACGGC G ACAUG 


AGAACAAGCU G UUUGC 


CAAGCUGUUU G CGGGG 


GGGACGGGCU G CUCCU 
GGCUGCUCCU G CGUUU 


UUUGGUGGAU G A'JUUC 
AUGAUUUCUU G UUGGU 


UCUUGUUGGU G ACACC 


ii! 

5 O 

5 8 
j 6 

< u 

i % 
i B 


GGUCCGAGGU G UCCCU 


AGGUGUCCCU G AGUAU 


| GAGUAUGGCU G CGUGG 
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ACGCG UGAUGGCAUGCACUAUGCGCG AGGAAAAAUG 1 


UGACG UGAUGGCAUGCACUAUGCGCG GCAGGAAAAA 1 


J C 

< e 

1 1 

< < 

8 8 
p > 
'■\ £ 
§ s 
6 g 
1 1 
8 E 
gg 
3 £ 


GCUUU UGAUGGCAUGCACUAUGCGCG AGGAUGGAGU 


CCCUG UGAUGGCAUGCACUAUGCGCG GUUCUUGGCU j 


AGCGA UGAUGGCAUGCACUAUGCGCG AUCCCUGCUU j 


CCCAG UGAUGGCAUGCACUAUGCGCG GACAUCCCUG 


y § 

l 8 

y § 

3 L 
= I 

H ? 

1 I 
8 & 

B r 
CP c 
0 C 


GAGGG UGAUGGCAUGCACUAUGCGCG AGAGGGCCGG 


GGCCU UGAUGGCAUGCACUAUGCGCG GGAGGGCAGA | 


CACUG UGAUGGCAUGCACUAUGCGCG ACGGCCUCGG 


UGGCA UGAUGGCAUGCACUAUGCGCG AGCCACUGCA | 


H 

y k 
< < 

& 5 

3 c 
B ^ 

0 c 

1 1 

B r 

8 ^ 

1 i 


j CGAGU UGAUGGCAUGCACUAUGCGCG AGCUUGAGCA j 


| . GGUGU UGAUGGCAUGCACUAUGCGCG GAGUCAGCUU 


| GGUGA UGAUGGCAUGCACUAUGCGCG ACGGUGUCGA 


| AGUGG UGAUGGCAUGCACUAUGCGCG ACGUAGGUGA 


| AGCUG UGAUGGCAUGCACUAUGCGCG GUCUGGGCUG 


| CGACU UGAUGGCAUGCACUAUGCGCG AGCUGCGUCU 


| AGCGU UGAUGGCAUGCACUAUGCGCG GUCCCCUUUA 


| GUCAG UGAUGGCAUGCACUAUGCGCG GUCGUCCCCG 


| GCAGU UGAUGGCAUGCACUAUGCGCG AGCGUCGUCC 


| CAGGG UGAUGGCAUGCACUAUGCGCG AGUCAGCGUC 


GGCUG UGAUGGCAUGCACUAUGCGCG GGCCUCCAGG 


| GAGGG UGAUGGCAUGCACUAUGCGCG AGUGCCGGGU 


| GCCAU UGAUGGCAUGCACUAUGCGCG AGUCCAGGAU 


| GUGGG UGAUGGCAUGCACUAUGCGCG GGGUGGCCAU 


| GCUCU UGAUGGCAUGCACUAUGCGCG GGCCUGGCUG 


























































CAUUUUUCCU G CGCGU 1 


UUUUUCCUGC G CGUCA 


CGUCAUCUCU G ACACG 


ACUCCAUCCU G AAAGC 


AGCCAAGAAC G CAGGG 


ACGCAGGGAU G UCGCU 


CAGGGAUGUC G CUGGG 


GGCCAAGGGC G CCGCC 


CCGGCCCUCU G CCCUC 


UCUGCCCUCC G AGGCC 


CCGAGGCCGU G CAGUG 


UGCAGUGGCU G UGCCA 


CAGUGGCUGU G CCACC 


UGCUCAAGCU G ACUCG ! 


AAGCUGACUC G ACACC 


1 UCGACACCGU G UCACC 


UCACCUACGU G CCACU 


CAGCCCAGAC G CAGCU 


AGACGCAGCU G AGUCG 


UCCCGGGGAC G ACGCU 


CGGGGACGAC G CUGAC 


GGACGACGCU G ACUGC 


GACGCUGACU G CCCUG 


CCUGGAGGCC G CAGCC 


ACCCGGCACU G CCCUC 


AUCCUGGACU G AUGGC 


AUGGCCACCC G CCCAC 


j CAGCCAGGCC G AGAGC 
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GUGUA UGAUGGCAUGCACUAUGCGCG AGGGCACACC 
GUCCU UGAUGGCAUGCACUAUGCGCG GCCUGUGUAC 


AGGUG UGAUGGCAUGCACUAUGCGCG AGGGUCCUCG I 


ACCCA UGAUGGCAUGCACUAUGCGCG AGGGACCJCUC | 


CACAG UGAUGGCAUGCACUAUGCGCG ACCUCCCCUU I 


i i 

J g 

6 | 

* i 
1 I 

6 t 
| s 
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1 !! 
§ 1 


AAACU UGAUGGCAUGCACUAUGCGCG AUAUAUUCAG 


| UUUUU UGAUGGCAUGCACUAUGCGCG AAAACUGAAA 
















GGUGUGCCCU G UACAC 
GUACACAGGC G AGGAC 


CGAGGACCCU G CACCU 


GGGGGUCCCU G UGGGU 


, GGGGGGAGGU G CUGUG | 


1 GGGAGGUGCU G UGGGA 


CUGAAUAUAU G AGUUU 


| UUUCAGUUUU G AAAAA 
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Seq. ID Nos 






















































Substrate 


GCAGCGCT G GCGTCCTG 


AGCGCTGC G GTCCTGCT 


TGCGTCCT G GCTGCGCA 


GTCCTGCT G GCGCACGT 


CCTGCTGC G GCACGTGG 


TGCTGCGC A ACGTGGGA 


CTGCGCAC G GTGGGAAG 


CGTGGGAA G GCCCTGGC 


AAGCCCTG G GCCCCGGC 


TGGCCCCG G GCCACCCC 


CCCCGGCC A ACCCCCGC I 


CCACCCCC G GCGATGCC 1 


CCCCCGCG A ATGCCGCG 


j CCCGCGAT G GCCGCGCG j 


GCGATGCC G GCGCGCTC 


j GATGCCGC G GCGCTCCC 


TGCCGCGC G GCTCCCCG 


j TCCCCGCT G GCCGAGCC 


| GCTGCCGA G GCCGTGCG 


1 GCCGAGCC G GTGCGCTC 


| CGAGCCGT G GCGCTCCC 


AGCCGTGC G GCTCCCTG 


< 

J 
H 

H 
S 


| TCCCTGCT G GCGCAGCC 


| CCTGCTGC G GCAGCCAC 


| GCTGCGCA G GCCACTAC 


Seq. ID 
Nos 






















































DNAzyme Sequence 


CAGGACGC GGCTAGCTACAACGA AGCGCTGC 1 


AGCAGGAC GGCTAGCTACAACGA GCAGCGCT; 1 


TGCGCAGC GGCTAGCTACAACGA AGGACGCA 1 


ACGTGCGC GGCTAGCTACAACGA AGCAGGAC 1 


CCACGTGC GGCTAGCTACAACGA GCAGCAGG 1 


TCCCACGT GGCTAGCTACAACGA GCGCAGCA | 


CTTCCCAC GGCTAGCTACAACGA GTGCGCAG 1 


GCCAGGGC GGCTAGCTACAACGA TTCCCACG | 


GCCGGGGC GGCTAGCTACAACGA CAGGGCTT 1 


GGGGTGGC GGCTAGCTACAACGA CGGGGCCA 1 


GCGGGGGT GGCTAGCTACAACGA GGCCGGGG I 


GGCATCGC GGCTAGCTACAACGA GGGGGTGG J 


| CGCGGCAT GGCTAGCTACAACGA CGCGGGGG 1 


CGCGCGGC GGCTAGCTACAACGA ATCGCGGG ! 


GAGCGCGC GGCTAGCTACAACGA GGCATCGC 


GGGAGCGC GGCTAGCTACAACGA GCGGCATC 


CGGGGAGC GGCTAGCTACAACGA GCGCGGCA 


j GGCTCGGC GGCTAGCTACAACGA AGCGGGGA 


CGCACGGC GGCTAGCTACAACGA TCGGCAGC 


GAGCGCAC GGCTAGCTACAACGA GGCTCGGC 


GGGAGCGC GGCTAGCTACAACGA ACGGCTCG 


j CAGGGAGC GGCTAGCTACAACGA GCACGGCT 


| TGCGCAGC GGCTAGCTACAACGA AGGGAGCG 


GGCTGCGC GGCTAGCTACAACGA AGCAGGGA 


GTGGCTGC GGCTAGCTACAACGA GCAGCAGG 


j GTAGTGGC GGCTAGCTACAACGA TGCGCAGC 


nt. 
Position 
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GGCCCAGT G GCCTGGTG 1 


AGTGCCTG G GTGTGCGT 


TGCCTGGT G GTGCGTGC I 


CCTGGTGT G GCGTGCCC 


TGGTGTGC G GTGCCCTG 


GTGTGCGT G GCCCTGGG 


GCCCTGGG A ACGCACGG 


CCTGGGAC G GCACGGCC 


TGGGACGC A ACGGCCGC 


GACGCACG G GCCGCCCC 


O 

1 

a 


CGCCCCCC G GCCGCCCC 


CCCCCGCC G GCCCCCTC 1 


CTCCTTCC G GCCAGGTG 


TCCGCCAG G GTGTCCTG 


CGCCAGGT G GTCCTGCC 1 


GGTGTCCT G GCCTGAAG 


CTGAAGGA G GCTGGTGG 


AGGAGCTG G GTGGCCCG 


AGCTGGTG G GCCCGAGT 


I 
I 

< 

8 
S 

H 


O 
CP 

3 

u 


j CGAGTGCT G GCAGAGGC 


| CTGCAGAG G GCTGTGCG 


| CAGAGGCT G GTGCGAGC 


j GAGGCTGT G GCGAGCGC 


1 CTGTGCGA G GCGCGGCG 


< % 

8 c 

0 c 

8 b 

1 I 


j CGCGAAGA A ACGTGCTG 




























































CACCAGGC GGCTAGCTACAACGA ACTGGGCC | 


ACGCACAC GGCTAGCTACAACGA CAGGCACT 


GCACGCAC GGCTAGCTACAACGA ACCAGGCA | 


GGGCACGC GGCTAGCTACAACGA ACACCAGG 1 


CAGGGCAC GGCTAGCTACAACGA GCACACCA | 


CCCAGGGC GGCTAGCTACAACGA ACGCACAC | 


CCGTGCGT GGCTAGCTACAACGA CCCAGGGC 1 


GGCCGTGC GGCTAGCTACAACGA GTCCCAGG 1 


GCGGCCGT GGCTAGCTACAACGA GCGTCCCA 1 


GGGGCGGC GGCTAGCTACAACGA CGTGCGTC 


CGGGGGGC GGCTAGCTACAACGA GGCCGTGC | 


GGGGCGGC GGCTAGCTACAACGA GGGGGGCG 1 


GAGGGGGC GGCTAGCTACAACGA GGCGGGGG 


CACCTGGC GGCTAGCTACAACGA GGAAGGAG | 


CAGGACAC GGCTAGCTACAACGA CTGGCGGA | 


GGCAGGAC GGCTAGCTACAACGA ACCTGGCG | 


CTTCAGGC GGCTAGCTACAACGA AGGACACC I 


CCACCAGC GGCTAGCTACAACGA TCCTTCAG 


| CGGGCCAC GGCTAGCTACAACGA CAGCTCCT 


j ACTCGGGC GGCTAGCTACAACGA CACCAGCT 


1 TGCAGCAC GGCTAGCTACAACGA TCGGGCCA 


TCTGCAGC GGCTAGCTACAACGA ACTCGGGC 


GCCTCTGC GGCTAGCTACAACGA AGCACTCG 


CGCACAGC GGCTAGCTACAACGA CTCTGCAG 


| GCTCGCAC GGCTAGCTACAACGA AGCCTCTG 


GCGCTCGC GGCTAGCTACAACGA ACAGCCTC 


CGCCGCGC GGCTAGCTACAACGA TCGCACAG 


CTTCGCGC GGCTAGCTACAACGA CGCGCTCG 


| CAGCACGT GGCTAGCTACAACGA TCTTCGCG 
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CTCCAGGC A ACAACGAA 


CAGGCACA A ACGAACGC 


CACAACGA A ACGCCGCT 


CAACGAAC G GCCGCTTC 


CGAACGCC G GCTTCCTC 
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< 

< 

H 


< 
< 
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u 

3 

'1 

u 
< 


AGAAGTTC A ATCTCCCT 


CTGGGGAA G GCATGCCA 


GGGGAAGC A ATGCCAAG 


j 

1 


CATGCCAA G GCTCTCGC 1 


AAGCTCTC G GCTGCAGG 


1 
a 


CTGCAGGA G GCTGACGT 


AGGAGCTG A ACGTGGAA 


GAGCTGAC G GTGGAAGA ] 


< 
< 

1 
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a 
u 

u 

Eh 
< 
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AGATGAGC G GTGCGGGA | 


a 
< 


CGTGCGGG A ACTGCGCT | 


GCGGGACT G GCGCTTGG 


Eh 
Eh 

o 
y 

Eh 

< 
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8 
U 
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GCTTGGCT G GCGCAGGA [ 


■5 

a 
u 
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GCGCAGGA G GCCCAGGG 


GCCCAGGG G GTTGGCTG 
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H 
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TTCGTTGT GGCTAGCTACAACGA GCCTGGAG | 


GCGTTCGT GGCTAGCTACAACGA TGTGCCTG 


AGCGGCGT GGCTAGCTACAACGA TCGTTGTG 


GAAGCGGC GGCTAGCTACAACGA GTTCGTTG J 


GAGGAAGC GGCTAGCTACAACGA GGCGTTCG j 


CTTGGTGT GGCTAGCTACAACGA TCCTGAGG I 


TTCTTGGT GGCTAGCTACAACGA GTTCCTGA 


AGATGAAC GGCTAGCTACAACGA TTCTTGGT 1 


H 
H 

< 

1 

6 
| 

H 
<C 
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TGGCATGC GGCTAGCTACAACGA TTCCCCAG 1 


CTTGGCAT GGCTAGCTACAACGA GCTTCCCC j 
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CCTGCAGC GGCTAGCTACAACGA GAGAGCTT 
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u 
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Eh 


ACGTCAGC GGCTAGCTACAACGA TCCTGCAG 


TTCCACGT GGCTAGCTACAACGA CAGCTCCT 


TCTTCCAC GGCTAGCTACAACGA GTCAGCTC 


ACGCTCAT GGCTAGCTACAACGA CTTCCACG 


CCGCACGC GGCTAGCTACAACGA TCATCTTC 
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Eh 
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o 

Eh 
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1 

o 
u 
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AGCGCAGT GGCTAGCTACAACGA CCCGCACG 1 
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H 
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| 
< 

< 

H 
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AGCCAAGC GGCTAGCTACAACGA GCAGTCCC 


TGCGCAGC GGCTAGCTACAACGA CAAGCGCA j 


TCCTGCGC GGCTAGCTACAACGA AGCCAAGC 


g 
§ 
g 

9 
h 
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CCCTGGGC GGCTAGCTACAACGA TCCTGCGC 


CAGCCAAC GGCTAGCTACAACGA CCCTGGGC 


< 
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CCGCCTGA G GCTGTACT 1 


CCTGAGCT G GTACTTTG 


TGAGCTGT A ACTTTGTC 


TGTACTTT G GTCAAGGT 


TTGTCAAG G GTGGATGT 


CAAGGTGG A ATGTGACG 


AGGTGGAT G GTGACGGG 


TGGATGTG A ACGGGCGC 


TGTGACGG G GCGCGTAC 1 


TGACGGGC G GCGTACGA 


ACGGGCGC G GTACGACA 


GGGCGCGT A.ACGACACC 


CGCGTACG A ACACCATC 


CGTACGAC A ACCATCCC 


ACGACACC A ATCCCCCA 


CCCCCAGG A ACAGGCTC 


CAGGACAG G GCTCACGG 


ACAGGCTC A ACGGAGGT 


TCACGGAG G GTCATCGC 


CGGAGGTC A ATCGCCAG 


AGGTCATC G GCCAGCAT 


CATCGCCA G GCATCATC 


TCGCCAGC A ATCATCAA 


CCAGCATC A ATCAAACC 


ATCATCAA A ACCCCAGA 


ACCCCAGA A ACACGTAC 


CCCAGAAC A ACGTACTG 


CAGAACAC G GTACTGCG 


GAACACGT A ACTGCGTG 


CACGTACT G GCGTGCGT 


CGTACTGC G GTGCGTCG 
































































AGTACAGC GGCTAGCTACAACGA TCAGGCGG 1 


CAAAGTAC GGCTAGCTACAACGA AGCTCAGG 1 


GACAAAGT GGCTAGCTACAACGA ACAGCTCA 1 


ACCTTGAC GGCTAGCTACAACGA AAAGTACA 1 


ACATCCAC GGCTAGCTACAACGA CTTGACAA 1 


CGTCACAT GGCTAGCTACAACGA CCACCTTG 1 


CCCGTCAC GGCTAGCTACAACGA ATCCACCT I 


GCGCCCGT GGCTAGCTACAACGA CACATCCA 1 


GTACGCGC GGCTAGCTACAACGA CCGTCACA 1 


TCGTACGC GGCTAGCTACAACGA GCCCGTCA j 


• TGTCGTAC GGCTAGCTACAACGA GCGCCCGT 1 


GGTGTCGT GGCTAGCTACAACGA ACGCGCCC 1 


GATGGTGT GGCTAGCTACAACGA CGTACGCG | 


GGGATGGT GGCTAGCTACAACGA GTCGTACG | 


TGGGGGAT GGCTAGCTACAACGA GGTGTCGT 1 


GAGCCTGT GGCTAGCTACAACGA CCTGGGGG 


CCGTGAGC GGCTAGCTACAACGA CTGTCCTG 


ACCTCCGT GGCTAGCTACAACGA GAGCCTGT 


GCGATGAC GGCTAGCTACAACGA CTCCGTGA 


CTGGCGAT GGCTAGCTACAACGA GACCTCCG 


ATGCTGGC GGCTAGCTACAACGA GATGACCT 


GATGATGC GGCTAGCTACAACGA TGGCGATG 


TTGATGAT GGCTAGCTACAACGA GCTGGCGA 


GGTTTGAT GGCTAGCTACAACGA GATGCTGG 


TCTGGGGT GGCTAGCTACAACGA TTGATGAT 


GTACGTGT GGCTAGCTACAACGA TCTGGGGT 


CAGTACGT GGCTAGCTACAACGA GTTCTGGG 


CGCAGTAC GGCTAGCTACAACGA GTGTTCTG 


CACGCAGT GGCTAGCTACAACGA ACGTGTTC 


ACGCACGC GGCTAGCTACAACGA AGTACGTG 


CGACGCAC GGCTAGCTACAACGA GCAGTACG 
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CCAGTCTC A ACCTTCAA | 


< 

Eh 


p 

1 

r: 


CAACCGCG G GCTTCAAG | 


GCTTCAAG G GCTGGGAG | 


< 
1 

Eh 


GGAGGAAC A ATGCGTCG 


AG3AACAT G GCGTCGCA 


GAACATGC G GTCGCAAA 


CATGCGTC G GCAAACTC 


CGTCGCAA A ACTCTTTG 


3 

8 
1 

H 
Eh 


GGGGTCTT G GCGGCTGA | 


1 

u 

U 

u 
o 


CGGCTGAA G GTGTCACA | 


i t 

o t- 


\ 

g 

Eh 
O 


CACAGCCT G GTTTCTGG 


GTTTCTGG A ATTTGCAG 


CTGGATTT G GCAGGTGA 


Eh 


| GCAGGTGA A ACAGCCTC 


l GCCTCCAG A ACGGTGTG 


| TCCAGACG G GTGTGCAC 


| CAGACGGT G GTGCACCA 


1 GACGGTGT G GCACCAAC 


| CGGTGTGC A ACCAACAT 


| GTGCACCA A ACATCTAC 


| 

\ 

l 
< 
O 




























































TTGAAGGT GGCTAGCTACAACGA GAGACTGG 


GCCGCGGT GGCTAGCTACAACGA TGAAGGTG 


GAAGCCGC GGCTAGCTACAACGA GGTTGAAG 


CTTGAAGC GGCTAGCTACAACGA CGCGGTTG J 


CTCCCAGC GGCTAGCTACAACGA CTTGAAGC 


ACGCATGT GGCTAGCTACAACGA TCCTCCCA | 


CGACGCAT GGCTAGCTACAACGA GTTCCTCC 1 


TGCGACGC GGCTAGCTACAACGA ATGTTCCT 


TTTGCGAC GGCTAGCTACAACGA GCATGTTC 


GAGTTTGC GGCTAGCTACAACGA GACGCATG | 


CAAAGAGT GGCTAGCTACAACGA TTGCGACG 


CGCAAGAC GGCTAGCTACAACGA CCCAAAGA 


TCAGCCGC GGCTAGCTACAACGA AAGACCCC 


I 

u 

Eh 

i 

o 
u 
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u 
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TGTGACAC GGCTAGCTACAACGA TTCAGCCG 


GCTGTGAC GGCTAGCTACAACGA ACTTCAGC 


l 
Eh 

< 
< 

u 
u 

i 


CCAGAAAC GGCTAGCTACAACGA AGGCTGTG 


CTGCAAAT GGCTAGCTACAACGA CCAGAAAC 


TCACCTGC GGCTAGCTACAACGA AAATCCAG 
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1 
1 

U 

o 
u 

Eh 

o 
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GAGGCTGT GGCTAGCTACAACGA TCACCTGC 


CACACCGT GGCTAGCTACAACGA CTGGAGGC 


GTGCACAC GGCTAGCTACAACGA CGTCTGGA 
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< 
u 
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U 
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GTAGATGT GGCTAGCTACAACGA TGGTGCAC 


TTGTAGAT GGCTAGCTACAACGA GTTGGTGC 
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Table 17 



Table 17: Anti-TERT HH and G-Cleaver Ribozymes 



Alias 


Ribozyme Sequence 


Length 
(nt) 


HH 






TERT- 1051 


AGGAGUA CUGAUGAGGCCGUUAGGCCGAA AGGAAGU 


36 


TERT-1053 


UGAGGAG CUGAUGAGGCCGUUAGGCCGAA AGAGGAA 


36 


TERT-1918 


UGAAGCG CUGAUGAGGCCGUUAGGCCGAA AGUCUGG 


36 


TERT-2383 


GAGCCAC CUGAUGAGGCCGUUAGGCCGAA AACUGUC 


36 


TERT-2485 


UGAAGCG CUGAUGAGGCCGUUAGGCCGAA AGGAAGA 


36 


TERT-2566 


GCGUGGA CUGAUGAGGCCGUUAGGCCGAA AGGAUGG 


36 


TERT-3181 


AGUAGCA CUGAUGAGGCCGUUAGGCCGAA AGGGAGG 


36 


TERT-3691 


CUGUGGG CUGAUGAGGCCGUUAGGCCGAA AAGUGAA 


36 


TERT-3758 


AUGUGGG CUGAUGAGGCCGUUAGGCCGAA AGUGGAA 


36 


TERT-3794 


GGUGAAC CUGAUGAGGCCGUUAGGCCGAA AUGGCGA 


36 


G-Cleaver 






TERT-757 


UUGGG UGAUGGCAUGCACUAUGCGCG AACGGCAGAC 


36 


TERT-2353 


UCUGU UGAUGGCAUGCACUAUGCGCG AAGGUAGAGA 


36 


TERT-3795 


GUGAA UGAUGGCAUGCACUAUGCGCG AAUGGCGAAU 


36 
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Table 18 

Table 18: Human BACE Hammerhead Ribozyme and Target Sequence 



CCACGCGU C CGCAGCCC 



GGGCUGCG CUGAUGAG X CGAA ACGCGUGG 



AGCUGGAU U AUGGUGGC 



GCCACCAU CUGAUGAG X CGAA AUCCAGCU 



GCUGGAUU A UGGUGGCC 



GGCCACCA CUGAUGAG X CGAA AAUCCAGC 



GGAGCCCU U GCCCCUGC 



GCAGGGGC CUGAUGAG X CGAA AGGGCUCC 



CCGCCCCU C CCAGCCCC 



GGGGCUGG CUGAUGAG X CGAA AGGGGCGG 



GCCGAUGU A GCGGGCUC 



GAGCCCGC CUGAUGAG X CGAA ACAUCGGC 



AGCGGGCU C CGGAUCCC 



CUCCGGAU C CCAGCCUC 



CCCAGCCU C UCCCCUGC 



CAGCCUCU C CCCUGCUC 



CCCCUGCU C CCGUGCUC 



CCCGUGCU C UGCGGAUC 



CUGCGGAU C UCCCCUGA 



GGGAUCCG CUGAUGAG X CGAA AGCCCGCU 



GAGGCUGG CUGAUGAG X CGAA AUCCGGAG 



GCAGGGGA CUGAUGAG X CGAA AGGCUGGG 



GAGCAGGG CUGAUGAG X CGAA AGAGGCUG 



GAGCACGG CUGAUGAG X CGAA AGCAGGGG 



GAUCCGCA CUGAUGAG X CGAA AGCACGGG 



UCAGGGGA CUGAUGAG X CGAA AUCCGCAG 



GCGGAUCU C CCCUGACC 



GGUCAGGG CUGAUGAG X CGAA AGAUCCGC 



UGACCGCU C UCCACAGC 



GCUGUGGA CUGAUGAG X CGAA AGCGGUCA 



ACCGCUCU C CACAGCCC 



GGGCUGUG CUGAUGAG X CGAA AGAGCGGU 



CCUGGCGU C CUGAUGCC 



GGCAUCAG CUGAUGAG X CGAA ACGCCAGG 



CCCAAGCU C CCUCUCCU 



AGGAGAGG CUGAUGAG X CGAA AGCUUGGG 



AGCUCCCU C UCCUGAGA 



UCUCAGGA CUGAUGAG X CGAA AGGGAGCU 



CUCCCUCU C CUGAGAAG 



! CUGAUGAG X CGAA AGAGGGAG 



CCCAGACU U GGGGGCAG 



CUGCCCCC CUGAUGAG X CGAA AGUCUGGG 



CCCUGGCU C CUGCUGUG 



CACGGCAU C CGGCUGCC 



GGGCAGCU U UGUGGAGA 



GGCAGCUU U GUGGAGAU 



CACAGCAG CUGAUGAG X CGAA AGCCAGGG 



GGCAGCCG CUGAUGAG X CGAA AUGCCGUG 



OCUC hCR I ( IGAG X CGAA AGCUGCCC 



AUCUCCAC CUGAUGAG X CGAA AAGCUGCC 



CCUGCCCC CUGAUGAG X CGAA ACUUGCCC~ 



1798 
1799 



GGGCAAGU C GGGGCAGG 



GCAGGGCU A CUACGUGG 



CCACGUAG CUGAUGAG X CGAA AGCCCUGC 



GGGCUACU A CGUGGAGA 



UCUCCACG CUGAUGAG X CGAA AGUAGCCC 



CAGACGCU C AACAUCCU 



AGGAUGUU CUGAUGAG X CGAA AGCGUCUG~~ 



CUCAACAU C CUGGUGGA 



UCCACCAG CUGAUGAG X CGAA AUGUUGAG 



UGGUGGAU A CAGGCAGC 



GCUGCCUG CUGAUGAG X CGAA AUCCACCA~ 



GCAGCAGU A ACUUUGCA 



UGCAAAGU CUGAUGAG X CGAA ACUGCUGC 



CAGUAACU U UGCAGUGG 



CCACUGCA CUGAUGAG X CGAA AGUUACUG 



1805 
1806 
1907 



AGUAACUU U GCAGUGGG 



CCACCCCU U CCUGCAUC 



'CACCCCUU C CUGCAUCG 



UCCUGCAU C GCUACUAC 



GCAUCGCU A CUACCAGA 



UCGCUACU A CCAGAGGC 



GCAGCUGU C CAGCACAU 



CAGCACAU A CCGGGACC 



CGGGACCU C CGGAAGGG 
GGGUGUGU A UGUGCCCU 



CCCACUGC CUGAUGAG X CGAA AAGUUACU 



i CUGAUGAG X CGAA AGGGGUGG 



i CUGAUGAG X CGAA AAGGGGUG 



: CUGAUGAG X CGAA AUGCAGGA 



UCUGGUAG CUGAUGAG X CGAA AGCGAUGC 



GCCUCUGG CUGAUGAG X CGAA AGUAGCGA 



AUGUGCUG CUGAUGAG X CGAA ACAGCUGC~ 



GGUCCCGG CUGAUGAG X CGAA AUGUGCUG 



CCCUUCCG CUGAUGAG X CGAA AGGUCCCG~ 



AGGGCACA CUGAUGAG X CGAA ACACACCC~ 



CCUGGGUG CUGAUGAG X CGAA AGGGCACA 



UGUGCCCU A CACCCAGG 



GACCUGGU A AGCAUCCC 



GGGAUGCU CUGAUGAG X 



CGAA ACCAGGUC 



GUAAGCAU C CCCCAUGG 



CCAUGGGG CUGAUGAG X 



CGAA AUGCUUAC 



1821 
1822 



CCCAACGU C ACUGUGCG 



CGCACAGU CUGAUGAG X CGAA ACGUUGGG~ 
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GCCAACAU U GCUGCCAU 
GCUGCCAU C ACUGAAUC 
CACUGAAU C AGACAAGU 
AGACAAGU U CUUCAUCA 
GACAAGUU C UUCAUCAA 
CAAGUUCU U CAUCAACG 
AAGUUCUU C AUCAACGG 
UUCUUCAU C AACGGCUC 
C CAACUGGG 



AUGGCAGC CUGAUGAG X CGAA AUGUUGGC 
GAUUCAGU CUGAUGAG X CGAA AUGGCAGC 
ACUUGUCU CUGAUGAG X CGAA AUUCAGUG 
UGAUGAAG CUGAUGAG X CGAA ACUUGUCU 
UUGAUGAA CUGAUGAG X CGAA AACUUGUC 
CGUUGAUG CUGAUGAG X CGAA AGAACUUG 
CCGUUGAU CUGAUGAG X CGAA AAGAACUU 
GAGCCGUU CUGAUGAG X CGAA AUGAAGAA 
CCCAGUUG CUGAUGAG X CGAA AGCCGUUG 



GAAGGCAU C CUGGGGCU 



AGCCCCAG CUGAUGAG X CGAA AUGCCUUC 



1004 
1014 
1031 
1042 
1043 



GCUGGCCU A UGCUGAGA 



UCUCAGCA CUGAUGAG X CGAA AGGCCAGC 



GCUGAGAU U GCCAGGCC 



GGCCUGGC CUGAUGAG X CGAA AUCUCAGC 



UGACGACU C CCUGGAGC 



GCUCCAGG CUGAUGAG X CGAA AGUCGUCA 



UGGAGCCU U UCUUUGAC 



GUCAAAGA CUGAUGAG X CGAA AGGCUCCA 



GGAGCCUU U CUUUGACU 



AGUCAAAG CUGAUGAG X CGAA AAGGCUCC 



GAGCCUUU C UUUGACUC 



GAGUCAAA CUGAUGAG X CGAA AAAGGCUC 



GCCUUUCU U UGACUCUC 



GAGAGUCA CUGAUGAG X CGAA AGAAAGGC 



CCUUUCUU U GACUCUCU 



AGAGAGUC CUGAUGAG X CGAA AAGAAAGG 



CUUUGACU C UCUGGUAA 



UUACCAGA CUGAUGAG X CGAA AGUCAAAG 



UUGACUCU C UGGUAAAG 



CUUUACCA CUGAUGAG X CGAA AGAGUCAA 



1059 
1074 



UCUCUGGU A AAGCAGAC 



GUCUGCUU CUGAUGAG X CGAA ACCAGAGA 



ACCCACGU U CCCAACCU 



AGGUUGGG CUGAUGAG X CGAA ACGUGGGU 



CCCACGUU C CCAACCUC 



GAGGUUGG CUGAUGAG X CGAA AACGUGGG 



CCCAACCU C UUCUCCCU 



AGGGAGAA CUGAUGAG X CGAA AGGUUGGG 



CAACCUCU U CUCCCUGC 



GCAGGGAG CUGAUGAG X CGAA AGAGGUUG 



AACCUCUU C UCCCUGCA 



UGCAGGGA CUGAUGAG X CGAA AAGAGGUU 



1088 
1098 
1099 



CCUCUUCU C CCUGCAGC 



GCUGCAGG CUGAUGAG X CGAA AGAAGAGG 



CUGCAGCU U UGUGGUGC 



GCACCACA CUGAUGAG X CGAA AGCUGCAG 



UGCAGCUU U GUGGUGCU 



AGCACCAC CUGAUGAG X CGAA AAGCUGCA 



UGCUGGCU U CCCCCUCA 



UGAGGGGG CUGAUGAG X CGAA AGCCAGCA 



GCUGGCUU C CCCCUCAA 



UUGAGGGG CUGAUGAG X CGAA AAGCCAGC 



1142 
1146 
1161 



UUCCCCCU C AACCAGUC 



GACUGGUU CUGAUGAG X CGAA AGGGGGAA 



CAACCAGU C UGAAGUGC 



GCACUUCA CUGAUGAG X CGAA ACUGGUUG 



GCUGGCCU C UGUCGGAG 



CUCCGACA CUGAUGAG X CGAA AGGCCAGC 



GCCUCUGU C GGAGGGAG 



CUCCCUCC CUGAUGAG X CGAA ACAGAGGC 



AGCAUGAU C AUUGGAGG 



CCUCCAAU CUGAUGAG X CGAA AUCAUGCU 



AUGAUCAU U GGAGGUAU 



AUACCUCC CUGAUGAG X CGAA AUGAUCAU 



1171 

1173 



UUGGAGGU A UCGACCAC 



GUGGUCGA CUGAUGAG X CGAA ACCUCCAA 



1200 
1205 



1231 
1232 
1242 
1245 



GGAGGUAU C GACCACUC 



CGACCACU C 
CUCGCUGU A CACAGGCA 
""AC'icAGU C UCUGGUAU 
GGCAGUCU C UGGUAUAC 
UCUCUGGU A UACACCCA 
UCUGGUAU A CACCCAUC 
ACACCCAU C CGGCGGGA 
GGAGUGGU A UUAUGAGG 
AGUGGUAU U AUGAGGUG 
GUGGUAUU A UGAGGUGA 
GAGGUGAU C AUUGUGCG 
GUGAUCAU U GUGCGGGU 



GAGUGGUC CUGAUGAG X CGAA AUACCUCC 
UG UACAGC CUGAUGAG X CGAA AGUGC-UCG 
UGCCUGUG CUGA UGAG X CGAA ACAGCGAG 
AUACCAGA CUGAUGAG X CGAA ACUGCCUG 
GUAUACCA CUGAUGAG X CGAA AGACUGCC 
UGGGUGUA CUGAUGAG X CGAA ACCAGAGA 
GAUGGGUG CUGAUGAG X CGAA AUACCAGA 
UCCCGCCG CUGAUGAG X CGAA AUGGGUGU 
CCUCAUAA CUG AUGAG X CGAA ACCACUC~ 
CACCUCAU CUGAUGAG X CGAA AUACCACU 
UCACCUCA CUGAUGAG X CGAA AAUACCAC~ 
CGCACAAU CUGAUGAG X CGAA AUCACCUC 
ACCCGCAC CUGAUGAG X CGAA AUGAUCAC 
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1358 
1359 
1371 
137S 
1380 
1391 
1394 



1419 
1425 
1465 
1473 
1474 
1475 
1475 
1482 



1491 
1493 
1497 



1530 
1536' 
1539 



1595 
1601 



1610 
1616 



GUGGAGAU C AAUGGACA 
GACAGGAU C UGAAAAUG 
CAAGGAGU A CAACUAUG 
GUACAACU A UGACAAGA 
AAGAGCAU U GUGGACAG 
ACCAACCU U CGUUUGCC 
CCAACCUU C GUUUGCCC 
ACCUUCGU U UGCCCAAG 
CCUUCGUU U GCCCAAGA 
GAAAGUGU U UGAAGCUG 
AAAGUGUU U GAAGCUGC 
GCUGCAGU C AAAUCCAU 
AGUCAAAU C CAUCAAGG 
AAAUCCAU C AAGGCAGC 
GGCAGCCU C CUCCACGG 
AGCCUCCU C CACGGAGA 
GGAGAAGU U CCCUGAUG 
GAGAAGUU C CCUGAUGG 
CUGAUGGU U UCUGGCUA 
UGAUGGUU U CUGGCUAG 
GAUGGUUU C UGGCUAGG 
UUCUGGCU A GGAGAGCA 
CCACCCCU U GGAACAUU 
UGGAACAU U UUCCCAGU 
GGAACAUU U UCCCAGUC 
GAACAUUU U CCCAGUCA 
AACAUUUU C CCAGUCAU 
UUCCCAGU C AUCUCACU 
CCAGUCAU C UCACUCUA 
AGUCAUCU C ACUCUACC 
AUCUCACU C UACCUAAU 
CUCACUCU A CCUAAUGG 
CUCUACCU A AUGGGUGA 
GGUGAGGU U ACCAACCA 
GUGAGGUU A CCAACCAG 
CAACCAGU C CUUCCGCA 
CCAGUCCU U CCGCAUCA 
CAGUCCUU C CGCAUCAC 
UUCCGCAU C ACCAUCCU 
AUCACCAU C CUUCCGCA 
ACCAUCCU U CCGCAGCA 
CCAUCCUU C CC-CAGCAA 
GCAGCAAU A CCUGCGGC 
GGCCACGU C CCAAGACG 
ACGACUGU U ACAAGUUU 
CGACUGUU A CAAGUUUG 
UUACAAGU U UGCCAUCU 
UACAAGUU U GCCAUCUC 
UUUGCCAU C UCACAGUC 
UGCCAUCU C ACAGUCAU 
CUCACAGU C AUCCACGG 



UGUCCAUU CUGAUGAG X CGAA AUCUCCAC 
CAUUUUCA CUGAUGAG X CGAA AUCCUGUC 
CAUAGUUG CUGAUGAG X CGAA ACUCCUUG 
UCUUGUCA CUGAUGAG X CGAA AGUUGUAC 
CUGUCCAC CUGAUGAG X CGAA AUGCUCUU 
GGCAAACG CUGAUGAG X CGAA AGGUUGGU 
GGGCAAAC CUGAUGAG X CGAA AAGGUUGG 
CUUGGGCA CUGAUGAG X CGAA ACGAAGGU 
UCUUGGGC CUGAUGAG X CGAA AACGAAGG 
CAGCUUCA CUGAUGAG X CGAA ACACUUUC 
GCAGCUUC CUGAUGAG X CGAA AACACUUU 
AUGGAUUU CUGAUGAG X CGAA ACUGCAGC 
CCUUGAUG CUGAUGAG X CGAA AUUUGACU 
GCUGCCUU CUGAUGAG X CGAA AUGGAUUU 
CCGUGGAG CUGAUGAG X CGAA AGGCUGCC 
UCUCCGUG CUGAUGAG X CGAA AGGAGGCU 
CAUCAGGG CUGAUGAG X CG AA ACUPCUCC ~ 
CCAUCAGG CUGAUGAG X CGAA AACUUCUC 
UAGCCAGA CUGAUGAG X CGAA ACCAUCAG 
CUAGCCAG CUGAUGAG X CGAA AACCAUCA 
CCUAGCCA CUGAUGAG X CGAA AAACCAUC 
UGCUCUCC CUGAUGAG X CGAA AGCCAGAA 
AAUGUUCC CUGAUGAG X CGAA AGGGGUGG 
ACUGGGAA C UGAUGAG X CGAA AUGUUCCA 
GACUGGGA CUGAUGAG X CGAA AAUGUUCC 
UGACUGGG CUGAUGAG X CGAA AAAUGUUC 
AUGACUGG CUGAUGAG X CGAA AAAAUGUU 
AGUGAGAU CUGAUGAG X CGAA ACUGGGAA 
UAGAGUGA CUGAUGAG X CGAA AUGACUGG 
GGUAGAGU CUGAUGAG X CGAA AGAUGACU 
AUUAGGUA CUG AUGAG X CGAA AGUGAGAU 
CCAUUAGG CUGAUGAG X CGAA AGAGUGAG 
UCACCCAU CUGAUGAG X CGAA AGGUAGAG 
UGGUUGGU CUGAUGAG X CGAA ACCUCACC 
CUGGUUGG CUGAUGAG X CGAA AACCUCAC~ 
UGCGGAAG CUGAUGAG X CGAA ACUGGUUG 
UGAUGCGG CUGAUGAG X CGAA AGGACUGG 
GUGAUGCG CUGAUGAG X CGA A AAGGACUG 
AGGAUGGU C UGAUGAG X CGAA AUGCGGAA 
UGC GGAAG CUGAUGAG X CGAA AUGGUGAU 
UGCUGCGG CUGAUGAG X CGAA AGGAUGGU 
UUGCUGCG CUGAUGAG X CGAA AAGGAUGG 
GCCGCAGG CUGAUGAG X CGAA AUUGCUGC 
CGUCUUGG CUGAUGAG X CGAA ACGUGGCC 
AAACUUGU CUGAUGAG X CGAA ACAGUCGU 
CAAACUUG CUGAUGAG X CGAA AACAGUCG 
AGAUGGCA CUGAUGAG X CGAA ACUUGUAA 
GAGAUGGC CUGAUGAG X CGAA AACUUGUA~ 
GACUGUGA CUGAUGAG X CGAA AUGGCAAA 
AUGACUGU CUGAUGAG X CGAA AGAUGGCA 
CCGUGGAU CUGAUGAG X CGAA ACUGUGAG 
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ACAGUC AU C CACGGGCA 
GGCACOGU U AUGGGAGC 
GCACUGUU A UGGGAGCU 
GGAGCUGU U AUCAUGGA 
GAGCUGUU A UCAUGGAG 
GCUGUUAU C AUGGAGGG 
GGAGGGC U U CUACGUUG 
GAGGGCUU C UACGUUGU 
GGGCUUCU A CGUUGUCU 
UUCUACGU U GUCUUUGA 
UACGUUGU C UUUGAUCG 



UGCCCGUG CUGAUGAG X CGAA AUGACUGU 
GCUCCCAU CUGAUG AG X CGAA ACAGUGCC 
AGCUCCCA CUGAUGAG X CGAA AACAGUGC 
UCCAUGAU CUGAUGAG X CGAA ACAGCUCC 
CUCCAUGA CUGAUGAG X CGAA AACAGCUC 
CCCUCCAU CU GAUGAG X CGAA AUAACAGC 
CAACGUAG CUGAUGAG X CG AA AGCCCUCC 
ACAACGUA CUGAUGAG X CGAA AAGCCCUC ~ 
A GACAACG CUGAUGAG X CGAA AGAAGCCC 
UCAAAGAC CUGAUGAG X CGAA ACGUAGAA 
CGAUCAAA CUGAUGAG X CGAA ACAACGUA ~ 



CGUUGUCU U UGAUCGGG 



CCCGAUCA CUGAUGAG X CGAA AGACAACG 



GUUGUCUU U GAUCGGGC 
UCUUUGAU C GGGCCCGA 



GCCCGAUC CUGAUGAG X CGAA AAGACAAC 



UCGGGCCC CUGAUGAG X CGAA AUCAAAGA 



AAACGAAU U GGCUUUGC 



GCAAAGCC CUGAUGAG X CGAA AUUCGUUU 



AAUUGGCU U UGCUGUCA 
AUUGGCUU U GCUGUCAG 



UGACAGCA CUGAUGAG X CGAA AGCCAAUU 



CUGACAGC CUGAUGAG X CGAA AAGCCAAU 



CAAGCGCU CUGAUGAG X CGAA ACAGCAAA 
CAC AUGGC CUGAUGAG X CGAA AGCGCUGA 
CCGUCCUG CUGAUG AG X CGAA ACUCAUCG 
GCCGUCCU CUGAUGAG X CGAA AACUCAUcT" 
GGUGACAA CUGAUGAG X CG AA AGGGCCUU~ 
AGGUGACA CUGAUGAG X CGAA AAGGGCCU 
AAGGUGAC CUG AUGAG X CGAA AAAGGGCC ~ 
UCCAAGGU CUGAU GAG X CGAA ACAAAAGG ~ 
CCAUGUCC CUGA UGAG X CGAA AGGUGACA 
GAAUGUUG CUGAUGAG X CGAA AGCCACAG ~ 
GUCUGUGG CUGAUGAG X CGAA AUGUUGUA~ 
UGUCUGUG CUGAUGAG X C GAA AAUGUUGU 
UGAGGGUU CUGAUGAG X CGAA ACUCAUCU 
AUGGUCAU CUGAUGAG X CGAA AGGGUUGA 
ACAUAGGC CUGAUG AG X CGAA AUGGUCAU 
CCAUGACA CUGAUGAG X CGAA AGGCUAUG 
GCAGCCAU CUGAUGAG X CGAA ACAUAGGC 
AGGG CGCA CUGAUGAG X CGAA AUGGCAGC 
AGCAUGAA CUGAUG AG X CGAA AGGGCGCA 
GCAGCAUG CUGAUGAG X CGAA AGAGGGCG 
G GCAGCAU CUGAUGAG X CGAA AAGAGGGC 
AUGAGGCA CUGAUGA G X CGAA AGUGGCAG 
CACACCAU CUGAU GAG X CGAA AGGCAGAG 
GCGCCACU CUGAUGAG X CGAA ACACACCA~ 
AGGCAGCG CUGAUGAG X CGA A AGGCAGCG 
CAUCAGCA CUGAUGAG X CGAA AGUCAUCA~ 
UCAUCAGC CUGAUGAG X C GAA AAGUCAUC~ 
AGCAGGGA CUGAUGAG X CG AA AUGUCAUC~ 
UCAGCAGG CUGAUGAG X CG AA AGAUGUCA 
GGAAUCUC CUGAUGA G X CGAA AUCUUCUG ~ 
UCCAGGGG CUGAUGAG X CGAA AUCUCUAU ~ 
GUCCAGG G CUGAUGAG X CGAA AAUCUCUA 
GAACCACG CUGAUG AG X CGAA AGGUGUGG 
CCAAAGUG CUGAUGAG X CGAA ACCACGGA ~ 



1730 
1731 
1756 
1757 
1758 
1761 
1766 
1787 
1794 
1795 



1827 
1832 
1836 



I860 
1872 



1931 
1932 



2004 
2011 



UUUGCUG U C AGCGCUUG 
UCAGCGCU U GCCAUGUG 
CGAUGAGU U CAGGACGG 
GAUGAGUU C AGGACGGC 
AAGGCCCU U UUGUCACC 
AGGCCCUU U UGUCACCU 
GGCCCUUU U GUCACCUU 
CCUUUUGU C ACCUUGGA 
UGUCACCU U GGACAUGG 
CUGUGGCU A CAACAUUC 
UACAACAU U CCACAGAC 
ACAACAUU C CACAGACA 
AGAUGAGU C AACCCUCA 
UCAACCCU C AUGACCAU 
AUGACCAU A GCCUAUGU 
CAUAGCCU A UGUCAUGG 
GCCUAUGU C AUGGCUGC 
GCUGCCAU C UGCGCCCU 
UGCGCCCU C UUCAUGCU 
CGCCCUCU U CAUGCUGC 
GCCCUC UU C AUGCUGCC 
CUGCCACU C UGCCUCAU 
CUCUGCCU C AUGGUGUG 
UGGUGUGU C AGUGGCGC 
CGCUGCCU C CGCUGCCU 
UGAUGACU U UGCUGAUG 
GAUGACUU U GCUGAUGA 
GAUGACAU C UCCCUGCU 
UGACAUCU C CCUGCUGA 
CAGAAGAU A GAGAUUCC 
AUAG AGAU U CCCCUGGA 
UAGAGAUU C CCCUGGAC 
CCACACCU C CGUGGUUC 
UCCGUGGU U CACUUUGG 
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CCGUGGUU C ACUUUGGU 



GGUUCACU U UGGUCACA 



GUUCACUU U GGUCACAA 



ACUUUGGU C ACAAGUAG 



UCACAAGU A GGAGACAC 



ACCAAAGU CUGAUGAG X CGAA AACCACGG 



UGUGACCA CUGAUGAG X CGAA AGUGAACC 



UUGUGACC CUGAUGAG X CGAA AAGUGAAC 



CUACUUGU CUGAUGAG X CGAA ACCAAAGU 



GUGUCUCC CUGAUGAG X CGAA ACUUGUGA 



GAGCACCU C AGGACCCU 



AGGGUCCU CUGAUGAG X CGAA AGGUGCUC 



AGGACCCU C CCCACCCA 



UGGGUGGG CUGAUGAG X CGAA AGGGUCCU 



AAAUGCCU C UGCCUUGA 



UCAAGGCA CUGAUGAG X CGAA AGGCAUUU 



CUCUGCCU U GAUGGAGA 



UCUCCAUC CUGAUGAG X CGAA AGGCAGAG 



AGGUGGGU U CCAGGGAC 



GUCCCUGG CUGAUGAG X CGAA ACCCACCU 



GGUGGGUU C CAGGGACU 



AGUCCCUG CUGAUGAG X CGAA AACCCACC 



GGGACUGU A CCUGUAGG 



CCUACAGG CUGAUGAG X CGAA ACAGUCCC 



GUACCUGU A GGAAACAG 



CUGUUUCC CUGAUGAG X CGAA ACAGGUAC 



GAAGCACU C UGCUGGCG 



CGCCAGCA CUGAUGAG X CGAA AGUGCUUC 



GCGGGAAU A CUCUUGGU 



ACCAAGAG CUGAUGAG X CGAA AUUCCCGC 



GGAAUACU C UUGGUCAC 



GUGACCAA CUGAUGAG X CGAA AGUAUUCC 



AAUACUCU U GGUCACCU 



AGGUGACC CUGAUGAG X CGAA AGAGUAUU 



CUCUUGGU C ACCUCAAA 



UUUGAGGU CUGAUGAG X CGAA ACCAAGAG 



GGUCACCU C AAAUUUAA 



UUAAAUUU CUGAUGAG X CGAA AGGUGACC 



CCUCAAAU U UAAGUCGG 



CCGACUUA CUGAUGAG X CGAA AUUUGAGG 



CUCAAAUU U AAGUCGGG 



CCCGACUU CUGAUGAG X CGAA AAUUUGAG 



UCAAAUUU A AGUCGGGA 



UCCCGACU CUGAUGAG X CGAA AAAUUUGA 



AUUUAAGU C GGGAAAUU 



AAUUUCCC CUGAUGAG X CGAA ACUUAAAU 



CGGGAAAU U CUGCUGCU 



AGCAGCAG CUGAUGAG X CGAA AUUUCCCG 



GGGAAAUU C UGCUGCUU 



AAGCAGCA CUGAUGAG X CGAA AAUUUCCC 



CUGCUGCU U GAAACUUC 



GAAGUUUC CUGAUGAG X CGAA AGCAGCAG 



UUGAAACU U CAGCCCUG 



CAGGGCUG CUGAUGAG X CGAA AGUUUCAA 



UGAAACUU C AGCCCUGA 



UCAGGGCU CUGAUGAG 2 



CUGAACCU U UGUCCACC 



GGUGGACA CUGAUGAG I 



UGAACCUU U GUCCACCA 



UGGUGGAC CUGAUGAG X CGAA AAGGUUCA 



ACCUUUGU C CACCAUUC 



GAAUGGUG CUGAUGAG X CGAA ACAAAGGU 



UCCACCAU U CCUUUAAA 



UUUAAAGG CUGAUGAG X CGAA AUGGUGGA 



CCACCAUU C CUUUAAAU 



AUUUAAAG CUGAUGAG X CGAA AAUGGUGG 



CCAUUCCU U UAAAUUCU 



AGAAUUUA CUGAUGAG X CGAA AGGAAUGG 



CAUUCCUU U AAAUUCUC 



GAGAAUUU CUGAUGAG X CGAA AAGGAAUG 



AUUCCUUU A AAUUCUCC 



GGAGAAUU CUGAUGAG X CGAA AAAGGAAU 



CUUUAAAU U CUCCAACC 



UUUAAAUU C UCCAACCC 



UAAAUUCU C CAACCCAA 



GGUUGGAG CUGAUGAG X CGAA AUUUAAAG 



GGGUUGGA CUGAUGAG X CGAA AAUUUAAA 



UUGGGUUG CUGAUGAG X CGAA AGAAUUUA 



CCCAAAGU A UUCUUCUU 



AAGAAGAA CUGAUGAG X CGAA ACUUUGGG 



CAAAGUAU U CUUCUUUU 



AAAAGAAG CUGAUGAG X CGAA AUACUUUG 



AAAGUAUU C UUCUUUUC 



GAAAAGAA CUGAUGAG X CGAA AAUACUUU 



AGUAUUCU U CUUUUCUU 



AAGAAAAG CUGAUGAG X CGAA AGAAUACU 



GUAUUCUU C UUUUCUUA 



UAAGAAAA CUGAUGAG X CGAA AAGAAUAC 



AUUCUUCU U UUCUUAGU 



ACUAAGAA CUGAUGAG X CGAA AGAAGAAU 



UUCUUCUU U UCUUAGUU 



AACUAAGA CUGAUGAG X CGAA AAGAAGAA 



UCUUCUUU U CUUAGUUU 



AAACUAAG CUGAUGAG X CGAA AAAGAAGA 



CUUCUUUU C UUAGUUUC 



GAAACUAA CUGAUGAG X CGAA AAAAGAAG 



UCUUUUCU U AGUUUCAG 



CUGAAACU CUGAUGAG X CGAA AGAAAAGA 



CUUUUCUU A GUUUCAGA 



UCUGAAAC CUGAUGAG X CGAA AAGAAAAG 



UUCUUAGU U UCAGAAGU 



ACUUCUGA CUGAUGAG X CGAA ACUAAGAA 
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2308 
2309 
2316 
2324 
2335 
233S 
2340 
2350 
2359 
2384 



2455 
2457 



2489 
2491 
2496 
2497 
2510 



UCUUAGUU U CAGAAGUA 
CUUAGUUU C AGAAGUAC 
~ UCAGAAGU A CUGGCAUC 
ACUGGCAU C ACACGCAG 
ACG CAGGU U ACCUUGGC 
CGCAGGUU A CCUUGGCG 
GGUUACCU U GGCGUGUG 
GCGUGUGU C CCUGUGGU 
CCUGUGGU A CCCUGGCA 
ACCAAGCU U GUUUCCCU 
AAGCUUGU U UCCCUGCU 
AGCUUGUU U CCCUGCUG 
GCUUGUUU C CCUGCUGG 
GCCAAAGU C AGUAGGAG 
AAGUCAGU A GGAGAGGA 
UGC ACAGU U UGCUAUUU 
GCACAGUU U GCUAUUUG 
AGUUUGCU A UUUGCUUU 
UUUGCUAU U UGCUUUAG 
UUGCUAUU U GCUUUAGA 
UAUUUGCU U UAGAGACA 
AUUUGCUU U AGAGACAG 
UUUGCUUU A GAGACAGG 
GGGA CUGU A UAAACAAG 
GACUGUAU A AACAAGCC 
ACAAGCCU A ACAUUGGU 
CCUAACAU U GGUGCAAA 
GCAAAGAU U GCCUCUUG 
GAUUGCCU C UUGAAUUA 
UUGCCUCU U GAAUUAAA 
UCUUGAA U U AAAAAAAA 
CUUGAAUU A AAAAAAAA 
AAAAAACU A GAAAAAAA 



UACUUCUG CUGAUGAG X CGAA AACUAAGA 
GUACUUCU CUGAUGAG X CGAA AAACUAAG 
GAUGCCAG CUGAUGAG X CGAA ACUUCUGA 
CUGCGUGU CUGAUGAG X CGAA AUGCCAGU 
GCCAAGGU CUGAUGAG X CGAA ACCUGCGU 
CGCCAAGG CUGAUGAG X CGAA AACCUGCG 
CACACGCC CUGAUGAG X CGAA AGGUAACC 
ACCACAGG CUGAUGAG X CGAA ACACACGC 
UGCCAGGG CUGAUGAG X CGAA ACCACAGG 
AGGGAAAC CUGAUGAG X CGAA AGCUUGGU 
AGCAGGGA CUGAUGAG X CGAA ACAAGCUU 
CAGCAGGG CUGAUGAG X CGAA AACAAGCU 
CCAGCAGG CUGAUGAG X CGAA AAACAAGC 
CUCCUACU C UGAUGAG X CGAA ACUUUGGC 
UCCUCUCC CUGAUGAG X CGAA ACUGACUU 
AAAUAGCA CUGAUGAG X CGAA ACUGUGCA 
CAAAUAGC CUGAUGAG X CGAA AACUGUGC 
AAAGCAAA CUGAU GAG X CGAA AGCAAACU 
CUAAAGCA CUGAUGAG X CGAA AUAGCAAA 
UCUAAAGC CUGAUGAG X CGAA AAUAGCAA 
UGUCUCUA CUGAUGAG X CGAA AGCAAA'JA 
CUGUCUCU CUGAUGAG X CGAA AAGCAAAU 
CCUGUCUC CUGAUGAG X CGAA AAAGCAAA 
CUUGUUUA CUGAUGAG X CGAA ACAGUCCC 
GGCUUGUU CUGAUG AG X CGAA AUACAGUC ~ 
ACCAAUGU CUGAUGAG X CGAA AGGCUUGU 
UUUGCACC CUGAUGAG X CGAA AUGUUAGG 
CAAGAGGC CUGAUGAG X CGAA AUCUUUGC 
UAAUUCAA CU GAUGAG X CGAA AGGCAAUC ~ 
UUUAAUUC CUGAUGAG X CGAA AGAGGCAA 
UUUUUUUU CUGAUGAG X CGAA AUUCAAGA 
UUUUUUUU 



X CGAA AAUUCAAG 
UUUUUUUC CUGAUGAG X CGAA AGUUUUUU 



Input Sequence = AF190725. Cut Site = G/ . 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II) 
AF190725 (Homo sapiens beta-site APP cleaving enzyme (BACE) mRNA; 2526 bp) 
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Table 19 

Table 19: Human BACE NCH Ribozyme and Target Sequence 



CACGCGUC C GCAGCCCG 
GCGUCCGC A GCCCGCCC 
UCCGCAGC C CGCCCGGG 
CCGCAGCC C GCCCGGGA 
CAGCCCGC C CGGGAGCU 
AGCCCGCC C GGGAGCUG 
CCGGGAGC U GCGAGCCG 
CUGCGAGC C GCGAGCUG 
CCGCGAGC U G GAUUAUG 
AUGGUGGC C UGAGCAGC 
UGGUGGC C U GAGCAGCC 
GCCUGAGC A G CCAACGC 
UG AGCAGC C AACGCAGC 
GAGCAGC C A ACGCAGCC 
GCCAACGC A GCCGCAGG 
AACGCAGC C GCAGGAGC 
GCAGCCGC A GGAGCCCG 
GCAGGAGC C CGGAGCCC 
CAGGAGCC • C GGAGCCCU 
CCCGGA GC C CUUGCCCC 
CCGGAGCC C UUGCCCCU 
CGGAGCCC U UGCCCCUG 
GCCCUUGC C CCUGCCCG 
CCCUUGCC C CUGCCCGC 
CCUUGCCC C UGCCCGCG 
CUUGCC CC U GCCCGCGC 
GCCCCUGC C CGCGCCGC 
CCCCUGCC C GCGCCGCC 
GCCCGCGC C GCCGCCCG 
CGCGCCGC C GCCCGCCG 
GCCGCCGC C CGCCGGGG 
CCGCCGCC C GCCGGGGG 
CCGCCCGC C GGGGGGAC 
GGGGGGA C C AGGGAAGC 
GGGGGACC A GGGAAGCC 
AGGGAAGC C GCCACCGG 
GAAGCCGC C ACCGGCCC 
AAGCCG CC A CCGGCCCG 
GCCGCCAC C GGCCCGCC 
CCACCGGC C CGCCAUGC 
CACCGGCC C GCCAUGCC 
CGGCCCGC C AUGCCCGC 
GGCCCGCC A UGCCCGCC 
CGCCAUGC C CGCCCCUC 
GCCAUGCC C GCCCCUCC 
AUGCCCGC C CCUCCCAG 
UGCCCGCC C CUCCCAGC 



CGGGCUGC CUGAUGAG X CGAA IACGCGUG 
GGGCGGGC CUGAUGAG X CGAA ICGGACGC 
CCC GGGCG CUGAUGAG X CGAA ICUGCGGA 
UCCCGGGC CUGAUGAG X CGA A IGCUGCGG 
AGCUCCC G CUGAUGAG X CGAA ICGGGCUG 
CAGCUCCC CUGAUGAG X CGAA IGCGGGCU 
CGG CUCGC CUGAUGAG X CGAA ICUCCCGG 
CAGCUCGC CUGAUGAG X CGAA ICUCGCAG 
CAUAAUCC CUGAUGAG X CG AA ICUCGCGG~ 
GCUGCUCA CUGAUGAG X CGAA ICCACCAU~ 
GGCUGCUC CUGAUGA G X CGAA IGCCACCA~ 
GCGUUGGC CUGAUGA G X CGAA ICUCAGG"c~ 
GCU GCGUU CUGAUGAG X CGAA ICUGCUCA 
GGC UGCGU CUGAUGAG X CGAA IGCUGCUC 
CCUGCGGC CUGAUGAG X CGAA ICGUUGGC ~ 
GCUCCUGC CUGAU GAG X CGAA ICUGCGUU~ 
CGGGCUCC CUGAUGAG X C GAA ICGGCUGC ~ 
GGG CUCCG CUGAUGAG X CGAA ICUCCUGC 
AGGGCUCC CUGAUGAG X CGAA IGCUCCUG ~ 



GGGGCAAG CUGAUGAG X CGAA 
AGGGGCAA CUGAUGAG X CGAA 



IGCUCCGG 



CGGGCGGC CUGAUGAG X CGAA 1 



CAG GGGCA CUGAUGAG X CGAA IGGCUCCG 
CGGGCAGG CUGAUGAG X C GAA I CAAGGGC 
GCGGGCAG CUGAUGAG X CGAA I GCAAGGG 
CGCGGGCA CUGAUGAG X CGA A IGGCAAGG 
GCGCGGGC CUGA UGAG X CGAA IGGGCAAG 
GCGGCGCG CUGAUG AG X CGAA I CAGGGGC 
GGCGGCGC CUGAUGAG X CGA A IGCAGGGG 
i ICGCGGGC 

CGG CGGGC CUGAUGACTx CGAA~~ICGGCGCG 
CCCCGGC G CUGAUGAG X CGAA ICGGCGGC 
CCCCCGGC CUGAUGAG X CGAA IGCGGCGG 
GUCCCCCC CUGAUGA G X CGAA I CGGGCGG 
GCUUCCCU CUGAUGAG X CGAA IUCCCCCC 
GGCUUCCC CUGAUGAG X CGAA IGUCCCCC 
CCGGUGG C CUGAUGAG X CGAA ICUUCCCU 
GGGCCGGU CUGA UGAG X CGAA ICGGCUUC 
CGGGCCGG CUGAUGAG X CGAA IGCGGCUU 
GGCG GGCC CUGAUGAG X CGAA IUGGCGGC 
GCAUGGCG CUGAUGAG X CGAA ICCGGUGG 
GGCAUGGC CUGAUGAG X CGAA IGCCGGUG 



GCGGGCAU CUGAUGAG X CGAA I 



i ICGGGCCG 

GGCGGG CA CUGAUGAG X CGAA IGCGGGCC 

GAGGGGCG CUGAUGAG X CGAA ICAUGGCG~ 

GGA GGGGC CUGAUGAG X CGAA IGCAUGGC 

CUGGGAGG CUGAU GAG X CGAA ICGGGCAU~ 

GCUGGGAG CUGAUGAG X CGAA IGCGGGCA 



2072 
2073 



2095 
2096 
2097 
2098 
2099 



2105 
2106 
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GCCCGCCC C UCCCAGCC 
CCCGCCCC U C CCAGCCC 
CGCCCCUC C CAGCCCCG 
GCCCCUCG C AGCCCCGC 
CCCCUCCC A GCCCCGCC 
CUC CCAGC C CCGCCGGG 
UCCCAGCC C CGCCGGGA 
CCCAGCCC C GCCGGGAG 
AGCCCCGC C GGGAGCCC 
CCGGGAGC C CGCGCCCG 
CGGGAGC C C GCGCCCGC 
GCCCGCGC C CGCUGCCC 
CCCGCGCC C G CUGCCCA 
GCGCCCGC U GCCCAGGC 
CCCGCUGC C CAGGCUGG 
CCGCUGCC C AGGCUGGC 
CGCU GCCC A GGCUGGCC 
GCCCAGGC U GGCCGCCG 
AGGCUGGC C GCCGCCGU 
CUGGCCGC C GCCGUGCC 



Table 19 

GGCUGGGA CUGAUG AG X CGAA IGGCGGGC 
GGGCUGGG CUGAUGAG X CGAA IGGGCGGG 
CGGGGCUG CUGAUG AG X CGAA IAGGGGCG 
GCGGGGCU CUGAUGAG X CGAA IGAGGGGC 
GGCGGGGC CUGAUG AG X CGAA IGGAGGGG 
CCCGGCGG CUGAUGAG X CGAA I CUGGGAG 
UCCCGGC G CUGAUGAG X CGAA IGCUGGGA~ 
CUCCCGGC CUGAUGA G X CGAA IGGCUGGG 
GGGCUC CC CUGAUGAG X CGAA ICGGGGCtT 
CGGGCGCG CUGAUGAG X CGAA ICUCCCGG ~ 
GC GGGCGC CUGAUGAG X CGAA IGCUCCCG~ 
GGGCAGCG C UGAUGAG X CGAA ICGCGGGC~ 
UGGGCAGC CUGAUGA G X CGAA IGCGCGGG" 
GCCUGGGC CUGAUGAG X CGAA ICGGGCGc" 
CCAGCCUG CUGAUGAG X CGAA ICAGCGGG~ 
GCCAGCCU CUGAUGA G X CGAA IGCAGCGG" 
GGC CAGCC CUGAUGAG X CGAA IGGCAGCG~ 
CGGCGGCC CUGAUGAG X CGAA ICCUGGGC 
ACGGCGGC CUGAUGAG X CGAA ICCAGCCU 



GGCACGGC CUGAUGAG X CGAA ICGGCCAG~ 



GCCGCCGC C GUGCCGAU 



AUCGGCAC CUGAUGAG X CGAA ICGGCGG"c~ 



GCUACAUC CUGAUGAG X CGAA ICACGGCG 



2124 
2125 



CGCCGUGC C GAUGUAGC 



UAGCGGGC U CCGGAUCC 



GGAUCCGG CUGAUGAG X CGAA ICCCGCUA~ 



GCGGGCUC C GGAUCCCA 



UCCGGAUC C CAGCCUCU 



CCGGAUCC C AGCCUCUC 
CGGAUCCC A GCCUCUCC 
AUCCCAGC C UCUCCCCU 



UCCCAGCC U CUCCCCUG 



CCAGCCUC U CCCCUGCU 



UGGGAUCC CUGAUGAG X CGAA IAGCCCGC 



AGAGGCUG CUGAUGAG X CGAA IAUCCGGA 



GAGAGGCU CUGAUGAG X CGAA IGAUCCGG 



GGAC-AGGC CUGAUGAG X CGAA IGGAUCCG 



AGGGGAGA CUGAUGAG X CGAA ICUGGGAU 



CAGGGGAG CUGAUGAG X CGAA IGCUGGGA 



AGCAGGGG CUGAUGAG X CGAA IAGGCUGG 



262 
270 
272 



275 
279 
282 



AGCCUCUC C CCUGCUCC 



GGAGCAGG CUGAUGAG X CGAA IAGAGGCU 



G CCUCUCC C CUGCUCCC 
CCUCUCCC C UGCUCCCG 
CUCUCCCC U GCUCCCGU 



GGGAGCAG CUGAUGAG X CGAA IGAGAGGC 



CGGGAGCA CUGAUGAG X CGAA IGGAGAGG 



ACGGGAGC CUGAUGAG X CGAA I GGGAGAG 



UCCCCUGC U CCCGUGCU 



AGCACGGG CUGAUGAG X CGAA ICAGGGGA 



2139 
2140 



CCCUGCUC C CGUGCUCU 
CCUGCUCC C GUGCUCUG 



AGAGCACG CUGAUGAG X CGAA IAGCAGGG 



CAGAGCAC CUGAUGAG 



: CGAA IGAGCAGG 



UCCCGUGC U CUGCGGAU 



AUCCGCAG CUGAUGAG X' CGAA ICACGGGA 



CCGUGCUC U GCGGAUCU 
UGCGGAUC U CCCCUGAC 
CGGAUCU C C CCUGACCG 
GGAUCUCC C CUGACCGC 
GAUCUCCC C UGACCGCU 
AUCUCCCC U GACCGCUC 
CCCCUGAC C GCUCUCCA 
CUGACCGC U CUCCACAG 
GACCGCUC U CCACAGCC 
CCGCUCUC C ACAGCCCG 
CGCUCUCC A CAGCCCGG 
CUCUCCAC A GCCCGGAC 
UCCACAGC C CGGACCCG 



AGAUCCGC CUGAUGAG X CGAA IAGCACGG 



GUCAGGGG CUGAUGAG > 



X CGAA 

CGGUCAGG CU GAUGAG X CGAA IAGAUCCG 
GC GGUCAG CUGAUGAG X CGAA IGAGAUCC 
AGCGGU CA CUGAUGAG X CGAA IGGAGAUC 
GAGCGGUC CUGAUGAG X CGAA IGGGAGAU 
UGGAGAGC CUGAUGAG X C GAA IUCAGGGG 
CUGUGGAG CUGAUGAG X CGAA ICGGUCAG~ 
GGCUGU GG CUGAUGAG X CGAA IAGCGGUC 
CGGGCUGU CUGAUGA G X CGAA IAGAGCGG 
CCGG GCUG CUGAUGAG X CGAA IGAGAGCG 
GUC CGGGC CUGAUGAG X CGAA IUGGAGAG 
CGGGUCCG CUGAUGAG X CGAA ICUGUGGA 



2149 
2150 
2151 
2152 
2153 



WO 01/16312 



PCT/US00/23998 



Table 19 



CCACAGCC C GGACCCGG 



GCCCGGAC C CGGGGGCU 



CCCGGACC C GGGGGCUG~ 



CCGGGGGC U GGCCCAGG 



GGGCUGGC C CAGGGCCC 



GGCUGGCC C AGGGCCCU~ 



GCUGGCCC A GGGCCCUG 



CCCAGGGC C CUGCAGGC 



CCAGGGCC C UGCAGGCC 



CAGGGCCC U GCAGGCCC 



GGCCCUGC A GGCCCUGG 



CUGCAGGC C CUGGCGUC" 



UGCAGGCC C UGGCGUCC~ 



GCAGGCCC U GGCGUCCU 



CUGGCGUC C UGAUGCCC 



UGGCGUCC 5 GAUGCCCC 



CCGGGUCC CUGAUGAG X CGAA IGCUGUGG 



AGCCCCCG CUGAUGAG X CGAA IUCCGGGC 



CAGCCCCC CUGAUGAG X CGAA IGUCCGGG 



CCUGGGCC CUGAUGAG X CGAA ICCCCCGG 



CCUGAUGC C CCCAAGCU 



CUGAUGCC C CCAAGCUC 



UGAUGCCC C CAAGCUCC 



GAUGCCCC C AAGCUCCC 



AUGCCCCC A AGCUCCCU~ 



CCCCAAGC U CCCUCUCC~ 



CCAAGCUC C CUCUCCUG 



CAAGCUCC C UCUCCUGA 



AAGCUCCC U CUCCUGACT 



GCUCCCUC U CCUGAGAA 



UCCCUCUC C UGAGAAGC 



CCCUCUCC U GAGAAGCC 



UGAGAAGC C ACCAGCAC 



GAGAAGCC A CCAGCACC 



GAAGCCAC C AGCACCAC 



AAGCCACC A GCACCACc" 



CCACCAGC A CCACCCAG 



ACCAGCAC C ACCCAGAC 



GGGCCCUG CUGAUGAG X CGAA ICCAGCCC 



AGGGCCCU CUGAUGAG X CGAA IGCCAGCC" 



CAGGGCCC CUGAUGAG X CGAA IGGCCAGC 



GCCUGCAG CUGAUGAG X CGAA ICCCUGGtT 



GGCCUGCA CUGAUGAG X CGAA IGCCCUGG 



GGGCCUGC- CUGAUGAG X CGAA IGGCCCUG~ 



CCAGGGCC CUGAUGAG X CGAA ICAGGGCC~ 



GACGCCAG CUGAUGAG X CGAA ICCUGCAG~ 



GGACGCCA CUGAUGAG X CGAA IGCCUGCA 



AGGACGCC CUGAUGAG X CGAA IGGCCUGC~ 



GGGCAUCA CUGAUGAG X CGAA IACGCCAG 



GGGGCAUC CUGAUGAG X CGAA IGACGCCA 



AGCUUGGG CUGAUGAG X CGAA ICAUCAGG~ 



GAGCUUGG CUGAUGAG X CGAA IGCAUCAG 



GGAGCUUG CUGAUGAG X CGAA IGGCAUCA 



GGGAGCUU CUGAUGAG X CGAA IGGGCAUC 



AGGGAGCU CUGAUGAG X CGAA IGGGGCAU~ 



GGAGAGGG CUGAUGAG X CGAA ICUUGGGG 



CAGGAGAG CUGAUGAG X CGAA IAGCUUGG 



CCAGCACC A CCCAGACU 



AGCACCAC C CAGACUUG 



GCACCACC C AGACUUGG~ 



CACCACCC A GACUUGGG 



ACCCAGAC U UGGGGGCA 



UUGGGGGC A GGCGCCAG 



GCAGGCGC C AGGGACGG 



CAGGCGCC A GGGACGGA 



ACGUGGGC C AGUGCGAG 



CGUGGGCC A GUGCGAGC 



GUGCGAGC C CAGAGGGC 



UGCGAGCC C AGAGGGCC 



GCGAGCCC A GAGGGCCC 



CAGAGGGC C CGAAGGCC 



AGAGGGCC C GAAGGCCG 



CCGAAGGC C GGGGCCCA 



GCCGGGGC C CACCAUGG 



UCAGGAGA CUGAUGAG X CGAA IGAGCUUG 



CUCAGGAG CUGAUGAG X CGAA IGGAGCUtT 



UUCUCAGG CUGAUGAG X CGAA IAGGGAGC 



GCUUCUCA CUGAUGAG X CGAA IAGAGGGA 



GGCUUCUC CUGAUGAG X CGAA IGAGAGGG 



GUGCUGGU CUGAUGAG X CGAA ICUUCUCA 



GGUGCUGG CUGAUGAG X CGAA IGCUUCUC 



GUGGUGCU CUGAUGAG X CGAA IUGGCUUC 



GGUGGUGC CUGAUGAG X CGAA IGUGGCUU 



CUGGGUGG CUGAUGAG X CGAA ICUGGUGG 



GUCUGGGU CUGAUGAG X CGAA IUGCUGGU 



AGUCUGGG CUGAUGAG X CGAA IGUGCUGG 
CAAGUCUG CUGAUGAG X CGAA IUGGUGCU 



CCAAGUCU CUGAUGAG X CGAA IGUGGUGC 



CCCAAGUC CUGAUGAG X CGAA IGGUGGUG 



UGCCCCCA CUGAUGAG X CGAA IUCUGGGtT" 



CUGGCGCC CUGAUGAG X CGAA ICCCCCAA 



CCGUCCCU CUGAUGAG X CGAA ICGCCUGC~ 



UCCGUCCC CUGAUGAG X CGAA IGCGCCUG 



CUCGCACU CUGAUGAG X CGAA ICCCACGU 



GCUCGCAC CUGAUGAG X CGAA IGCCCACG 



GCCCUCUG CUGAUGAG X CGAA ICUCGCAC 



GGCCCUCU CUGAUGAG X CGAA IGCUCGCA 



GGGCCCUC CUGAUGAG X CGAA IGGCUCGC~ 



GGCCUUCG CUGAUGAG X CGAA ICCCUCUG 



CGGCCUUC CUGAUGAG X CGAA IGCCCUCU~ 



UGGGCCCC CUGAUGAG X CGAA ICCUUCGG 



CCAUGGUG CUGAUGAG X CGAA ICCCCGGC' 
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CCGGGGCC C ACCAUGGC 
CGGG GCCC A CCAUGGCC 
GGGCCCAC C A UGGCCCA 
GGCCCACC A U GGCCCAA 
ACCAUGGC C C AAGCCCU 
CCAUGGCC C AAGCCCUG 



539 
54 0. 



GCCAUGGU CUGAUGA G X CGAA IGCCCCGG 
GGCCAUGG CU GAUGAG X CGAA IGGCCCCG~ 
UGGGCCAU CUGAUGA G X CGAA IUGGGCCC~ 
UUGGGC CA CUGAUGAG X CGAA IGUGGGCC 
AGGGCUUG CUGAUGAG X CGAA ICCAUGGU~ 



CAGGGCUU CUGAUGAG X CGAA IGCCAUGG 



CAUGGCCC A AGCCCUGC 
GCCCAAGC C CUGCCCUG 



GCAGGGCU CUGAUGAG X CGAA IGGCCAUG 



CAGGGCAG CUGAUGAG X CGAA ICUUGGGC 



CCCA AGCC C UGCCCUGG 
CCAAGCCC U GCCCUGGC 



CCAGGGCA CUGAUGAG X CGAA IGCUUGGG 



GCCAGGGC CUGAUGAG X CGAA IGGCUUGG 



AGCCCUGC C CUGGCUCC 
GCCCUGCC C UGGCUCCU 



GGAGCCAG CUGAUGAG X CGAA ICAGGGCU 



AGGAGCCA CUGAUGAG X CGAA IGCAGGGC 



CCCUGCCC U GGCUCCUG 
GCCCUGGC U CCUGCUGU 
CCUGGCUC C UGCUGUGG 



CAGGAGCC CUGAUGAG X CGAA I GGCAGGG 



ACAGCAGG CUGAUGAG X CGAA ICCAGGGC 



CCACAGCA CUGAUGAG X CGAA IAGCCAGG 



CUGGCUCC U C 
GCUCCUGC U C 



UCCACAGC CUGAUGAG X CGAA IGAGCCAG 



CCAUCCAC CUGAUGAG X CGAA I CAGGAGC 



GGGAGUGC U GCCUGCCC 
AGUGCUGC C UGCCCACG 



GGGCAGGC CUGAUGAG X CGAA ICACUCCC 



CGUGGGCA CUGAUGAG X CGAA ICAGCACU 



GUGCUGCC U GCCCACGG 
CUGCCUGC C CACGGCAC 



CCGUGGGC CUGAUGAG X CGAA IGCAGCAC 



GUGCCGUG CUGAUGAG X CGAA I CAGGCAG 



UGCCUGCC C ACGGCACC 
GCCUGCCC A CGGCACCC 
CCCACGGC A CCCAGCAC 
CACGGCAC C CAGCACGG 



GGUGCCGU CUGAUGAG X CGAA IGCAGGCA 



GGGUGCCG CUGAUGAG X CGAA IGGCAGGC 



ACGGCA CC C AGCACGGC 
CGGCACCC A GCACGGCA 



CACCCAGC A CGGCAUCC 
AGCACGGC A UCCGGCUG 



ACGGCAUC C GGCUGCCC 



CAUCCGG C U GCCCCUGC 
CCGGCUGC C CCUGCGCA 



CGGCUGCC C CUGCGCAG 



GGCUGCCC C UGCGCAGC 
GCUGCCCC U GCGCAGCG 



CCCUGCGC A GCGGCCUG 



GCAGCGGC C UGGGGGGC 



CAGCGGCC U GGGGGGCG 



GGGGGCGC C CCCCUGGG 
GGGGCGCC C CCCUGGGG 



GGGCGCCC C CCUGGGGC 
GGCGCCCC C CUGGGGCU 



GCGCCCCC C UGGGGCUG 



CGCCCCCC U GGGGCUGC 
CCUGGGGC U GCGGCUGC 
GCUGCGGC U GCCCCGGG 
GCGGCUGC C CCGGGAGA 
CGGCUGCC C CGGGAGAC 
GGCUGCCC C GGGAGACC 
CGGGA GAC C GACGAAGA 
CGAAGAGC C CGAGGAGC 



GUGCUGGG CUGAUGAG X CGAA ICCGUGGG 



CCGUGCUG CUGAUGAG X CGAA IUGCCGUG 



GCCGUGCU CUGAUGAG X CGAA IGUGCCGU 



UGCCGUGC CUGAUGAG X CGAA IGGUGCCG 



GGAUGCCG CUGAUGAG X CGAA ICUGGGUG 



CAGCCGGA CUGAUGAG X CGAA ICCGUGCU 



GGGCAGCC CUGAUGAG X CGAA IAUGCCGU 



GCAGGGGC CUGAUGAG X CGAA ICCGGAUG 



UGCGCAGG CUGAUGAG X CGAA ICAGCCGG 



CUGCGCAG CUGAUGAG X CGAA IGCAGCCG 



GCUGCGCA CUGAUGAG X CGAA IGGCAGCC 



CGCUGCGC CUGAUGAG X CGAA IGGGCAGC 



CAGGCCGC CUGAUGAG X CGAA ICGCAGGG 



GCCCCCCA CUGAUGAG X CGAA ICCGCUGC 



CGCCCCCC CUGAUGAG X CGAA IGCCGCUG 



CCCAGGGG CUGAUGAG X CGAA ICGCCCCC 



CCCCAGGG CUGAUG AG X CGAA 
X CGAA 



GCCCCAGG CUGAUGAG 5 



IGCGCCCC 
IGGCGCCC 



AGCCCCAG CUGAUGAG X CGAA IGGGCGCC 



CAGCCCCA CUGAUGAG X CGAA IGGGGCGC " 



GCAGCCCC CUGAUGAG X CGAA IGGGGGCG 



GCAGCCGC CUGAUGAG X CGAA ICCCCAGG 



CCCGGGG C CUGAUGAG X CGAA ICCGCAGC 
UCUCCCGG CUGAUG AG X CGAA ICAGCCGC 
GUCUC CCG CUGAUGAG X CGAA IGCAGCCG 
GGUCUC CC CUGAUGAG X CGAA IGGCAGCC 
UCUUCGUC CUGAUGAG X CGAA IUCUCCCG 
GCUCCUCG CUGAUGAG X CGAA ICUCUUCG~ 
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GAAGAGCC C GAGGAGCC 
CGAGG AGC C CGGCCGGA 
GAGGAGCC C GGCCGGAG 



GGCUCCUC CUGAU GAG X CGAA IGCUCUUC 
UCCGGCCG CU GAUGAG X CGAA ICUCCUCG~ 
CUCCGGCC CUGAUGAG X CGAA IGCUCCUC 



AGCCCGGC C GGAGGGGC 
GGAGGGGC A GCUUUGUG 



GCCCCUCC CUGAUGAG X CGAA ICCGGGCU~ 



CACAAAGC CUGAUGAG X CGAA ICCCCUCC 



GGGGCAGC U UUGUGGAG 
UGGUGGAC A ACCUGAGG 



UGGACAAC C UGAGGGGC 



GGACAACC U GAGGGGCA 



UGAGGGGC A AGUCGGGG 



GUCGGGGC A GGGCUACU 



GGCAGGGC U ACUACGUG 



AGGGCUAC U ACGUGGAG 



GAGAUGAC C GUGGGCAG 
CCGUGGGC A GCCCCCCG 
UGGGCAGC C CCCCGCAG 
GGGCAGCC C CCCGCAGA 
GGCAGCCC C CCGCAGAC 



GCAGCCCC C CGCAGACG 



CAGCCCCC C GCAGACGC 
CCCCCCGC A GACGCUCA 
GCAGACG C U CAACAUCC 
AGACGCUC A ACAUCCUG 



CGCUCAAC A UCCUGGUG 
UCAACAUC C UGGUGGAU 



CAACAUCC U GGUGGAUA 



GUGGAUAC A GGCAGCAG 



AUACAGGC A GCAGUAAC 



CAGGCAG C A GUAACUUU 
GCAGUAAC U UUGCAGUG 
AACUUUG C A GUGGGUGC 
GUGGGUGC U GCCCCCCA 



CUCCACAA CUGAUGAG X CGAA ICUGCCCC 



CCUCAGGU CUGAUGAG X CGAA IUCCACCA 



GCCCCUCA CUGAUGAG X CGAA IUUGUCCA 



UGCCCCUC C UGAUGAG X CGAA IGUUGUCC 
CCCCGACU CUGAUGAG X CGAA ICCCCUCA~ 



AGUAGCCC CUGAUGAG X CGAA ICCCCGAC 



CACGUAGU CUGAUGAG X CGAA ICCCUGCC 



CUCCACGU CUGAUGAG X CGAA IUAGCCCU 



CUGCCCAC CUGAUGAG X CGAA IUCAUCUC 



CGGGGGGC CUGAUGAG X CGAA ICCCACGG 



CUGCGGGG CUGAUGAG X CGAA ICUGCCCA 



UCUGCGGG CUGAUGAG X CGAA IGCUGCCC 



GUCUGCGG CUGAUGAG X CGAA IGGCUGCC 



CGUCUGCG CUGAUGAG X CGAA IGGGCUGC 



GCGUCUGC CUGAUGAG X CGAA IGGGGCUG 



UGAGCGUC CUGAUGAG X CGAA I CGGGGGG 



GGAUGUUG CUGAUGAG X CGAA ICGUCUGC 



CAGGAUGU CUGAUGAG X CGAA IAGCGUCU 



CACCAGGA CUGAUGAG X CGAA IUUGAGCG 



AUCCACCA CUGAUGAG X CGAA IAUGUUGA 



UAUCCACC CUGAUGAG X CGAA IGAUGUUG 



CUGCUGCC CUGAUGAG ) 



GUUACUGC CUGAUGAG 5 



GGUGCUGC C CCCCACCC 
GUGCUGCC C CCCACCCC 



UGCUGCCC C CCACCCCU 



GCUGCCCC C CACCCCUU 
CUGCCCCC C ACCCCUUC 
UGCCCCCC A CCCCUUCC 
CCCCCCAC C CCUUCCUG 
CCCCCACC C CUUCCUGC 



CCCCACCC C UUCCUGCA 



CCCACCCC U UCCUGCAU 



ACCCCUUC C UGCAUCGC 



CCCCUUCC U GCAUCGCU 



CUUCCUGC A UCGCUACU 



UGCAUCGC U ACUACCAG 



AUCGCUAC U ACCAGAGG 



GCUACUAC C AGAGGCAG 
CUACUACC A GAGGCAGC 



CCAGAGGC A GCUGUCCA 



GAGGCAGC U GUCCAGCA 



AAAGUUAC CUGAUGAG X CGAA ICUGCCUG 



CACUGCAA CUGAUGAG X CGAA IUUACUGC 



GCACCCAC CUGAUGAG X CGAA ICAAAGUU 



UGGGGGGC CUGAUGAG X CGAA ICACCCAC 



GGGUGGGG CUG ICAGCACC 



GGGGUGGG CUGAUGAG X CGAA 



AGGGGUGG CUGAUGAG X CGAA 



IGCAGCAC 
IGGCAGCA 



AAGGGGUG CUGAUGAG X CGAA IGGGCAGC 



GAAGGGGU CUGAUGAG X CGAA IGGGGCAG 



GGAAGGGG CUGAUGAG X CGAA IGGGGGCA 



CAGGAAGG CUGAUGAG X CGAA IUGGGGGG 



GCAGGAAG CUGAUGAG X CGAA IGUGGGGG 



UGCAGGAA CUGAUGAG X CGAA IGGUGGGG 



AUGCAGGA CUGAUGAG X CGAA IGGGUGGG 



GCGAUGCA CUGAUGAG X CGAA IAAGGGGU 



AGCGAUGC CUGAUGAG X CGAA IGAAGGGG 



AGUAGCGA CUGAUGAG X CGAA ICAGGAAG 



CUGGUAGU CUGAUGAG X CGAA ICGAUGCA 



CCUCUGGU CUGAUGAG X CGAA IUAGCGAU 



CUGCCUCU CUGAUGAG X CGAA IUAGUAGC 



GCUGCCUC CUGAUGAG X CGAA IGUAGUAG 



UGGACAGC CUGAUGAG X CGAA ICCUCUGG 



UGCUGGAC CUGAUGAG X CGAA ICUGCCUC 
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CAGCUGUC C AGCACAUA 



AGCUGUCC A GCACAUAC 



U GUCCAGC A CAUACCGG 
UCCAGCAC A UACCGGGA 



GCACAUAC C GGGACCUC 



ACCGGGAC C UCCGGAAG 



CCGGGACC U CCGGAAGG 



GGGACCUC C GGAAGGGU" 



GUAUGUGC C CUACACCC~ 



UAUGUGCC C UACACCCA 



AUGUGCCC U ACACCCAG 



UGCCCUAC A CCCAGGGC 



CCCUACAC C CAGGGCAA 



ccuacacc c agggcaagT 



CUACACCC A GGGCAAGU 



CCCAGGGC A AGUGGGAA 



AGGGGAGC U GGGCACCG~ 



AGCUGGGC A CCGACCUG 



CUGGGCAC C GACCUGGtT 



GCACCGAC C UGGUAAGC 



CACCGACC U GGUAAGCA~ 



UAUGUGCU CUGAUGAG X CGAA IACAGCUG 



GUAUGUGC CUGAUGAG X CGAA IGACAGCU 



CCGGUAUG CUGAUGAG X CGAA ICUGGACA~ 



'UCCCGGUA CUGAUGAG X CGAA IUGCUGGA~ 



GAGGUCCC CUGAUGAG X CGAA IUAUGUGC~ 



" CUUCCGGA CUGAUGAG X CGAA IUCCCGGU 



CCUUCCGG CUGAUGAG X CGAA IGUCCCGG 



ACCCUUCC CUGAUGAG X CGAA IAGGUCCC 



GGGUGUAG CUGAUGAG X CGAA ICACAUAC 



UGGGUGUA CUGAUGAG X CGAA IGCACAUA 



" CUGGGUGU CUGAUGAG X CGAA IGGCACAU 



GCCCUGGG CUGAUGAG X CGAA IUAGGGCA~ 



UUGCCCUG CUGAUGAG X CGAA IUGUAGGG 



"CUUGCCCU CUGAUGAG X CGAA IGUGUAGG 



ACUUGCCC CUGAUGAG X CGAA IGGUGUAG 



UUCCCACU CUGAUGAG X CGAA ICCCUGGG 



UGGUAAGC A UCCCCCAU 



UAAGCAUC C CCCAUGGC 



AAGCAUCC C CCAUGGCC~~ 



AGCAUCCC C CAUGGCCC" 



GCAUCCCC C AUGGCCCC 



CAUCCCCC A UGGCCCCA 



CCCAUGGC C CCAACGUC 



CCAUGGCC C CAACGUCA" 



~CAUGGCCC C AACGUCAC 



AUGGCCCC A ACGUCACU 



CCAACGUC A CUGUGCGU" 



AACGUCAC U GUGCGUGC~ 



GUGCGUGC C AACAUUGC 



UGCGUGCC A ACAUUGCU 



GUGCCAAC A UUGCUGCC 



AACAUUGC U GCCAUCAC" 



AUUGCUGC C AUCACUGA 



UUGCUGCC A UCACUGAA 



CUGCCAUC A CUGAAUCA 



GCCAUCAC U GAAUCAGA 



ACUGAAUC A GACAAGUU 



AAUCAGAC A AGUUCUUC" 



ACAAGUUC U UCAUCAAC 



AGUUCUUC A UCAACGGC 



UCUUCAUC A ACGGCUCC 



CGGUGCCC CUGAUGAG X CGAA ICUCCCCU 



CAGGUCGG CUGAUGAG X CGAA ICCCAGCU 
ACCAGGUC CUGAUGAG X CGAA IUGCCCAG 



GCUUACCA CUGAUGAG X CGAA IUCGGUGC 



UGCUUACC CUGAUGAG X CGAA IGUCGGUG 



AUGGGGGA CUGAUGAG X CGAA ICUUACCA 



GCCAUGGG CUGAUGAG X CGAA IAUGCUUA 



GGCCAUGG CUGAUGAG X CGAA IGAUGCUU 



GGGCCAUG CUGAUGAG X CGAA IGGAUGCU~ 



GGGGCCAU CUGAUGAG X CGAA IGGGAUGC 



UGGGGCCA CUGAUGAG X CGAA IGGGGAUG 



GACGUUGG CUGAUGAG X CGAA ICCAUGGG 



UGACGUUG CUGAUGAG X CGAA IGCCAUGG 



GUGACGUU CUGAUGAG X CGAA IGGCCAUG~ 



"AGUGACGU CUGAUGAG X CGAA IGGGCCAU 



1 ACGCACAG CUGAUGAG X CGAA IACGUUGG 



GCACGCAC CUGAUGAG X CGAA IUGACGUU 



GCAAUGUU CUGAUGAG X CGAA ICACGCAC 



UCAACGGC U CCAACUGG 



AACGGCUC C AACUGGGA 



ACGGCUCC A ACUGGGAA~ 



GCUCCAAC U GGGAAGGcT 



GGGAAGGC A UCCUGGGg" 



AGCAAUGU CUGAUGAG X CGAA IGCACGCA 



GGCAGCAA CUGAUGAG X CGAA IUUGGCAC 



GUGAUGGC CUGAUGAG X CGAA ICAAUGUU~ 



UCAGUGAU CUGAUGAG X CGAA ICAGCAAU 



~UUCAGUGA' CUGAUGAG X CGAA IGCAGCAA 



"UGAUUCAG CUGAUGAG X CGAA IAUGGCAG 



~UCUGAUUC CUGAUGAG X CGAA IUGAUGGC 



AACUUGUC CUGAUGAG X CGAA IAUUCAGU 



GAAGAACU CUGAUGAG X CGAA IUCUGAUU 



GUUGAUGA CUGAUGAG X CGAA IAACUUGU 



GCCGUUGA CUGAUGAG X CGAA IAAGAACU 



GGAGCCGU CUGAUGAG X CGAA IAUGAAGA 



CCAGUUGG CUGAUGAG X CGAA ICCGUUGA 



UCCCAGUU CUGAUGAG X CGAA IAGCCGUU~ 



UUCCCAGU CUGAUGAG X CGAA IGAGCCGU 



"GCCUUCCC CUGAUGAG X CGAA IUUGGAGC 



CCCCAGGA CUGAUGAG X CGAA ICCUUCCC 



2351 
2352 
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991 


AAGGCAUC C UGGGGCUG 


. 587 


_ CAGCCCCA CUGAUGAG X IGAUgccu 




992 


AGGCAUCC U GGGGCUGG 


588 


CCAGCCCC GAUGAG x CGAA ICCCCAG G 


23g3 


998 


CCUGGGGC U GGCCUAUG 


589 


CAUAGGCC CUGAU^^ _ lcchGCCC 


~2364 


1002 


GGGCUGGC C UAUGCUGA 


590 


UCAGCAUA CUGAU 


2365 


1003 


GGCUGGCC U AUGCUGAG 


591 


CUCAGCAU CUGAUGAG X CGAA IGCCAGCX 


— 2366 


1008 


GCCUAUGC U GAGAUUGC 


592 


GCAAUCUC CUGAUGAG X CGAA ICAUAGGC 


2367 


1017 


GAGAUUGC C AGGCCUGA 


593 


UCAGGCCU CUGAUGAG X 





1018 


AGAUUGCC A GGCCUGAC 


594 


GUCAGGCC CUGAUGAG X CGAA IGCAAULU 




1022 


UGCCAGGC C UGACGACU 


595 


AGUCGUCA CUGAUGAG X CGAA "ICCUGGCJA 


237Q 


1023 


GCCAGGCC U GACGACUC 


596 


GAGUCGUC CUGAUGAG X CGAA IGCCUGG ' 


~ 2371 


1030 


CUGACGAC U CCCUGGAG 


597 


CUCCAGGG CUGAUGAG X CGAA IUCGUUA.tj 


__ 2372 


1032 


GACGACUC C CUGGAGCC 


598 


GGCUCCAG CUGAUGAG X CGAA IAGUCGUC 


~ 2373 


1033 


ACGACUCC C UGGAGCCU 


599 


AGGCUCCA CUGAUGAG X CGAA IGAGUCGU 


■ ■ 23?4 


1034 


CGACUCCC U GGAGCCUU 


600 


AAGGCUCC CUGAUGAG X CGAA IGGAGUCG 


— 2375 


1040 


CCUGGAGC C UUUCUUUG 


601 


CAAAGAAA CUGAUGAG X CGAA ICUCCAGG 


2376 


1041 


CUGGAGCC U UUCUUUGA 


602 


UCAAAGAA CUGAUGAG X CGAA IGCUCCAU 


2377 




AGCCUUUC U UUGACUCU 


6C3 


AGAGUCAA CUGAUGAG X CGAA IAAAGGCU 


2378 


1051 


UCUUUGAC U CUCUGGUA 


604 


UACCAGAG CUGAUGAG X CGAA IUCAAAGA 


2379 1 


1053 


UUUGACUC U CUGGUAAA 


605 


UUUACCAG CUGAUGAG X CGAA IAGUCAAA 




1055 


UGACUCUC U GGUAAAGC 


606 


GCUUUACC CUGAUGAG X CGAA IAGAGUCA 


2T8l 


1064 


GGUAAAGC A GACCCACG 


607 


CGUGGGUC CUGAUGAG X CGAA ICUUUACC 


2382 


1068 


. AAGCAGAC C CACGUUCC 


608 


GGAACGUG CUGAUGAG X CGAA IUCUGCUU 


2383 


1069 


AGCAGACC C ACGUUCCC 


609 


GGGAACGU CUGAUGAG X CGAA IGUCUGCU 


2384 


1070 


GCAGACCC A CGUUCCCA 


610 


UGGGAACG CUGAUGAG X CGAA IGGUCUUC 
AGAGGUUG CUGAUGAG X CGAA IAACGUGG 


2285 

— 2386 


1076 
1077 


CCACGUUC C CAACCUCU 
CACGUUCC C AACCUCUU 


611 
612 


AAGAGGUU CUGAUGAG X CGAA IGAACGUG 


2387 


1078 


ACGUUCCC A ACCUCUUC 


613 


GAAGAGGU CUGAUGAG X CGAA IGGAACGU 


2388 


1081 


UUCCCAAC C UCUUCUCC 


614 


GGAGAAGA CUGAUGAG X CGAA IUUGGGAA 


2381 


1082 


UCCCAACC U CUUCUCCC 


615 


GGGAGAAG CUGAUGAG X CGAA IGUUGGGA 


2290 

2391 


1084 


CCAACCUC U UCUCCCUG 


616 


CAGGGAGA CUGAUGAG X CGAA IAGGUUUU 




1087 


ACCUCUUC U CCCUGCAG 


617 


CUGCAGGG CUGAUGAG X CGAA IAAGAGGU 


2292 


1089 


CUCUUCUC C CUGCAGCU 


618 


AGCUGCAG CUGAUGAG X CGAA IAGAAGAG 


23~93 


1090 


UCUUCUCC C UGCAGCUU 


619 


AAGCUGCA CUGAUGAG X CGAA IGAGAAGA 


2394 


1091 


CUUCUCCC U GCAGCUUU 


620 


AAAGCUGC CUGAUGAG X CGAA l ^ Q ^ " — 


2395 


1094 


CUCCCUGC A GCUUUGUG 


621 


CACAAAGC CUGAUGAG X CGAA ICJAGGUAli 


2396 


1097 


CCUGCAGC U UUGUGGUG 


622 


CACCACAA CUGAUGAG X CGAA ICUGCAGG 


2397 


1107 


UGUGGUGC U GGCUUCCC 


623 


GGGAAGCC CUGAUGAG X CGAA 




1111 


GUGCUGGC U UCCCCCUC 


624 


GAGGGGGA CUGAUGAG X CGAA ICCAGCAC 


— ilii — 


1114 


CUGGCUUC C CCCUCAAC 


525 


GUUGAGGG CUGAUGAG X CGAA IAAGCCAG 


2400 


1115 


UGGCUUCC C CCUCAACC 


626 


GGUUGAGG CUGAUGAG X CGAA IGAAGCtJA 


2401 


1116 


GGCUUCCC C CUCAACCA 


627 


UGGUUGAG CUGAUGAG X ________ 


2402 


1117 


GCUUCCCC C UCAACCAG 


628 


^q^q CUGAUGAG X CGAA IGGGGAAG 


2403 


1118 


CUUCCCCC U CAACCAGU 


629 




2404 
24 0 5 




UCCCCCUC A ACCAGUCU 


630 


AGACUGGU CUGAUGAG X CGAA IAGGGGGA 


1123 


CCCUCAAC C AGUCUGAA 


631 


UUCAGACU CUGAUGAG X CGAA IUUGAGGG 


2406 


1124 


CCUCAACC A GUCUGAAG 


632 


CUUCAGAC CUGAUGAG X CGAA IGUUGAGG 


2407 


1128 


AACCAGUC U GAAGUGCU 


633 


AGCACUUC CUGAUGAG X CGAA IACUGGUU 


2408 


1136 


UGAAGUGC U GGCCUCUG 


634 


CAGAGGCC CUGAUGAG X CGAA ICACUUCA 


2409 


1140 


GUGCUGGC C UCUGUCGG 


635 


CCGACAGA CUGAUGAG X CGAA ICCAGCAC 


2410 


1141 


UGCUGGCC U CUGUCGGA 


636 


UCCGACAG CUGAUGAG X CGAA IGCCAGCA 


2411 • 


1143 


CUGGCCUC U GUCGGAGG 


637 


CCUCCGAC CUGAUGAG X CGAA IAGGCCAG 


2412 
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WO 01/16312 



PCT/US00/23998 



1459 
1461 
1462 



1477 
1478 
1479 



1490 
1492 
1495 
1496 
1512 
1513 
1516 



1522 
1525 
1528 



1541 
1544 
1547 
1552 



1575 
1576 



GAAGUUCC C UGAUGGUU 
AAGUUCCC U GAUGGUUU 
AUGGUUUC U GGCUAGGA 
UUUCUGGC U AGGAGAGC 
AGGAGAGC A GCUGGUGU 
AGAGCAGC U GGUGUGCU 
UGGUGUGC U GGCAAGCA 
GUGCUGGC A AGCAGGCA 
UGGCAAGC A GGCACCAC 
AAGCAGGC A CCACCCCU 
GCAGGCAC C ACCCCUUG 
CAGGCACC A CCCCUUGG 
GGCACCAC C CCUUGGAA 
GCACCACC C CUUGGAAC 
CACCACCC C UUGGAACA 
ACCACCCC U UGGAACAU 
CUUGGAAC A UUUUCCCA 
ACAUUUUC C CAGUCAUC 
CAUUUUCC C AGUCAUCU 
AUUUUCCC A GUCAUCUC 
UCCCAGUC A UCUCACUC 
CAGUCAUC U CACUCUAC 
GUCAUCUC A CUCUACCU 
CAUCUCAC U CUACCUAA 
UCUCACUC U ACCUAAUG 
CACUCUAC C UAAUGGGU 
ACUCUACC U AAUGGGUG 
GAGGUUAC C AACCAGUC 
AGGUUACC A ACCAGUCC 
UUACCAAC C AGUCCUUC 
UACCAACC A GUCCUUCC 
AACCAGUC C UUCCGCAU 
ACCAGUCC U UCCGCAUC 
AGUCCUUC C GCAUCACC 
CCUUCCGC A UCACCAUC 
UCCGCAUC A CCAUCCUU 
CGCAUCAC C AUCCUUCC 
GCAUCACC A UCCUUCCG 
UCACCAUC C UUCCGCAG 
CACCAUCC U UCCGCAGC 
CAUCCUUC C GCAGCAAU 
CCUUC CGC A GCAAUACC 
UCCGC AGC A AUACCUGC 
AGCAAUAC C UGCGGCCA 
GCAAUACC U GCGGCCAG 
CCUGCGGC C AGUGGAAG 
CUGCGGCC A GUGGAAGA 
GAUGUGGC C ACGUCCCA 
AUGUGGCC A CGUCCCAA 
GCCACGUC C CAAGACGA 
CCACGUCC C AAGACGAC 



AACCAUCA CUGAUGAG X CGAA IGAACUUC 
AAACCAUC CUGAUGAG X CGAA IGGAACUU 
UCC UAGCC CUGAUGAG X CGAA IAAACCAU 
GCUCUCCU CUGAUGAG X CG AA ICCAGAAA 
ACA CCAGC CUGAUGAG X CGAA ICUCUCCU 
AGCACACC CUGAUGAG X CGAA ICUGCUCU 
UGC UUGCC CUGAUGAG X CGAA ICACACCA 
UGCCUGCU CUGAUG AG X CGAA ICCAGCAC 
GUGGUGCC CUGAUGAG X CGAA ICUUGCCA 
AGGGGUGG CUGA UGAG X CGAA ICCUGCUU 
CAAGGGGU CUGAUGAG X CGAA IUGCCUGC 
CCAAGGG G CUGAUGAG X CGAA IGUGCCUG 
UUCC AAGG CUGAUGAG X CGAA IUGGUGCC 
GUUCCAAG CUGAUG AG X CGAA IGUGGUGC 
UGUUCCAA CUGAUGAG X CGAA I GGUGGUG 
K CGAA IGGGUGGU 
UGGGAAAA CUGAUGAG X CGAA IUUCCAAG 
GAUGACUG CUGAUGAG X CGAA IAAAAUGU 
AGAUGACU CUG AUGAG X CGAA IGAAAAUG 
GAGAUGAC C UGAUGAG X CGAA IGGAAAAU 
GAGUGAGA CUGAUGAG X CGAA IACUGGGA 
GUAGAGUG CUGAUGA G X CGAA IAUGACUG 
AGGUAGAG CUGAUGAG X CGAA IAGAUGAC 
UUAGGUAG CUGAUGAG X CGAA I UGAGAUG 
CAUUAGGU CUGAUGAG X CGAA IAGUGAGA 
ACCCAUUA CUGAUG AG X CGAA IUAGAGUG 
CACCCAUU CUGA UGAG X CGAA IGUAGAGU 
GACUGGUU CUGAUGAG X CGAA IUAACCUC 
GGACUGGU CUGAUGAG X CGAA IGUAACCU 
GAAGGACU CUGAUGAG X CGAA IUUGGUAA 
GGAAGGAC CUGAUGAG X CGAA IGUUGGUA 
AUGCGGAA CUGAUGAG X CGAA IACUGGUU 
GAUGCGGA CUGAUGAG X CGAA IGACUGGU ~ 
GGUGAUGC CUGAUGAG X CGAA IAAGGACU 
GAUGGUGA CUGAUGAG X CGAA ICGGAAGG 
AAG GAUGG CUGAUGAG X CGAA IAUGCGGA 
GG AAGGAU CUGAUGAG X CGAA IUGAUGCG 
CGGAAGGA CUGAUGAG X CGAA IGUGAUGC ~ 
CUGCGGAA CUGAUGAG X C GAA IAUGGUGA ~ 
GCUGC GGA CUGAUGAG X CGAA IGAUGGUG 
AUUGCUGC CUGAUG AG X CGAA IAAGGAUG 
GGUAUUGC CUGAUGAG X CGAA ICGGAAGG 
G CAGGUAU CUGAUGAG X CGAA ICUGCGGA 
UGGCCG CA CUGAUGAG X CGAA IUAUUGCU 
CUGGCCGC CUGAUGAG X C GAA IGUAUUGC 
CUUCCAC U CUGAUGAG X CGAA ICCGCAGG 
UCUUCCAC CUGAUGAG X CGA A IGCCGCAG 
UGGG ACGU CUGAUGAG X CGAA ICCACAUC 
UUGGGACG CUGAUGAG X C GAA IGCCACA'J~ 
UCGUCUUG CUGAUGAG X CGAA IACGUGGC 
GUCGUCUU CUGAUGAG X CGAA IGACGUGG 



2492 
2493 



2502 
2503 



2506 
2507 
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1597 
1605 



1657 
1650 
1669 
1680 
1681 
1696 
1701 
1705 



1732 
1740 
1753 



1786 
1789 



AAGACGAC U GUUACAAG 
ACUGUUAC A AGUUUGCC 
AAGUUUGC C AUCUCACA 
AGUUUGCC A U CUCACAG 
UUGCCAUC U CACAGUCA 
GCCAUCUC A CAGUCAUC 
CAUCUCAC A GUCAUCCA 
UCACAGUC A UCCACGGG 
CAGUCAUC C ACGGGCAC 
AGUCAUCC A CGGGCACU 
CCACGGGC A CUGUUAUG 
ACGGGCAC U GUUAUGGG 
AUGGGAGC U GUUAUCAU 
CUGUUAUC A UGGAGGGC 
UGGAGGGC U UCUACGUU 
AGGGCUUC U ACGUUGUC 
ACGUUGUC U UUGAUCGG 
GAUCGGGC C CGAAAACG 
AUCGGGCC C GAAAACGA 
GAAUUGGC U UUGCUGUC 
GGCUUUGC U GUCAGCGC 
UUGCUGUC A GCGCUUGC 
GUCAGCG C U UGCCAUGU 
GCGCUUGC C AUGUGCAC 
CGCUUGCC A UGUGCACG 
CCAUGUGC A CGAUGAGU 
AUGAGUUC A GGACGGCA 
AGGACGGC A GCGGUGGA 
UGGAAGGC C CUUUUGUC 
GGAAGGCC C UUUUGUCA 
GAAGGCCC U UUUGUCAC 
CUUUUGUC A CCUUGGAC 
UUUGUCAC C UUGGACAU 
UUGUCACC U UGGACAUG 
CCUUGGAC A UGGAAGAC 
UGGAAGAC U GUGGCUAC 
ACUGUGGC U ACAACAUU 
GUGGCU AC A ACAUUCCA 
GCUACAAC A UUCCACAG 
CAACAUUC C ACAGACAG 
AACAUUCC A CAGACAGA 
CAUUCCAC A GACAGAUG 
CCACAGAC A GAUGAGUC 
GAUGAGUC A ACCCUCAU 
GAGUCAAC C CUCAUGAC 
AGUCAACC C UCAUGACC 
GUCAACCC U CAUGACCA 
CAACCCUC A UGACCAUA 
CUCAUGAC C AUAGCCUA 
UCAUGACC -A UAGCCUAU 



AGUC GUCU CUGAUGAG X CGAA IGGACGUG 
CUUGUAAC CUGAUGAG X CGAA IUCGUCUU 
GGC AAACU CUGAUGAG X CGAA IUAACAC-U 
UGUGAGAU CUGAUGAG X CGAA ICAAACUU 
CUGUGAGA CUGAUGAG X CGAA IGCAAACU 
UGACUGUG CUGAUGAG X CGAA IAUGGCAA 
GAUGACUG CUGAUGAG X CGAA IAGAUGGC 
UGGAUGAC CUGAUGAG X CGAA I UGAGAUG 
CC CGUGGA CUGAUGAG X CGAA IACUGUGA 
GUGCCCGU CUGAUGAG X CGAA IAUGACUG 
AGUGCCCG CUGAUGAG X CGAA IGAUGACU 
CAUAACA G CUGAUGAG X CGAA ICCCGUGG 
CCCA UAAC CUGAUGAG X CGAA IUGCCCGU 
AUGAUAAC CUGAUGAG X CGAA ICUCCCAU 
GCCCUCCA CUGAUGAG X CGAA I AUAACAG 
AACGUAGA CUGAUGAG X CGAA ICCCUCCA~ 
GACAACGU CUGAUGAG X CG AA IAAGCCCU~ 
CCGAUCAA CUGAUGAG X CGAA IACAACGU 
CGUUUUCG CUGAUGAG X CGAA ICCCGAUC 
UCGUUUUC CUGAU GAG X CGAA IGCCCGAU 
GACAGCAA CUGAUGAG X CGAA ICCAAUUC 
GCGCUGAC CUGAUGAG X CGAA ICAAAGCC 
GCAAGCGC CUGAUGAG X CGAA IACAGCAA 

ACAUGGCA CUGAUGAG X CGAA T 

GUGCACAU CUGAUGAG X CGAA ICAAGCGC 
CGUGCACA CUGA UGAG X CGAA I GCAAGCG 
ACUCAUCG CUGAUGAG X CG AA ICACAUGG 
UGCCGUCC CUGAUGAG X CGAA IAACUCAU 
UCCACCGC CUGAUGAG X CGAA ICCGUCCU 
GACAAAAG CUGAUGAG X CGAA ICCUUCCA 
UGACAAAA CUGAUG AG X CGAA IGCCUUCC~ 
GUGACAAA CUGAUGAG X CGAA IGGCCUUC 
GUCCAAGG CUGAUGAG X CGAA IACAAAAG 
AUGUCCAA CUGAUGAG X CGAA IUGACAAA 
CAUGUCCA C UGAUGAG X CGAA IGUGACAA 
GUCUUCCA CUGAU GAG X CGAA IUCCAAGG 
GUAGCCAC C UGAUGAG X CGAA IUCUUCCA 
AAUGUUGU CUGAUGAG X CGAA ICCACAGU 
UGGAAUGU CUGAUGAG X CGAA IUAGCCAC 
CUGUGG AA CUGAUGAG X CGAA IUUGUAGC 
CUGUCUGU CUGAUGAG X CGAA IAAUGUUG 
UCUGUCUG CUGAUGAG X CGAA IGAAUGUU 
CAUCUGUC C UGAUGAG X CGAA IUGGAAUG 
GACUC AUC CUGAUGAG X CGAA IUCUGUGG 
AUGAGGGU CUGAUGAG X CGAA IACUCAUC 
GUCAUGA G CUGAUGAG X CGAA IUUGACUC 
GGUCAUGA CUGAUGAG X CGAA IGUUGACU 
UGGUCAUG CUGAUGAG X CGAA IGGUUGAC 
' UAUGGUCA CUGAUGAG X CGAA IAGGGUUG 
UAGGCUAU CUGAUGAG X CGAA IUCAUGAG 
AUAGGCUA CUGAUGAG X CGAA IGUCAUGA 



2553 
2554 
2555 
2556 



WO 01/16312 



PCT/US00/23998 



1837 
1842 



1855 
1856 
1858 
1861 



1900 
1901 
1903 



1935 
1942 



ACCAUAGC C UAUGUCAU 
CCAUAGCC U AUGUCAUG 
CCUAUGUC A UGGCUGCC 
GUCAUGGC U GCCAUCUG 
AUGGCUGC C AUCUGCGC 
UGGCUGCC A UCUGCGCC 
CUGCCAUC U GCGCCCUC 
AUCUGCGC C CUCUUCAU 
UCUGCGCC C UCUUCAUG 
CUGCGCCC U CUUCAUGC 
GCGCCCUC U UCAUGCUG 
CCCUCUUC A UGCUGCCA 
CUUCAUGC U GCCACUCU 
CAUGCUGC C ACUCUGCC 
AUGCUGCC A CUCUGCCU 
GCUGCCAC U CUGCCUCA 
UGCCACUC U GCCUCAUG 
CACUCUGC C UCAUGGUG 
ACUCUGCC U CAUGGUGU 
UCUGCCUC A UGGUGUGU 
GGUGUGUC A GUGGCGCU 
AGUGGCGC U GCCUCCGC 
GGCGCUGC C UCCGCUGC 
GCGCUGCC U CCGCUGCC 
GCUGCCUC C GCUGCCUG 
GCCUCCGC U GCCUGCGC 
UCCGCUGC C UGCGCCAG 
CCGCUGCC U GCGCCAGC 
GCCUGCGC C AGCAGCAU 
CCUGCGCC A GCAGCAUG 
GCGCCAGC A GCAUGAUG 
CCAGCAGC A UGAUGACU 
AUGAUGAC U UUGCUGAU 
GACUUUGC U GAUGACAU 



AUGACAUA CUGAUGAG X C 



. ICUAUGGU 



CAUGACAU CUGAUGAG X CGAA IGCUAUGG 

GGCAGCCA CUGAUGAG X CGAA IACAUAGG 

CAGAUGGC CUGAUGAG X CGAA ICCAUGAC 

GCGCAGAU CUGAUGAG X CGAA ICAGCCAU 

GGCGCAGA CUGAUGAG X CGAA IGCAGCCA 

GAGGGCGC CUGAUGAG X CGAA IAUGGCAG 

AUGAAGAG CUGAUGAG X CGAA ICGCAGAU 

CAUGAAGA CUGAUGAG X CGAA IGCGCAGA 

GCAUGAAG CUGAUGAG X CGAA IGGCGCAG 

CAGCAUGA CUGAUGAG X CGAA IAGGGCGC 

UGGCAGCA CUGAUGAG X CGAA IAAGAGGG 

AGAGUGGC CUGAUGAG X CGAA ICAUGAAG 

GGCAGAGU CUGAUGAG X CGAA ICAGCAUG 

AGGCAGAG CUGAUGAG X CGAA IGCAGCAU 

UGAGGCAG CUGAUGAG X CGAA IUGGCAGC 

CAUGAGGC CUGAUGAG X CGAA IAGUGGCA 

CACCAUGA CUGAUGAG X CGAA I CAGAGUG 

ACACCAUG CUGAUGAG X CGAA IGCAGAGU 

ACACACCA CUGAUGAG X CGAA IAGGCAGA 

AGCGCCAC CUGAUGAG X CGAA IACACACC 

GCGGAGGC CUGAUGAG X CGAA ICGCCACU 

GCAGCGGA CUGAUGAG X CGAA ICAGCGCC 

GGCAGCGG CUGAUGAG X CGAA IGCAGCGC 

CAGGCAGC CUGAUGAG X CGAA IAGGCAGC 

GCGCAGGC CUGAUGAG X CGAA I CGGAGGC 
CUGGCGCA CUGAUGAG X CGAA ICAGCGGA 

GCUGGCGC CUGAUGAG X CGAA I GCAGCGG 
AUGCUGCU CUGAUGAG X CGAA ICGCAGGC 

CAUGCUGC CUGAUGAG X CGAA IGCGCAGG 

CAUCAUGC CUGAUGAG X CGAA ICUGGCGC 
AGUCAUCA CUGAUGAG X CGAA ICUGCUGG 
AUCAGCAA CUGAUGAG X CGAA IUCAUCAU 

AUGUCAUC CUGAUGAG X CGAA 



CUGAUGAC A UCUCCCUG 



CAGGGAGA CUGAUGAG X CGAA IUCAUCAG 



1967 
1968 



1992 
1997 



AUGACAUC U CCCUGCUG 

GACAUCUC C CUGCUGAA 

ACAUCUCC C UGCUGAAG 

CAUCUCCC U GCUGAAGU 

CUCCCUGC U GAAGUGAG 

GAGGAGGC C CAUGGGCA 

AGGAGGCC C AUGGGCAG 

GGAGGCCC A UGGGCAGA 

CCAUGGGC A GAAGAUAG 

AGAGAUUC C CCUGGACC 

GAGAUUCC C CUGGACCA 

AGAUUCCC C UGGACCAC 

GAUUCCCC U GGACCACA 

CCCUGGAC C ACACCUCC 

CCUGGACC A CACCUCCG 

UGGACCAC A CCUCCGUG 



CAGCAGGG 



i CGAA IAUGUCAU 



UUCAGCAG CUGAUGAG X CGAA IAGAUGUC 

CUUCAGCA CUGAUGAG X CGAA IGAGAUGU 

ACUUCAGC CUGAUGAG X CGAA I GGAGAUG 

CUCACUUC CUGAUGAG X CGAA ICAGGGAG 

UGCCCAUG CUGAUGAG X CGAA ICCUCCUC 

CUGCCCAU CUGAUGAG X CGAA IGCCUCCU 

UCUGCCCA CUGAUGAG X CGAA IGGCCUCC 

CUAUCUUC CUGAUGAG X CGAA ICCCAUGG 

GGUCCAGG CUGAUGAG X CGAA IAAUCUCU 

UGGUCCAG CUGAUGAG X CGAA IGAAUCUC 

GUGGUCCA CUGAUGAG X CGAA IGGAAUCU 

UGUGGUCC CUGAUGAG X CGAA IGGGAAUC 

GGAGGUGU CUGAUGAG X CGAA IUCCAGGG 

CGGAGGUG CUGAUGAG X CGAA IGUCCAGG 

CACGGAGG CUGAUGAG X CGAA IUGGUCCA 



2608 
2609 



WO 01/16312 



PCT/US00/23998 



2054 
2059 
2061 
2062 
2064 
2069 
2070 
2071 



GACCACAC C UCCGUGGU 
ACCACACC U CCGUGGUU 
C AC AC CUC C G UGGUUCA 
CGUGGU UC A CUUUGGUC 
UGGUUCAC U UUGGUCAC 
CUUUGGUC A CAAGUAGG 
UUGGUCAC A AGUAGGAG 
UAGGAGAC A CAGAJGGC 
GGAG ACAC A GAUGGCAC 
CAGAUGGC A CCUGUGGC 
GAUGGCAC C UGUGGCCA 
A UGGCACC U GUGGCCAG 
CCUGUGGC C AGAGCACC 
CUGUGGCC A GAGCACCU 
G CCAGAGC A CCUCAGGA 
CAGAGCAC C UCAGGACC 
AGAGCACC U CAGGACCC 
AGCACCU C A GGACCCUC 
CU CAGGAC C CUCCCCAC 
UCAGGAC C C UCCCCACC 
CAGGACCC U CCCCACCC~~ 
GGACCCUC C CCACCCAC 



2083 
2089 
2090 
2092 
2095 
2096 



2154 



GACCCUCC C CACCCACC 



2176 
2178 
21S1 
2193 
2195 
2201 
2203 



CUCCCCAC C 
UCCCCACC C 



CCCCACCC A CCAAAUGC 



CCACCCAC C AAAUGCCU 



CACCCACC A AAUGCCUC 



: C UCUGCCUU 
: U CUGCCUUG 



CAAAUGCC 1 
AAUGCCUC U GCCUUGAU 



GCCUCUGC C UUGAUGGA 



CCUCUGCC U UGAUGGAG 



GAAAAGGC U GGCAAGGU 



AGGCUGGC A AGGUGGGU 



GUGGGUUC C AGGGACUG 



UGGGUUCC A GGGACUGU 



CCAGGGAC U GUACCUGU 
GACUGUAC C UGUAGGAA 



ACUGUACC U GUAGGAAA 



UAGGAAAC A GAAAAGAG 



AAAGAAGC A CUCUGCUG 



AGAAGCAC U CUGCUGGC 



AAGCACUC U GCUGGCGG 



CACUCUGCU 

GGGAAUAC U CUUGGUCA" 
GAAUACUC U UGGUCACC 
UCUUGGUC A CCUCAAAU 
UUGGUCAC C UCAAAUUU 



ACCAC GGA CUGAUGAG X CGAA IUGUGGUC 
AACCACGG CUGAUGAG X CGAA IGUGUGGU 
UGAACCAC CUGA UGAG X CGAA IAGGUGUG 
GACCAAAG CUGAUGAG X CG AA IAACCACG 
GUGACCAA CUGA UGAG X CGAA IUGAACCA~ 
CCUACUUG CUGAUGAG X CGAA IACCAAAG 
CUCCUA CU CUGAUGAG X CGAA IUGACCAA 
GCCAUCUG CUGAUGA G X CGAA IUCUCCUA 
GUGCCA UC CUGAUGAG X CGAA IUGUCUCC 
GCCACAGG CUGAUG AG X CGAA ICCAUCUG 
UGGC CACA CUGAUGAG X CGAA IUGCCAUC 
CUGGCCAC CUGAUGAG X CGA A 1GUGCCAU ~ 
GGUGCUCU CUGAUG AG X CGAA ICCACAGG~ 
AGGUGCUC CUGAUGAG X CGAA IGCCACAG~ 
UCCUGA GG CUGAUGAG X CGAA ICUCUGGC 
GGUCCUGA CUGAUGAG X CGAA IUGCUCUG 
GGGUCCUG CUGAUGA G X CGAA IGUGCUCu " 
GAGGGU CC CUGAUGAG X CGAA IAGGUGCU 
GUGGG GAG CUGAUGAG X CGAA IUCCUGAG 
GGUGGGG A CUGAUGAG X CGAA IGUCCUGA 
GGGUGGGG CUGAUGAG X CGAA IGGUCCUG 
GUGGGUGG CUGAUGAG X CGAA IAGGGUCC' 
GGUGGGUG CUGAUGAG X CGAA I GAGGGUC 



UGGUGGGU CUGAUGAG X CGAA IGGAGGGU 



UUGGUGGG CUGAUGAG X CGAA IGGGAGGG 



AUUUGGUG CUGAUGAG X CGAA I UGGGGAG 



CAUUUGGU CUGAUGAG X CGAA IGUGGGGA 



GCAUUUGG CUGAUGAG X CGAA IGGUGGGG 



"aGGCAUUU CUGAUGAG X CGAA IUGGGUGG 



GAGGCAUU CUGAUGAG X CGAA IGUGGGUG 



" AAG3CA3A CUGAUGAG X CGAA 



CAAGGCAG CUGAUGAG X CGAA 



ICAUUUGG 
IGCAUUUG 



AUCAAGGC CUGAUGAG X CGAA IAGGCAUU 



UCCAUCAA CUGAUGAG X CGAA I CAGAGGC 



CUCCAUCA CUGAUGAG X ( 



ACCUUGCC 
ACCCACCU 



CUGAUGAG X CGAA ICCUUUUC 



CUGAUGAG X CGAA ICCAGCCU 



CAGUCCCU CUGAUGAG X CGAA IAACCCAC 



ACAGUCCC CUGAUGAG X CGAA IGAACCCA 



ACAGGUAC CUG AUGAG X CGAA I 
UUCCUACA CUGAUGAG X CGAA 1 



UUUCCUAC CUGAUGAG X CGAA IGUACAGU 



CUCUUUUC CUGAUGAG X CGAA 



CAGCAGAG CUGAUGAG } 



IUUUCCUA 
ICUUCUUU 



GCCAGCAG CUGAUGAG J 



CCGCCAGC CUGAUGAG X CGAA IAGUGCUU 



UUCCCGCC CUGAUGAG X CGAA ICAGAGUG 
UGACCAAG CUGAUGAG X CGAA IUAUUCCC 
i CUGAUGAG X CGAA IAGUAUUC 



GGUGACCA 
AUUUGAGG C UGAUGAG X CGAA~ 
AAAUUUGA CUGAUGAG X CGAA 



IACCAAGA 
IUGACCAA 



WO 01/16312 



PCT/US00/23998 



UGGUCACC U CAAAUUUA 



UAAAUUUG CUGAUGAG X CGAA IGUGACCA~~ 



GUCACCUC A AAUUUAAG 



CUUAAAUU CUGAUGAG X CGAA IAGGUGAC 



GGAAAUUC U GCUGCUUG 



CAAGCAGC CUGAUGAG X CGAA IAAUUUCC 



AAUUCUGC U GCUUGAAA 



UUUCAAGC CUGAUGAG X CGAA ICAGAAUU 



UCUGCUGC U UGAAACUU 



AAGUUUCA CUGAUGAG X CGAA ICAGCAGA 



CUUGAAAC U UCAGCCCU 



AGGGCUGA CUGAUGAG X CGAA IUUUCAAG 



GAAACUUC A GCCCUGAA 
ACUUCAGC C CUGAACCU 



UUCAGGGC CUGAUGAG X CGAA IAAGUUUC 



AGGUUCAG CUGAUGAG X CGAA ICUGAAGU 



CUUCAGCC C UGAACCUU 



AAGGUUCA CUGAUGAG X CGAA IGCUGAAG 



UUCAGCCC U GAACCUUU 



AAAGGUUC CUGAUGAG X CGAA IGGCUGAA 



UGGACAAA CUGAUGAG X CGAA IUUCAGGG 



CCCUGAAC C UUUGUCCA 



2268 
2277 
2279 
2280 
2283 
2284 
2285 
229' 
2297 
2302 
2310 
2318 
2322 
2325 



2339 
2351 
2352 
2353 
2361 
2362 
2363 
2367 
2378 
2379 
2383 
2390 
2391 
2392 
2395 
2399 
2400 



CCUGAACC U UUGUCCAC 



GUGGACAA CUGAUGAG X CGAA IGUUCAGG~ 



CCUUUGUC C ACCAUUCC 



GGAAUGGU CUGAUGAG X CGAA IACAAAGG 



CUUUGUCC A CCAUUCCU 



AGGAAUGG CUGAUGAG X CGAA IGACAAAG 



UUGUCCAC C AUUCCUUU 



AAAGGAAU CUGAUGAG X CGAA IUGGACAA 



UGUCCACC A UUCCUUUA 
CACCAUUC C UUUAAAUU 
ACCAUUCC U UUAAAUUC 
UUAAAUUC U CCAACCCA 
AAAUUCUC C AACCCAAA 
AAUUCUC C A ACCCAAAG 
UCUCCAAC C CAAAGUAU 
CUCCAAC C C AAAGUAUU 
UCCAACCC A AAGUAUUC 
AAGUAUUC U UCUUUUCU 
UAUUCUUC U UUUCUUAG 
UUCUUUUC U UAGUUUCA 
UUAGUUUC A GAAGUACU 
AGAAGUAC U GGCAUCAC 
GUACUGGC A UCACACGC 
CUGGCAUC A CACGCAGG 
GGCAUCA C A CGCAGGUU 
UCACACGC A GGUUACCU 
CAG GUUAC C UUGGCGUG~ 
AGGUUACC U UGGCGUGU 
CGUGUGUC C CUGUGGUA 
GUGUGUCC C UGUGGUAC 
UGUGUCCC U GUGGUACC 
UGUGGUAC C CUGGCAGA 
GUGGUACC C UGGCAGAG 
UGGUACCC U GGCAGAGA 
ACCCUGGC A GAGAAGAG 
GAAGAGACTC AAGCUUGU 
AAGAG ACC A AGCUUGUU 
GACCAAGC U UGUUUCCC 
CUUGUUUC C CUGCUGGC 
UUGUUUCC C UGCUGGCC 
UGUUUCCC U GCUGGCCA 
UUCCCUGC U GGCCAAAG 
CUGCUGGC C AAAGUCAG 
UGCUGGCC A AAGUCAGU 



UAAAGGAA CUGAUGAG X CGAA IGUGGACA 
AAUUUAAA CUGAUGAG X C GAA IAAUGGUG 
GAAUUUAA CUGAUGAG X CGA A IGAAUGGU 
UGGG UUGG CUGAUGAG X CGAA IAAUUUAA 
UUUGGGUU CUGAUGAG X CGAA IAGAAUUU 
CUUUGGGU CUGAUGAG X CGAA I GAGAAUU 
AUACUUUG CUGAUGAG X CGAA IUUGGAGA 
AAUACUUU CUGA UGAG X CGAA IGUUGGAG 
GAAUACUU CUG AUGAG X CGAA I GGUUGGA 
AGAAAA GA CUGAUGAG X CGAA IAAUACUU 
CUAAGAAA CUGAUG AG X CGAA IAAGAAUA 
UGAAACUA CUGAUGAG X CGAA IAAAAGAA 
AGUACUUC CUGAUGAG X CGAA IAAACUAA 
GUGAU GCC CUGAUGAG X CGAA IUACUUCU 
GCGUGUGA CU GAUGAG X CGAA ICCAGUAC~~ 
CCUGCGUG CUGA UGAG X CGAA IAUGCCAG ~ 
AACCUGCG CUGAUGAG X CGAA IUGAUGCC 
AGGUAACC CUGAUGAG > 
CACGCCAA CUGAUGAG X CGAA IUAACCUG 
ACACGCCA CUGAU GAG X CGAA IGUAACCU~ 
UACCACAG CUGAU GAG X CGAA IACACACG~ 
GUACCACA CUGAUGAG X CGAA IGACACAC~~ 
GGUACCAC CUGAUG AG X CGAA IGGACACA 
UCUGCCAG CUGAUGAG X CGAA IUACCACA~ 
CUCUG CCA CUGAUGAG X CGAA IGUACCAC 
UCUCUGCC CUGAUGAG X CGAA IGGUACCA~ 
CUCUUCUC CUGAUGAG X CGAA ICCAGGGU~ 
ACAAGCUU CUGAUGAG X CGAA IUCUCUUC 
AACAAGCU CUGAUG AG X CGAA IGUCUCUU~ 
GGGAAACA CUGAUGAG X CGAA ICUUGGUC 
GCCAGCAG CUGAUGA G X CGAA IAAACAAG 
GGCCAGC A CUGAUGAG X CGAA IGAAACAA 
UGGCCAGC CUGAUG AG X CGAA IGGAAACA 
CUUUGGCC CUGAUGAG X CGAA I C AGGGAA 
CUGA CUUU CUGAUGAG X CGAA ICCAGCAG 
ACUGACUU CUGAUGAG X CGAA IGCCAGCA~ 



WO 01/16312 



PCT/US00/23998 



2423 
2430 
2437 



2465 
2466 
2470 
2473 
2487 
2488 
2490 
2509 



CCAAAG UC A GUAGGAGA 
GAGGAUGC A CAGUUUGC 
GGAUGCAC A GUUUGCUA 
CAGUUUGC U AUUUGCUU 
CUAUUUGC U UUAGAGAC 
UUAGAGAC A GGGACUGU 
ACAGGGAC U GUAUAAAC 
GUAUAAAC A AGCCUAAC 
AAACAAGC C UAACAUUG 
AACAAGCC U AACAUUGG 
AGCCUAAC A UUGGUGCA 
AUUGGUGC A AAGAUUGC 
AAGAUUGC C UCUUGAAU 
AGAUUGCC U CUUGAAUU 
AUUGCCUC U UGAAUUAA 
AAAAAAAC U AGAAAAAA 



UCUCCUAC CUGAUGAG X CGAA 
GCAAACUG CUGAUGAG X CGAA ICAUCCUC 
UAGCAAAC CUGAUGAG X CGAA IUGCAUCC 
AAGCAAAU CUGAUGAG X CGAA ICAAACUG 
GUCUCUAA CUGAUG AG X CGAA ICAAAUAG 
ACAGUCCC CUGAUGAG X CGAA IUCUCUAA 
GUUU AUAC CUGAUGAG X CGAA IUCCCUGU 
GUUAGGCU CUGAUGAG X CGAA IUUUAUAC 
CAAUGUUA CUGAUGAG X CGAA ICUUGUUU 
CCAAUGUU CUGAUGAG X CGAA IGCUUGUU 
UGCACCAA CUGAUGAG X CGAA IUUAGGCU 
GCAAUCUU CUGA UGAG X CGAA ICACCAAU~ 
AUUCAAGA CUGAUGAG X CGAA ICAAUCUU~ 
AAUUCAAG CUGAUGAG X CGAA IGCAAUCU 
UUAAUU CA CUGAUGAG X CGAA I AGGCAAU 
UUUUUUCU CUGAUGAG X CGAA IUUUUUUU 



Input Sequence = AF190725. Cut Site = G/ . 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II) 
AF190725 (Homo sapiens beta-site APP cleaving enzyme (BACE) mRNA ,- 2526 bp) 
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Table 21 



Table 21: Human BACE Zinzyme Ribozyme and Target Sequence 



Pos 


Substrate 


Seq ID 


Ribozyme 


ID 


11 


c G CAGCCCGC 

ACGCGUCC G ________ 


~ 960 


GCGGGCUG GCCGAAAGGCGAGUCAAGGUCU GGACGCGU 


2921 


18 


CGCAGCCC G g£gg__ . 


— — 


CUCCCGGG GCCGAAAGGCGAGUCAAGGUCU GGGCUGCG 


2922 


29 




~962 


GCGGCUCG GCCGAAAGGCGAGUCAAGGUCU AGCUCCCG 


2923 


36 


UGCGAGCC G CGAGCUGG ~ 


~964 


CCAGCUCG GCCGAAAGGCGAGUCAAGGUCU GGCUCGCA 


2924 


69 


CAGCCAAC G CAGCCGCA 


96 j 


UGCGGC'JG GCCGAAAGGCGAGUCAAGGUCU GUUGGCUG 


2925 


75 


ACGCAGCC G CAGGAGCC 


9gB 


GGCUCCUG GCCGAAAGGCGAGUCAAGGUCU GGCUGCGU 


2926 


94 


GAGCCCUU G _______ 


— 969 — 


GGCAGGGG GCCGAAAGGCGAGUCAAGGUCU AAGGGCUC 


2927 


100 




— 97 0 


GGCGCGGG GCCGAAAGGCGAGUCAAGGUCU AGGGGCAA 


2928 


104 


UUGCCCCU G CCCGC^^ 
. CCCUGCCC G 


— 971 


CGGCGGCG GCCGAAAGGCGAGUCAAGGUCU GGGCAGGG 


2929 


106 


CUGCCCGC G CCGCCGCC 


— 972 


GGCGGCGG GCCGAAAGGCGAGUCAAGGUCU GCGGGCAG 


2930 


109 


CCCGCGCC G CCGCCCGC 


973 


GCGGGCGG GCCGAAAGGCGAGUCAAGGUCU GGCGCGGG 


2931 


112 


GCGCCGCC G CCCGCCGG 


974 


CCGGCGGG GCCGAAAGGCGAGUCAAGGUCU GGCGGCGC 


2932 


116 


CGCCGCCC G CCGGGGGG 


975 


CCCCCCGG GCCGAAAGGCGAGUCAAGGUCU GGGCGGCG 


2933 


137 


GGGAAGCC G CCACCGGC 


976 


GCCGGUGG GCCGAAAGGCGAGUCAAGGUCU GGCUUCCC 


2934 


148 


ACCGGCCC G CCAUGCCC 


977 


GGGCAUGG GCCGAAAGGCGAGUCAAGGUCU GGGCCGGU 


2935 


153 


CCCGCCAU G CCCGCCCC 


978 


GGGGCGGG GCCGAAAGGCGAGUCAAGGUCU AUGGCGGG 


2936 


157 


CCAUGCCC G CCCCUCCC 


979 


GGGAGGGG GCCGAAAGGCGAGUCAAGGUCU GGGCAUGG 


2937 


172 


CCAGCCCC G CCGGGAGC 


980 


GCUCCCGG GCCGAAAGGCGAGUCAAGGUCU GGGGCUGG 


2938 


183 


GGGAGCCC G CGCCCGCU 


981 


AGCGGGCG GCCGAAAGGCGAGUCAAGGUCU GGGCUCCC 


2939 


185 


GAGCCCGC G CCCGCUGC 


982 


GCAGCGGG GCCGAAAGGCGAGUCAAGGUCU GCGGGCUC 


2940 


189 


CCGCGCCC G CUGCCCAG 


983 — 


CUGGGCAG GCCGAAAGGCGAGUCAAGGUCU GGGCGCGG 


2941 


192 


CGCCCGCU G CCCAGGCU 


984 


AGCCUGGG GCCGAAAGGCGAGUCAAGGUCU AGCGGGCG 


2942 


205 


GGCUGGCC G CCGCCGUG 


985 


CACGGCGG GCCGAAAGGCGAGUCAAGGUCU GGCCAGCC 


2943 


208 


UGGCCGCC G CCGUGCCG 


986 


CGGCACGG GCCGAAAGGCGAGUCAAGGUCU GGCGGCCA 


2944 


213 


GCCGCCGU G CCGAUGUA 


987 


UACAUCGG GCCGAAAGGCGAGUCAAGGUCU ACGGCGGC 


2945 


250 


UCUCCCCU G CUCCCGUG 


989 


CACGGGAG GCCGAAAGGCGAGUCAAGGUCU AGGGGAGA 


2946 


258 


GCUCCCGU G CUCUGCGG 


990 


CCGCAGAG GCCGAAAGGCGAGUCAAGGUCU ACGGGAGC 


2947 


263 


CGUGCUCU G CGGAUCUC 


91 


GAGAUCCG GCCGAAAGGCGAGUCAAGGUCU AGAGCACG 


2948 


280 


CCCUGACC G CUCUCCAC 


993 


GUGGAGAG GCCGAAAGGCGAGUCAAGGUCU GGUCAGGG 


2949 


320 


AGGGCCCU G CAGGCCCU 


994 — 


' AGGGCCUG GCCGAAAGGCGAGUCAAGGUCU AGGGCCCU 


2950 


340 


GUCCUGAU G CCCCCAAG 


996 — 


CUUGGGGG GCCGAAAGGCGAGUCAAGGUCU AUCAGGAC 


2951 


397 


GGGCAGGC G CCAGGGAC 


998 


GUCCCUGG GCCGAAAGGCGAGUCAAGGUCU GCCUGCCC 


2952 


420 


GGGCCAGU G _____ 


999" ' 


" UGGGCUCG GCCGAAAGGCGAGUCAAGGUCU ACUGGCCC 


2953 


468 


CAAGCCCU G CCCU___ 


1Q02 


AGCCAGGG GCCGAAAGGCGAGUCAAGGUCU AGGGCUUG 


2954 


480 


UGGCUCCU G _______ 


1Q03 


AUCCACAG GCCGAAAGGCGAGUCAAGGUCU AGGAGCCA 


2955 


493 




1Q04 


CACUCCCG GCCGAAAGGCGAGUCAAGGUCU GCCCAUCC 


" 2956 




GCGGGAGU G CUGCCUGC ' 


1Q05 


" "GCAGGCAG GCCGAAAGGCGAGUCAAGGUCU ACUCCCGC 


2957 


504 


GGAGUGCU G CCUGCCCA 


1006 


UGGGCAGG GCCGAAAGGCGAGUCAAGGUCU AGCACUCC 


2958 


508 


UGCUGCCU G CCCACGGC 


1007 


GCCGUGGG GCCGAAAGGCGAGUCAAGGUCU AGGCAGCA 




537 


AUCCGGCU G CCCCUGCG 


1008 


CGCAGGGG GCCGAAAGGCGAGUCAAGGUCU AGCCGGAU 


2960 


543 


CUGCCCCU G CGCAGCGG 


1009 


CCGCUGCG GCCGAAAGGCGAGUCAAGGUCU AGGGGCAG 


2961 


545 


GCCCCUGC G CAGCGGCC 


1010 


GGCCGCUG GCCGAAAGGCGAGUCAAGGUCU GCAGGGGC 


2962 ' 


562 


UGGGGGGC G CCCCCCUG 


1011 


CAGGGGGG GCCGAAAGGCGAGUCAAGGUCU GCCCCCCA 


2963 


576 


CUGGGGCU G CGGCUGCC 


1012 


GGCAGCCG GCCGAAAGGCGAGUCAAGGUCU AGCCCCAG 


2964 


582 


CUGCGGCU G CCCCGGGA 


1013 


UCCCGGGG GCCGAAAGGCGAGUCAAGGUCU AGCCGCAG 


2965 


708 


AGCCCCCC G CAGACGCU 


1019 


AGCGUCUG GCCGAAAGGCGAGUCAAGGUCU GGGGGGCU 


2966 


714 


CCGCAGAC G CUCAACAU 


1020 


AUGUUGAG GCCGAAAGGCGAGUCAAGGUCU GUCUGCGG 


2967 
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751 


GUAACUUU G CAGUGGGU 


1021 


ACCCACUG GCCGAAAGGCGAGUCAAG3UCU AAAGUUAC 


2968 


760 


CAGUGGGU G CUGCCCCC 




GGGGGCAG GCCGAAAGGCGAGUCAAGGUCU ACCCACUG 


2969 


763 


UGGGUGCU G CCCCCCAC 


— 

1023 


GUGGGGGG GCCGAAAGGCGAGUCAAGGUCU AGCACCCA 


2970 


780 


CCCUUCCU G CAUCGCUA 




UAGCGAUG GCCGAAAGGCGAGUCAAGGUCU AGGAAGGG 


2971 


785 


CCUGCAUC G CUACUACC 




1025 


GGUAGUAG GCCGAAAGGCGAGUCAAGGUCU GAUGCAGG 


2972 


843 


GUGUAUGU G CCCUACAC 


1026 


GUGUAGGG GCCGAAAGGCGAGUCAAGGUCU ACAUACAC 


2973 


921 


GUCACUGU G CGUGCCAA 


1028 


UUGGCACG GCCGAAAGGCGAGUCAAGGUCU ACAGUGAC 


2974 


925 


CUGUGCGU G CCAACAUU 


1029 - 


AAUGUUGG GCCGAAAGGCGAGUCAAGGUCU ACGCACAG 


2975 


934 


CCAACAUU G CUGCCAUC 


1030 


GAUGGCAG GCCGAAAGGCGAGUCAAGGUCU AAUGUUGG 


2976 


937 


ACAUUGCU G CCAUCACU 


1031 


AGUGAUGG GCCGAAAGGCGAGUCAAGGUCU AGCAAUGU 


2977 


1006 


UGGCCUAU G CUGAGAUU 


1033 


AAUCUCAG GCCGAAAGGCGAGUCAAGGUCU AUAGGCCA 




1015 


CUGAGAUU G CCAGGCCU 


1035 „ 


AGGCCUGG GCCGAAAGGCGAGUCAAGGUCU AAUCUCAG 


2979 


1092 


UUCUCCCU G CAGCUUUG 


1039 


CAAAGCUG GCCGAAAGGCGAGUCAAGGUCU AGGGAGAA 


2980 


1105 


UUUGUGGU G CUGGCUUC 


1040 


GAAGCCAG GCCGAAAGGCGAGUCAAGGUCU ACCACAAA 


2981 


1134 


UCUGAAGU G CUGGCCUC 


1042 


GAGGCCAG GCCGAAAGGCGAGUCAAGGUCU ACUUCAGA 


2982 


1182 


GACCACUC G CUGUACAC 


1045 


GUGUACAG GCCGAAAGGCGAGUCAAGGUCU GAGUGGUC 


2983 


1248 


AUCAUUGU G CGGGUGGA 


1048 


UCCACCCG GCCGAAAGGCGAGUCAAGGUCU ACAAUGAU 


2984 


1286 


AAUGGACU G CAAGGAGU 


1050 


ACUCCUUG GCCGAAAGGCGAGUCAAGGUCU AGUCCAUU 


2985 


1344 


CUUCGUUU G CCCAAGAA 


1052 




2986 


1366 


UUGAAGCU G CAGUCAAA 


1054 


" UUUGACUG GCCGAAAGGCGAGUCAAGGUCU AGCUUCAA 


2987 


1442 


GCUGGUGU G CUGGCAAG 


1056 


CUUGCCAG GCCGAAAGGCGAGUCAAGGUCU ACACCAGC 


2988 


1526 


GUCCUUCC G CAUCACCA 


1058 






1542 


AUCCUUCC G CAGCAAUA 


1059 


UAUUGCUG GCCGAAAGGCGAGUCAAGGUCU GGAAGGAU 


2990 


1554 


CAAUACCU G CGGCCAGU 


1060 




2991 


1603 


ACAAGUUU G CCAUCUCA 


1062 


UGAGAUGG GCCGAAAGGCGAGUCAAGGUCU AAACUUGU 


2992 


1699 


UUGGCU0U G CUGUCAGC 


1066 


GCUGACAG GCC ; G ^°^^^^^ GCUGACAG 


2993 


1708 


CUGUCAGC G CUUGCCAU 


1067 




2994 


1712 


CAGCGCUU G CCAUGUGC 


1068 


GCACAUGG GCCGAAAGGCGAGUCAAGGUCU AAGCGCUG 




1719 


UGCCAUGU G CACGAUGA 


1069 


UCAUCGUG GCCGAAAGGCGAGUCAAGGUCU ACAUGGCA 


2996 


1843 


UCAUGGCU G CCAUCUGC 


1074 


GCAGAUGG GCCGAAAGGCGAGUCAAGGUCU AGCCAUGA 


2997 


1850 


UGCCAUCU G CGCCCUCU 


1075 




2998 


1852 


CCAUCUGC G CCCUCUUC 


1076 _ 


GAAGAGGG GCCGAAAGGCGAGUCAAGGUCU GCAGAUGG 


2999 


1863 


CUCUUCAU G CUGCCACU 


1077 


AGUGGCAG GCCGAAAGGCGAGUCAAGGUCU AUGAAGAG 


3000 


1866 


UUCAUGCU G CCACUCUG 


1078 


CAGAGUGG GCCGAAAGGCGAGUCAAGGUCU AGCAUGAA 


3001 


1874 


GCCACUCU G CCUCAUGG 


1079 


CCAUGAGG GCCGAAAGGCGAGUCAAGGUCU AGAGUGGC 


3002 


1895 


UCAGUGGC G CUGCCUCC 


1080 


GGAGGCAG GCCGAAAGGCGAGUCAAGGUCU GCCACUGA 


3003 


1898 


GUGGCGCU G CCUCCGCU 


1081 


AGCGGAGG GCCGAAAGGCGAGUCAAGGUCU AGCGCCAC 


3004 


1904 


CUGCCUCC G CUGCCUGC 


1082 


GCAGGCAG GCCGAAAGGCGAGUCAAGGUCU GGAGGCAG 


3005 


1907 


CCUCCGCU G CCUGCGCC 


1083 


GGCGCAGG GCCGAAAGGCGAGUCAAGGUCU AGCGGAGG 


3006 


1911 


CGCUGCCU G CGCCAGCA 


1084 


UGCUGGCG GCCGAAAGGCGAGUCAAGGUCU AGGCAGCG 


3007 


1913 


CUGCCUGC G CCAGCAGC 


.. 1085 


GCUGCUGG GCCGAAAGGCGAGUCAAGGUCU GCAGGCAG 


3008 


1933 


AUGACUUU G CUGAUGAC 


1088 


GUCAUCAG GCCGAAAGGCGAGUCAAGGUCU AAAGUCAU 


3009 


1950 


AUCUCCCU G CUGAAGUG 




CACUUCAG GCCGAAAGGCGAGUCAAGGUCU AGGGAGAU 


3010 




CACCAAAU G CCUCUGCC 


1094 


GGCAGAGG GCCGAAAGGCGAGUCAAGGUCU AUUUGGUG 


3011 


2093 


AUGCCUCU G CCUUGAUG 


1095 


CAUCAAGG GCCGAAAGGCGAGUCAAGGUCU AGAGGCAU 


3012 


2179 


AGCACUCU G CUGGCGGG 


1097 


CCCGCCAG GCCGAAAGGCGAGUCAAGGUCU AGAGUGCU 


3013 


2227 


GAAAUUCU G CUGCUUGA 


1098 


UCAAGCAG GCCGAAAGGCGAGUCAAGGUCU AGAAUUUC 


3014 


2230 


AUUCUGCU G CUUGAAAC 


1099 


GUUUCAAG GCCGAAAGGCGAGUCAAGGUCU AGCAGAAU 


3015 


2329 


CAUCACAC G CAGGUUAC 


1102 


GUAACCUG GCCGAAAGGCGAGUCAAGGUCU GUGUGAUG 


3016 


2393 


GUUUCCCU G CUGGCCAA 


1103 


UUGGCCAG GCCGAAAGGCGAGUCAAGGUCU AGGGAAAC 


3017 


2419 


GAGAGGAU G CACAGUUU 


1104 


AAACUGUG GCCGAAAGGCGAGUCAAGGUCU AUCCUCUC 


3018 
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2428 


CACAGUUU G CUAUUUGC 


-., 1Q5 


GCAAAUAG GCCGAAAGGCGAGUCAAGGUCU AAACUGUG 


3019 


2435 


UGCUAUUU g cuuuagag 


1106 


CUCUAAAG GCCGAAAGGCGAGUCAAGGUCU AAAUAGCA 


3020 


2476 


ACAUUGGU G CAAAGAUU 


11Q7 


AAUCUUUG GCCGAAAGGCGAGUCAAGGUCU ACCAAUGU 


3021 


2485 


CAAAGAUU G CCUCUUGA 


11Q8 


UCAAGAGG GCCGAAAGGCGAGUCAAGGUCU AAUCUUUG 


3022 


219 


GUGCCGAU G UAGCGGGC 


— 1110 


GCCCGCUA GCCGAAAGGCGAGUCAAGGUCU AUCGGCAC 


3023 


483 


CUCCUGCU G UGGAUGGG 


lll:L 


CCCAUCCA GCCGAAAGGCGAGUCAAGGUCU AGCAGGAG 


3024 


634 


GCAGCUUU G UGGAGAUG 


1112 


CAUCUCCA GCCGAAAGGCGAGUCAAGGUCU AAAGCUGC 


3025 


804 


AGGCAGCU G UCCAGCAC 


-■ 1113 


GUGCUGGA GCCGAAAGGCGAGUCAAGGUCU AGCUGCCU 


3026 


835 


GGAAGGGU G UGUAUGUG 


Ill4 


CACAUACA GCCGAAAGGCGAGUCAAGGUCU ACCCUUCC 


3027 


837 


AAGGGUGU G UAUGUGCC 


1115 


GGCACAUA GCCGAAAGGCGAGUCAAGGUCU ACACCCUU 


3028 


841 


GUGUGUAU G UGCCCUAC 


g 


GUAGGGCA GCCGAAAGGCGAGUCAAGGUCU AUACACAC 


3029 


919 


ACGUCACU G UGCGUGCC 





GGCACGCA GCCGAAAGGCGAGUCAAGGUCU AGUGACGU 


3030 


1100 


GCAGCUUU G UGGUGCUG 





CAGCACCA GCCGAAAGGCGAGUCAAGGUCU AAAGCUGC 


3031 


1144 


UGGCCUCU G UCGGAGGG 


9 


CCCUCCGA GCCGAAAGGCGAGUCAAGGUCU AGAGGCCA 


3032 


1185 


CACUCGCU G UACACAGG 





CCUGUGUA GCCGAAAGGCGAGUCAAGGUCU AGCGAGUG 


3033 


1246 


UGAUCAUU G UGCGGGUG 




CACCCGCA GCCGAAAGGCGAGUCAAGGUCU AAUGAUCA 


3034 


1315 


AGAGCAUU G UGGACAGU 


il22 


ACUGUCCA GCCGAAAGGCGAGUCAAGGUCU AAUGCUCU 


3035 


1356 


AAGAAAGU G UUUGAAGC 




1123 


GCUUCAAA GCCGAAAGGCGAGUCAAGGUCU ACUUUCUU 


3036 


1440 


CAGCUGGU G UGCUGGCA 


1124 


UGCCAGCA GCCGAAAGGCGAGUCAAGGUCU ACCAGCUG 


3037 


1570 


UGGAAGAU G UGGCCACG 


1125 ■ 


CGUGGCCA GCCGAAAGGCGAGUCAAGGUCU AUCUUCCA 


3038 


1592 


AGACGACU G UUACAAGU 


— LL 


ACUUGUAA GCCGAAAGGCGAGUCAAGGUCU AGUCGUCU 


3039 


1630 


CGGGCACU G UUAUGGGA 


— hill — 


UCCCAUAA GCCGAAAGGCGAGUCAAGGUCU AGUGCCCG 


3040 


1642 


UGGGAGCU G UUAUCAUG 


1128 


CAUGAUAA GCCGAAAGGCGAGUCAAGGUCU AGCUCCCA 


3041 


1666 


UCUACGUU G UCUUUGAU 


1129 


AUCAAAGA GCCGAAAGGCGAGUCAAGGUCU AAC3UAGA 


3042 


1702 


GCUUUGCU G UCAGCGCU 




AGCGCUGA GCCGAAAGGCGAGUCAAGGUCU AGCAAAGC 


3043 


1717 


CUUGCCAU G UGCACGAU 




1131 


AUCGUGCA GCCGAAAGGCGAGUCAAGGUCU AUGGCAAG 


3044 


1759 


GCCCUUUU G UCACCUUG 




CAAGGUGA GCCGAAAGGCGAGUCAAGGUCU AAAAGGGC 


3045 


1781 


GGAAGACU G UGGCUACA 


1133 


UGUAGCCA GCCGAAAGGCGAGUCAAGGUCU AGUCUUCC 


3046 


1834 


UAGCCUAU G UCAUGGCU 


1134 


AGCCAUGA GCCGAAAGGCGAGUCAAGGUCU AUAGGCUA 


3047 


1884 


CUCAUGGU G UGUCAGUG 


1135 


CACUGACA GCCGAAAGGCGAGUCAAGGUCU ACCAUGAG 


3048 


1886 


CAUGGUGU G UCAGUGGC 


_J: 


GCCACUGA GCCGAAAGGCGAGUCAAGGUCU ACACCAUG 


3049 


2048 


UGGCACCU G UGGCCAGA 




UCUGGCCA GCCGAAAGGCGAGUCAAGGUCU AGGUGCCA 


3050 


2139 


CAGGGACU G UACCUGUA 


113B 


UACAGGUA GCCGAAAGGCGAGUCAAGGUCU AGUCCCUG 


3051 


2145 


CUGUACCU G UAGGAAAC 




GUUUCCUA GCCGAAAGGCGAGUCAAGGUCU AGGUACAG 


3052 


2256 


GAACCUUU G UCCACCAU 




AUGGUGGA GCCGAAAGGCGAGUCAAGGUCU AAAGGUUC 


3053 


2346 


CUUGGCGU G UGUCCCUG 


— 


CAGGGACA GCCGAAAGGCGAGUCAAGGUCU ACGCCAAG 


3054 


2348 


UGGCGUGU G UCCCUGUG 




CACAGGGA GCCGAAAGGCGAGUCAAGGUCU ACACGCCA 


3055 


2354 


GUGUCCCU G UGGUACCC 


143 - 


GGGUACCA GCCGAAAGGCGAGUCAAGGUCU AGGGACAC 


3056 


2385 


CCAAGCUU G UUUCCCUG 


1144 


CAGGGAAA GCCGAAAGGCGAGUCAAGGUCU AAGCUUGG 


3057 


2453 


CAGGGACU G UAUAAACA 




UGUUUAUA GCCGAAAGGCGAGUCAAGGUCU AGUCCCUG 


3058 


14 


CGUCCGCA G CCCGCCCG 




CGGGCGGG GCCGAAAGGCGAGUCAAGGUCU UGCGGACG 


3059 


26 


GCCCGGGA G CUGCGAGC 





GCUCGCAG GCCGAAAGGCGAGUCAAGGUCU UCCCGGGC 


3060 




AGCUGCGA G CCGCGAGC 




GCUCGCGG GCCGAAAGGCGAGUCAAGGUCU UCGCAGCU 


3061 


40 


AGCCGCGA G CUGGAUUA 


1149 


UAAUCCAG GCCGAAAGGCGAGUCAAGGUCU UCGCGGCU 


3062 


51 


GGAUUAUG G UGGCCUGA 


1150 


UCAGGCCA GCCGAAAGGCGAGUCAAGGUCU CAUAAUCC 


3063 


54 


UUAUGGUG G CCUGAGCA 


1151 


UGCUCAGG GCCGAAAGGCGAGUCAAGGUCU CACCAUAA 


3064 


60 


UGGCCUGA G CAGCCAAC 


1152 


GUUGGCUG GCCGAAAGGCGAGUCAAGGUCU UCAGGCCA 


3065 


63 


CCUGAGCA G CCAACGCA 


1153 


UGCGUUGG GCCGAAAGGCGAGUCAAGGUCU UGCUCAGG 


3066 


72 


CCAACGCA G CCGCAGGA 


1154 


UCCUGCGG GCCGAAAGGCGAGUCAAGGUCU UGCGUUGG 


3067 


81 


CCGCAGGA G CCCGGAGC 


1155 


GCUCCGGG GCCGAAAGGCGAGUCAAGGUCU UCCUGCGG 


3068 


88 


AGCCCGGA G CCCUUGCC 


1156 


GGCAAGGG GCCGAAAGGCGAGUCAAGGUCU UCCGGGCU 


3069 
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C CAGGGAA G CCGCCACC 
CGCCACCG G CCCGCCAU 
C CCUCCCA G CCCCGCCG 
CGCCGGGA G CCCGCGCC 
CUGCCCAG G CUGGCCGC 
CCAGGCUG G CCGCCGCC 
CCGCCGCC G UGCCGAUG 
CCGAUGUA G CGGGCUCC 
UGUAGCGG G CUCCGGAU 
GGAUCCCA G CCUCUCCC 
CUGCUCCC G UGCUCUGC 
UCUCCACA G CCCGGACC 
ACCCGGGG G CUGGCCCA 
GGGGGCUG G CCCAGGGC 
GGCCCAGG G CCCUGCAG 
CCCUGCAG G CCCUGGCG 
AGGCCC'JG G CGUCCUGA 
GCCCUGGC G UCCUGAUG 
GCCCCCAA G CUCCCUCU 
CCUGAGAA G CCACCAGC 
AGCCACCA G CACCACCC 
ACUUGGGG G CAGGCGCC 
GGGGGCAG G CGCCAGGG 
GGACGGAC G UGGGCCAG 
GGACGUGG G CCAGUGCG 
GUGGGCCA G UGCGAGCC 
CAGUGCGA G CCCAGAGG 



CCCAGAGG C 



CCCGAAGG 



GCCCGAAG G CCGGGGCC 
AGGCCGGG G CCCACCAU 
CCACCAUG G CCCAAGCC 
UGGCCCAA G CCCUGCCC 
CUGCCCUG G CUCCUGCU 
GUGGAUGG G CGCGGGAG 
GCGCGGGA G UGCUGCCU 
UGCCCACG G CACCCAGC 
GGCACCCA G CACGGCAU 
CCAGCACG G CAUCCGGC 
GGCAUCCG G CUGCCCCU 
CCUGCGCA G CGGCCUGG 
GCGCAGCG G CCUGGGGG 
CCUGGGGG G CGCCCCCC 
CCCCUGGG G CUGCGGCU 
GGGCUGCG G CUGCCCCG 
GACGAAGA G CCCGAGGA 
CCCGAGGA G CCCGGCCG 
GGAGCCCG G CCGGAGGG 
CCGGAGGG G CAGCUUUG 
GAGGGGCA G CUUUGUGG 
UGGAGAUG G UGGACAAC 
CCUGAGGG G CAAGUCGG 



GGUGGCGG GCCGAAAGGCGAGUCAAGGUCU UUCCCUGG 
AUGGCGGG GCCGAAAGGCGAGUCAAGGUCU CGGUGGCG 
CGGCGGGG GCCGAAAGGCGAGUCAAGGUCU UGGGAGGG 
GGCGCGGG GCCGAAAGGCGAGUCAAGGUCU UCCCGGCG 
GCGGCCAG GCCGAAAGGCGAGUCAAGGUCU CUGGGCAG 
GGCGGCGG GCCGAAAGGCGAGUCAAGGUCU CAGCCUGG 
CAUCGGCA GCCGAAAGGCGAGUCAAGGUCU GGCGGCGG 
GGAGCCCG GCCGAAAGGCGAGUCAAGGUCU UACAUCGG 
AUCCGGAG GCCGAAAGGCGAGUCAAGGUCU CCGCUACA 
GGGAGAGG GCCGAAAGGCGAGUCAAGGUCU UGGGAUCC 
GCAGAGCA GCCGAAAGGCGAGUCAAGGUCU GGGAGCAG 
GGUCCGGG GCCGAAAGGCGAGUCAAGGUCU UGUGGAGA 
UGGGCCAG GCCGAAAGGCGAGUCAAGGUCU CCCCGGGU 
GCCCUGGG GCCGAAAGGCGAGUCAAGGUCU CAGCCCCC 
CUGCAGGG GCCGAAAGGCGAGUCAAGGUCU CCUGGGCC 
CGCCAGGG GCCGAAAGGCGAGUCAAGGUCU CUGCAGGG 
UCAGGACG GCCGAAAGGCGAGUCAAGGUCU CAGGGCCU 
CAUCAGGA GCCGAAAGGCGAGUCAAGGUCU GCCAGGGC 
AGAGGGAG GCCGAAAGGCGAGUCAAGGUCU UUGGGGGC 
GCUGGUGG GCCGAAAGGCGAGUCAAGGUCU UUCUCAGG 
GGGUGGUG GCCGAAAGGCGAGUCAAGGUCU UGGUGGCU 
GGCGCCUG GCCGAAAGGCGAGUCAAGGUCU CCCCAAGU 
CCCUGGCG GCCGAAAGGCGAGUCAAGGUCU CUGCCCCC 
CUGGCCCA GCCGAAAGGCGAGUCAAGGUCU GUCCGUCC 
CGCACUGG GCCGAAAGGCGAGUCAAGGUCU CCACGUCC 
GGCUCGCA GCCGAAAGGCGAGUCAAGGUCU UGGCCCAC 
CCUCUGGG GCCGAAAGGCGAGUCAAGGUCU UCGCACUG 
CCUUCGGG GCCGAAAGGCGAGUCAAGGUCU CCUCUGGG 
GGCCCCGG GCCGAAAGGCGAGUCAAGGUCU CUUCGGGC 
AUGGUGGG GCCGAAAGGCGAGUCAAGGUCU CCCGGCCU 
GGCUUGGG GCCGAAAGGCGAGUCAAGGUCU CAUGGUGG 
GGGCAGGG GCCGAAAGGCGAGUCAAGGUCU UUGGGCCA 
AGCAGGAG GCCGAAAGGCGAGUCAAGGUCU CAGGGCAG 
CUCCCGCG GCCGAAAGGCGAGUCAAGGUCU CCAUCCAC 
AGGCAGCA GCCGAAAGGCGAGUCAAGGUCU UCCCGCGC 
GCUGGGUG GCCGAAAGGCGAGUC AAGGUCU CGUGGGCA 
AUGCCGUG GCCGAAAGGCGAGUCAAGGUCU UGGGUGCC 
GCCGGAUG GCCGAAAGGCGAGUCAAGGUCU CGUGCUGG 
AGGGGCAG GCCGAAAGGCGAGUCAAGGUCU CGGAUGCC 
' CCAGGCCG GCCGAAAGGCGAGUCAAGGUCU UGCGCAGG 
( r- 3 —i iA UCAAGGUCU CGCUGCGC 

GGGGGGCG GCCGAAAGGCGAGUCAAGGUCU CCCCCAGG 
AGCCGCAG GCCGAAAGGCGAGUCAAGGUCU CCCAGGGG 
CGGGGCAG GCCGAAAGGCGAGUCAAGGUCU CGCAGCCC 
UCCUCGGG GCCGAAAGGCGAGUCAAGGUCU UCUUCGUC 
CGGCCGGG GCCGAAAGGCGAGUCAAGGUCU UCCUCGGG 
CCCUCCGG GCCGAAAGGCGAGUCAAGGUCU CGGGCUCC 
CAAAGCUG GCCGAAAGGCGAGUCAAGGUCU CCCUCCGG 
CCACAAAG GCCGAAAGGCGAGUCAAGGUCU UGCCCCUC 
GUUGUCCA GCCGAAAGGCGAGUCAAGGUCU CAUCUCCA 
CCGACUUG GCCGAAAGGCGAGUCAAGGUCU CCCUCAGG 



3113 
3114 
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663 


AGGGGCAA G UCGGGGCA 


12 ° 8 ' 


UGCCCCGA GCCGAAAGGCGAGUCAAGGUCU UUGCCCCU 


3121 


669 


AAGUCGGG G CAGGGCUA 


1209 


UAGCCCUG GCCGAAAGGCGAGUCAAGGUCU CCCGACUU 


3122 


674 


GGGGCAGG G CUACUACG 


1210 


CGUAGUAG GCCGAAAGGCGAGUCAAGGUCU CCUGCCCC 


3123 


682 


GCUACUAC G UGGAGAUG 




CAUCUCCA GCCGAAAGGCGAGUCAAGGUCU GUAGUAGC 


3124 


694 


AGAUGACC G UGGGCAGC 





GCUGCCCA GCCGAAAGGCGAGUCAAGGUCU GGUCAUCU 


3125 


698 


GACCGUGG G CAGCCCCC 


1 1 - 


GGGGGCUG GCCGAAAGGCGAGUCAAGGUCU CCACGGUC 


3126 


701 


CGUGGGCA G CCCCCCGC 


1214 


GCGGGGGG GCCGAAAGGCGAGUCAAGGUCU UGCCCACG 


3127 


727 


ACAUCCUG G UGGAUACA 


1215 


UGUAUCCA GCCGAAAGGCGAGUCAAGGUCU CAGGAUGU 


3128 


737 


GGAUACAG G CAGCAGUA 


1216 


UACUGCUG GCCGAAAGGCGAGUCAAGGUCU CUGUAUCC 


3129 


740 


UACAGGCA G CAGUAACU 


1217 


AGUUACUG GCCGAAAGGCGAGUCAAGGUCU UGCCUGUA 


3130 


743 


AGGCAGCA G UAACUUUG 


1218 ■ 


CAAAGUUA GCCGAAAGGCGAGUCAAGGUCU UGCUGCCU 


3131 


754 


ACUUUGCA G UGGGUGCU 


1219 


AGCACCCA GCCGAAAGGCGAGUCAAGGUCU UGCAAAGU 


3132 


758 


UGCAGUGG G UGCUGCCC 


1220 


GGGCAGCA GCCGAAAGGCGAGUCAAGGUCU CCACUGCA 


3133 


798 


UACCAGAG G CAGCUGUC 


1221 


GACAGCUG GCCGAAAGGCGAGUCAAGGUCU CUCUGGUA 


3134 


801 


CAGAGGCA G CUGUCCAG 


1222 


CUGGACAG GCCGAAAGGCGAGUCAAGGUCU UGCCUCUG 




809 


GCUGUCCA G CACAUACC 


1223 


GGUAUGUG GCCGAAAGGCGAGUCAAGGUCU UGGACAGC 


3136 


833 


CCGGAAGG G UGUGUAUG 


1224 


CAUACACA GCCGAAAGGCGAGUCAAGGUCU CCUUCCGG 


3137 


857 


CACCCAGG G CAAGUGGG 


1225 


CCCACUUG GCCGAAAGGCGAGUCAAGGUCU CCUGGGUG 


3138 


861 


CAGGGCAA G UGGGAAGG 


1226 


CCUUCCCR G ggg^gg^g^QyCAAGGUCU UCCCCUUC 


3139 


873 


GAAGGGGA G CUGGGCAC 


1227 


GUGCCCAG GCCGAAAGGC A GGUCU CCAGCUCC 


3140 


878 


GGAGCUGG G CACCGACC 


1228 


GGUCGGUG GCCGAAA GGUCU CAGGUCGG 




889 


CCGACCUG G UAAGCAUC 


1229 




3T4l — 


893 


CCUGGUAA G CAUCCCCC 


1230 


GGGGGAUG GCCGAAAGGCGAGUCAAGGUCU UUACCAGG 


3143 


905 


CCCCCAUG G CCCCAACG 


1231 


CGUUGGGG GCCGAAAGGCGAGUCAAGGUCU CAUGGGGG 




913 


GCCCCAAC G UCACUGUG 


1232 


CACAGUGA GCCGAAAGGCGAGUCAAGGUCU GUUGGGGC 


3145 


923 


CACUGUGC G UGCCAACA 


1233 






957 


UCAGACAA G UUCUUCAU 


1234 


AUGAAGAA GCCGAAAGGCGAGUCAAGGUCU UUGUCUGA 


3147 


971 


CAUCAACG G CUCCAACU 


1235 






986 


CUGGGAAG G CAUCCUGG 


1236 


CCAGGAUG GCCGAAAGGCGAGUCAAGGUCU CUUCCCAG 


3149 


996 


AUCCUGGG G CUGGCCUA 


1237 


UAGGCCAG GCCGAAAGGCGAGUCAAGGUCU CCCAGGAU 


3150 


1000 


UGGGGCUG G CCUAUGCU 


1238 


AGCAUAGG GCCGAAAGGCGAGUCAAGGUCU CAGCCCCA 


3151 


1020 


AUUGCCAG G CCUGACGA 


1239 


UCGUCAGG GCCGAAAGGCGAGUCAAGGUCU CUGGCAAU 


3152 


1038 


UCCCUGGA G CCUUUCUU 


1240 


AAGAAAGG GCCGAAAGGCGAGUCAAGGUCU UCCAGGGA 


3153 


1057 


ACUCUCUG G UAAAGCAG 


1241 


CUGCUUUA GCCGA^VAi < rCGA M. lp/ iAGGUCU CAGAGAGU 


' 3154 


1062 


CUGGUAAA G CAGACCCA 


1242 


UGGGUCUG GCCGAAAGGCGAGUCAAGGUCU UUUACCAG 


3155 


1072 


AGACCCAC G UUCCCAAC 


1243 


GUUGGGAA GCCGAAAGGCGAGUCAAGGUCU GUGGGUCU 


315S 


1095 


UCCCUGCA G CUUUGUGG 


1244 


CCACAAAG GCCGAAAGGCGAGUCAAGGUCU UGCAGGGA 


3157 


1103 


GCUUUGUG G UGCUGGCU 


1245 


AGCC^GCA GCCGAAAGGCGAGUCAAGGUCU CACAAAGC 


3158 


1109 


UGGUGCUG G CUUCCCCC 


1246 


GGGGGAAG GCCGAAAGGCGAGUCAAGGUCU CAGCACCA 


3159 


1125 


CUCAACCA G UCUGAAGU 


1247 


ACUUCAGA GCCGAAAGGCGAGUCAAGGUCU UGGUUGAG 


3160 


1132 


AGUCUGAA G UGCUGGCC 


1248 


GGCCAGCA GCCGAAAGGCGAGUCAAGGUCU UUCAGACU 


3161 


1138 


AAGUGCUG G CCUCUGUC 


1249 


GACAGAGG GCCGAAAGGCGAGUCAAGGUCU CAGCACUU 


3162 


1154 


CGGAGGGA G CAUGAUCA 




UGAUCAUG GCCGAAAGGCGAGUCAAGGUCU UCCCUCCG 


3163 




CAUUGGAG G UAUCGACC 


1251 


GGUCGAUA GCCGAAAGGCGAGUCAAGGUCU CUCCAAUG 


3164 


1193 


GUACACAG G CAGUCUCU 


1252 


AGAGACUG GCCGAAAGGCGAGUCAAGGUCU CUGUGUAC 


3165 


1196 


CACAGGCA G UCUCUGGU 


1253 


ACCAGAGA GCCGAAAGGCGAGUCAAGGUCU UGCCUGUG 


3166 


1203 


AGUCUCUG G UAUACACC 


1254 


GGUGUAUA GCCGAAAGGCGAGUCAAGGUCU CAGAGACU 


3167 


1218 


CCCAUCCG G CGGGAGUG 


1255 


CACUCCCG GCCGAAAGGCGAGUCAAGGUCU CGGAUGGG 


3168 


1224 


CGGCGGGA G UGGUAUUA 


1256 


UAAUACCA GCCGAAAGGCGAGUCAAGGUCU UCCCGCCG 


3169 


1227 


CGGGAGUG G UAUUAUGA 


1257 


UCAUAAUA GCCGAAAGGCGAGUCAAGGUCU CACUCCCG 


3170 


1237 


AUUAUGAG G UGAUCAUU 


1258 


AAUGAUCA GCCGAAAGGCGAGUCAAGGUCU CUCAUAAU 


3171 
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1252 


UUGUGCGG G UGGAGAUC 


- 


GAUCUCCA GCCGAAAGGCGAGUCAAGGUCU CCGCACAA 


3172 


1293 


UGCAAGGA G UACAACUA 




UAGUUGUA GCCGAAAGGCGAGUCAAGGUCU UCCUUGCA 


3173 


1310 


UGACAAGA G CAUUGUGG 


i26i - 


CCACAAUG GCCGAAAGGCGAGUCAAGGUCU UCUUGUCA 


3174 


1322 


UGUGGACA G UGGCACCA 


i2g2 


UGGUGCCA GCCGAAAGGCGAGUCAAGGUCU UGUCCACA 


3175 


1325 


GGACAGUG G CACCACCA 


"T263 


UGGUGGUG GCCGAAAGGCGAGUCAAGGUCU CACUGUCC 


3176 


1340 


CAACCUUC G UUUGCCCA 


264 


UGGGCAAA GCCGAAAGGCGAGUCAAGGUCU GAAGGUUG 


3177 


1354 


CCAAGAAA G UGUUUGAA 




1265 - 


UUCAAACA GCCGAAAGGCGAGUCAAGGUCU UUUCUUGG 


3178 


1363 


UGUUUGAA G CUGCAGUC 


1266 


GACUGCAG GCCGAAAGGCGAGUCAAGGUCU UUCAAACA 


3179 


1369 


AAGCUGCA G UCAAAUCC 


1267 


GGAUUUGA GCCGAAAGGCGAGUCAAGGUCU UGCAGCUU 


3180 


1384 


CCAUCAAG G CAGCCUCC 


1268 ■ 


GGAGGCUG GCCGAAAGGCGAGUCAAGGUCU CUUGAUGG 


3181 


1387 


UCAAGGCA G CCUCCUCC 


1269 - 


GGAGGAGG GCCGAAAGGCGAGUCAAGGUCU UGCCUUGA 


3182 


1404 


ACGGAGAA G UUCCCUGA 


1270 - 


UCAGGGAA GCCGAAAGGCGAGUCAAGGUCU UUCUCCGU 


3183 


1415 


CCCUGAUG G UUUCUGGC 


1271 


GCCAGAAA GCCGAAAGGCGAGUCAAGGUCU CAUCAGGG 


3184 


1422 


GGUUUCUG G CUAGGAGA 


1272 


UCUCCUAG GCCGAAAGGCGAGUCAAGGUCU CAGAAACC 


3185 


1431 


CUAGGAGA G CAGCUGGU 


1273 


ACCAGCUG GCCGAAAGGCGAGUCAAGGUCU UCUCCUAG 


3185 


1434 


GGAGAGCA G CUGGUGUG 




CACACCAG GCCGAAAGGCGAGUCAAGGUCU UGCUCUCC 


3187 


1438 


AGCAGCUG G UGUGCUGG 


1275 


CCAGCACA GCCGAAAGGCGAGUCAAGGUCU CAGCUGCU 


3188 


1446 


GUGUGCUG G CAAGCAGG 


1276 


CCUGCUUG GCCGAAAGGCGAGUCAAGGUCU CAGCACAC 


3189 


1450 


GCUGGCAA G CAGGCACC 


1277 


GGUGCCUG GCCGAAAGGCGAGUCAAGGUCU UUGCCAGC 


3190 


1454 


GCAAGCAG G CACCACCC 


1278 


GGGUGGUG GCCGAAAGGCGAGUCAAGGUCU CUGCUUGC 


3191 


1480 


UUUUCCCA G UCAUCUCA 


1279 


UGAGAUGA GCCGAAAGGCGAGUCAAGGUCU UGGGAAAA 


3192 


1502 


CCUAAUGG G UGAGGUUA 


1280 


UAACCUCA GCCGAAAGGCGAGUCAAGGUCU CCAUUAGG 


3193 


1507 


UGGGUGAG G UUACCAAC 


1281 


GUUGGUAA GCCGAAAGGCGAGUCAAGGUCU CUCACCCA 


3194 


1518 


ACCAACCA G UCCUUCCG 1282 


CGGAAGGA GCCGAAAGGCGAGUCAAGGUCU UGGUUGGU 


3195 


1545 


CUUCCGCA G CAAUACCU 1283 


AGGUAUUG GCCGAAAGGCGAGUCAAGGUCU UGCGGAAG 


3196 


1557 


UACCUGCG G CCAGUGGA 




UCCACUGG GCCGAAAGGCGAGUCAAGGUCU CGCAGGUA 


3197 


1561 


UGCGGCCA G UGGAAGAU 


1285 


AUCUUCCA GCCGAAAGGCGAGUCAAGGUCU UGGCCGCA 


. 3198 


1573 


AAGAUGUG G CCACGUCC 


1286 


GGACGUGG GCCGAAAGGCGAGUCAAGGUCU CACAUCUU 


3199 


1578 


GUGGCCAC G UCCCAAGA 


1287 


UCUUGGGA GCCGAAAGGCGAGUCAAGGUCU GUGGCCAC 


3200 


1599 


UGUUACAA G UUUGCCAU 


1288 


AUGGCAAA GCCGAAAGGCGAGUCAAGGUCU UUGUAACA 


3201 


1614 


AUCUCACA G UCAUCCAC 


1289 


GUGGAUGA GCCGAAAGGCGAGUCAAGGUCU UGUGAGAU 


3202 


1625 


AUCCACGG G CACUGUUA 


1290 


UAACAGUG GCCGAAAGGCGAGUCAAGGUCU CCGUGGAU 


3203 


1639 


UUAUGGGA G CUGUUAUC 


1291 


GAUAACAG GCCGAAAGGCGAGUCAAGGUCU UCCCAUAA 


3204 


1655 


CAUGGAGG G. CUUCUACG 


1292 


CGUAGAAG GCCGAAAGGCGAGUCAAGGUCU CCUCCAUG 


3205 


1663 


GCUUCUAC G UUGUCUUU 


1293 


AAAGACAA GCCGAAAGGCGAGUCAAGGUCU GUAGAAGC 


3206 


1678 


UUGAUCGG G CCCGAAAA 


1294 


UUUUCGGG GCCGAAAGGCGAGUCAAGGUCU CCGAUCAA 


3207 


1694 


ACGAAUUG G CUUUGCUG 


1295 


CAGCAAAG GCCGAAAGGCGAGUCAAGGUCU CAAUUCGU 


3208 


1706 


UGCUGUCA G CGCUUGCC 


1296 


GGCAAGCG GCCGAAAGGCGAGUCAAGGUCU UGACAGCA 


3209 


1728 


CACGAUGA G UUCAGGAC 


1297 


GUCCUGAA GCCGAAAGGCGAGUCAAGGUCU UCAUCGUG 


3210 


1738 


UCAGGACG G CAGCGGUG 


1298 


CACCGCUG GCCGAAAGGCGAGUCAAGGUCU CGUCCUGA 


3211 


1741 


GGACGGCA G CGGUGGAA 


1299 


UUCCACCG GCCGAAAGGCGAGUCAAGGUCU UGCCGUCC 


3212 


1744 


CGGCAGCG G UGGAAGGC 


1300 


GCCUUCCA GCCGAAAGGCGAGUCAAGGUCU CGCUGCCG 


3213 


1751 


GGUGGAAG G CCCUUUUG 


1301 


CAAAAGGG GCCGAAAGGCGAGUCAAGGUCU CUUCCACC 


3214 




AGACUGUG G CUACAACA 


1302 


UGUUGUAG GCCGAAAGGCGAGUCAAGGUCU CACAGUCU 


3215 


1809 


ACAGAUGA G UCAACCCU 


1303 


AGGGUUGA GCCGAAAGGCGAGUCAAGGUCU UCAUCUGU 


321S 


1828 


UGACCAUA G CCUAUGUC 


1304 


GACAUAGG GCCGAAAGGCGAGUCAAGGUCU UAUGGUCA 


3217 
3218 


1840 


AUGUCAUG G CUGCCAUC 


1305 


GAUGGCAG GCCGAAAGGCGAGUCAAGGUCU CAUGACAU 


1882 


GCCUCAUG G UGUGUCAG 


1306 


CUGACACA GCCGAAAGGCGAGUCAAGGUCU CAUGAGGC 


3219 


1890 


GUGUGUCA G UGGCGCUG 


1307 


CAGCGCCA GCCGAAAGGCGAGUCAAGGUCU UGACACAC 


3220 


1893 
1 1917 


UGUCAGUG G CGCUGCCU 
CUGCGCCA G CAGCAUGA 


1308 
1309 


AGGCAGCG GCCGAAAGGCGAGUCAAGGUCU CACUGACA 
UCAUGCUG GCCGAAAGGCGAGUCAAGGUCU UGGCGCAG 


3221 
3222 
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1972 
2006 
2009 
2019 
2026 
2042 
2051 
2057 
2114 
2118 
2123 



2214 
2243 
2288 
2305 
2314 
2320 
2333 
2342 
2344 
2357 
23S5 



2463 
2474 



CGCCAGCA G CAUGAUGA 
CUGCUGAA G UGAGGAGG 
GUGAGGAG G CCCAUGGG 
GCCCAUGG G CAGAAGAU 
ACACCUCC G UGGUUCAC 
CCUCCGUG G UUCACUUU 
UCACUUUG G UCACAAGU 
GGUCACAA G UAGGAGAC 
CACAGAUG G CACCUGUG 
CACCUGUG G CCAGAGCA 
UGGCCAGA G CACCUCAG 
AGGAAAAG G CUGGCAAG 
AAAGGCUG G CAAGGUGG 
CUGGCAAG G UGGGUUCC 
CAAGGUGG G UUCCAGGG 
AGAAAGAA G CACUCUGC 
CUCUGCUG G CGGGAAUA 
UACUCUUG G UCACCUCA 
AAAUUUAA G UCGGGAAA 
AAACUUCA G CCCUGAAC 
AACCCAAA G UAUUCUUC 
UUUUCUUA G UUUCAGAA 
UUUCAGAA G UACUGGCA 
AAGUACUG G CAUCACAC 
ACACGCAG G UUACCUUG 
UUACCUUG G CGUGUGUC 
ACCUUGGC G UGUGUCCC 
UCCCUGUG G UACCCUGG 
GUACCCUG G CAGAGAAG 
GAGACCAA G CUUGUUUC 
CCCUGCUG G CCAAAGUC 
UGGCCAAA G UCAGUAGG 
CAAAGUCA G UAGGAGAG 
GAUGCACA G UUUGCUAU 
AUAAACAA G CCUAACAU 
UAACAUUG G UGCAAAGA 



UCAUCAUG GCCGAAAGGCGAGUCAAGGUCU UGCUGGCG 
CCUCCUCA GCCGAAAGGCGAGUCAAGGUCU UUCAGCAG 
CCCAUGGG GCCGAAAGGCGAGUCAAGGUCU CUCCUCAC 
AUCUUCUG GCCGAAAGGCGAGUCAAGGUCU CCAUGGGC 
GUGAACCA GCCGAAAGGCGAGUCAAGGUCU GGAGGUGU 
AAAGUGAA GCCGAAAGGCGAGUCAAGGUCU CACGGAGG 
ACUUGUGA GCCGAAAGGCGAGUCAAGGUCU CAAAGUGA 
GUCUCCUA GCCGAAAGGCGAGUCAAGGUCU UUGUGACC 
CACAGGUG GCCGAAAGGCGAGUCAAGGUCU CAUCUGUG 
UGCUCUGG GCCGAAAGGCGAGUCAAGGUCU CACAGGUG 
CUGAGGUG GCCGAAAGGCGAGUCAAGGUCU UCUGGCCA 
CUUGCCAG GCCGAAAGGCGAGUCAAGGUCU CUUUUCCU 



GCCGAAAGGC 



VGUCAAGGUCU CAGCCUUU 



GGAACCCA GCCGAAAGGCGAGUCAAGGUCU CUUGCCAG 
CCCUGGAA GCCGAAAGGCGAGUCAAGGUCU CCACCUUG 
GCAGAGUG GCCGAAAGGCGAGUCAAGGUCU UUCUUUCU 
UAUUCCCG GCCGAAAGGCGAGUCAAGGUCU CAGCAGAG 
UGAGGUGA GCCGAAAGGCGAGUCAAGGUCU CAAGAGUA 
UUUCCCGA GCCGAAAGGCGAGUCAAGGUCU UUAAAUUU 
GUUCAGGG GCCGAAAGGCGAGUCAAGGUCU UGAAGUUU 
GAAGAAUA GCCGAAAGGCGAGUCAAGGUCU UUUGGGUU 
UUCUGAAA GCCGAAAGGCGAGUCAAGGUCU UAAGAAAA 
UGCCAGUA GCCGAAAGGCGAGUCAAGGUCU UUCUGAAA 
GUGUGAUG GCCGAAAGGCGAGUCAAGGUCU CAGUACUU 
CAAGGUAA GCCGAAAGGCGAGUCAAGGUCU CUGCGUGU 
GACACACG GCCGAAAGGCGAGUCAAGGUCU CAAGGUAA 
GGGACACA GCCGAAAGGCGAGUCAAGGUCU GCCAAGGU 
CCAGGGUA GCCGAAAGGCGAGUCAAGGUCU CACAGGGA 
CUUCUCUG GCCGAAAGGCGAGUCAAGGUCU CAGGGUAC 
GAAACAAG GCCGAAAGGCGAGUCAAGGUCU UUGGUCUC 
GACUUUGG GCCGAAAGGCGAGUCAAGGUCU CAGCAGGG 
CCUACUGA GCCGAAAGGCGAGUCAAGGUCU UUUGGCCA 
CUCUCCUA GCCGAAAGGCGAGUCAAGGUCU UGACUUUG 
AUAGCAAA GCCGAAAGGCGAGUCAAGGUCU UGUGCAUC 
AUGUUAGG GCCGAAAGGCGAGUCAAGGUCU UUGUUUAU 
UCUUUGCA GCCGAAAGGCGAGUCAAGGUCU CAAUGUUA 



Input Sequence = AF190725. Cut Site = G/ . 

Stem Length = 8 . Core Sequence = GCcgaaagGCGaGuCaaGGuCu 

AF190725 (Homo sapiens beta-site APP cleaving enzyme (BACE) mRNA; 2526 bp) 
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Table 22: Human BACE DNAzyme and Target Sequence 



POS 


Substrate 


Seq ID 


Ribozyme 


Rz^Seq 


48 


GCUGGAUU A UGGUGGCC 


3 


CAACGA AATCCAGC 

GGCCACCA GGC ^gCTACAACGA AGCCCTGC 


3259 


677 


GCAGGGCU A CUACGUGG 


27 


CCACGTAG GGC ' rAG p T ^^p G ^ AGTAGCCC 


~ 3260 


680 


GGGCUACU A CGUGGAGA 


28 


TCTCCACG GGCTAG ATCCACCA 


32gl 


733 


UGGUGGAU A CAGGCAGC 


31 


GCTGCCTG GGCTAGCTACAACGA AT _ 


~~ 3262 


788 


GCAUCGCU A CUACCAGA 


38 


TCTGGTAG GGCTAGC ™^^^ AGTAGCGA 


3263 


791 


UCGCUACU A CCAGAGGC 


39 


GCCTCTGG GGCTAGCTACAACGA AGiAGCGA 


~ 3264 


815 


CAGCACAU A CCGGGACC 


41 


GGTCCCGG GGCTAGCTACAACGA A 


32g5 


839 


GGGUGUGU A UGUGCCCU 


43 


AGGGCACA GGCTAGCTACAACGA ACACACCC 


32S6 


848 


UGUGCCCU A CACCCAGG 


44 


CCTGGGTG GGCTAGCTACAACGA AGGGCACA 


326? 


1004 


GCUGGCCU A UGCUGAGA 


58 


TCTCAGCA GGCTAGCTACAACGA AGGCCAGC 


3268 


1171 


UUGGAGGU A UCGACCAC 


85 


GTGGTCGA GGCTAGCTACAACGA ACCTCCAA 


— 3269 


1187 


CUCGCUGU A CACAGGCA 


88 


TGCCTGTG GGCTAGCTACAACGA ACAGCGAG 


— 3270 


1205 


UCUCUGGU A UACACCCA 


91 


TGGGTGTA GGCTAGCTACAACGA ACCAGAGA 


3271 


1207 


UCUGGUAU A CACCCAUC 


92 


GATGGGTG GGCTAGCTACAACGA ATACCAGA 


3272 


1229 


GGAGUGGU A UUAUGAGG 


94 


CCTCATAA GGCTAGCTACAACGA ACCACTCC 


3273 


1232 


GUGGUAUU A UGAGGUGA 


96 


TCACCTCA GGCTAGCTACAACGA AATACCAC 


3274 


1295 


CAAGGAGU A CAACUAUG 


101 


CATAGTTG GGCTAGCTACAACGA ACTCCTTG 


3275 — 


1301 


GUACAACU A UGACAAGA 


102 


TCTTGTCA GGCTAGCTACAACGA AGTTGTAC 


3276 


1493 


CUCACUCU A CCUAAUGG 


130 


CCATTAGG GGCTAGCTACAACGA AGAGTGAG 


32?7 


1510 


GUGAGGUU A CCAACCAG 


133 


CTGGTTGG GGCTAGCTACAACGA AACCTCAC 


3278 


1550 


GCAGCAAU A CCUGCGGC 


141 


GCCGCAGG GGCTAGCTACAACGA ATTGCTGC 


3279 


1595 


CGACUGUU A CAAGUUUG 


144 


CAAACTTG GGCTAGCTACAACGA AACAGTCG 


3280 


1633 


GCACUGUU A UGGGAGCU 


152 


AGCTCCCA GGCTAGCTACAACGA AACAGTGC 


3281 


1645 


GAGCUGUU A UCAUGGAG 


154 


.CTCCATGA GGCTAGCTACAACGA AACAGCTC 


— 3282 


1661 


GGGCUUCU A CGUUGUCU 


158 


AGACAACG GGCTAGCTACAACGA AGAAGCCC 


— 3283 


1787 


CUGUGGCU A CAACAUUC 


176 


GAATGTTG GGCTAGCTACAACGA AGCCACAG 


— 3284 — 


1832 


CAUAGCCU A UGUCAUGG 


182 


CCATGACA GGCTAGCTACAACGA AGGCTATG 


— 3285 


2141 


GGGACUGU A CCUGUAGG 


212 


CCTACAGG GGCTAGCTACAACGA ACAGTCCC 


— 3286 


2191 


GCGGGAAU A CUCUUGGU 


215 


ACCAAGAG GGCTAGCTACAACGA ATTCCCGC 


32 81 '~ 


2290 


CCCAAAGU A UUCUUCUU 


240 


AAGAAGAA GGCTAGCTACAACGA ACTTTGGG 


3288 


2316 


UCAGAAGU A CUGGCAUC 


254 


GATGCCAG GGCTAGCTACAACGA ACTTCTGA 


3281 


2336 


CGCAGGUU A CCUUGGCG 


257 


CGCCAAGG GGCTAGCTACAACGA AACCTGCG 


— 3290 


2359 


CCUGUGGU A CCCUGGCA 


260 


TGCCAGGG GGCTAGC ^^^ G ^ AGCAAACT 


3291 


2431 


AGUUUGCU A UUUGCUUU 


269 


AAAGCAAA GGCTAGCTA Tr^TT^ 


3292 


2455 


GGGACUGU A UAAACAAG 


275 


CTTGTTTA GGCTAGCTACAACGA ACAGTCCC 


3293 


140 


AAGCCGCC A CCGGCCCG 


322 




3294 


151 


GGCCCGCC A UGCCCGCC 




GGCGGGCA GGCTAGCTACAACGA GGCGGGCC 


3295 




CGCUCUCC A CAGCCCGG 


380 


CCGGGCTG GGCTAGCTACAACGA GGAGAGCG 


3296 


368 


GAGAAGCC A CCAGCACC 


412 


GGTGCTGG GGCTAGCTACAACGA GGCTTCTC 


3297 


374 


CCACCAGC A CCACCCAG 


415 


CTGGGTGG GGCTAGCTACAACGA GCTGGTGG 


3298 


377 


CCAGCACC A CCCAGACU 


417 


AGTCTGGG GGCTAGCTACAACGA GGTGCTGG 


3299 


451 


CGGGGCCC A CCAUGGCC 


435 


GGCCATGG GGCTAGCTACAACGA GGGCCCCG 


3300 


454 


GGCCCACC A UGGCCCAA 


437 


TTGGGCCA GGCTAGCTACAACGA GGTGGGCC 


3301 


512 


GCCUGCCC A CGGCACCC 


456 


GGGTGCCG GGCTAGCTACAACGA GGGCAGGC 


3302 


517 


CCCACGGC A CCCAGCAC 


457 


GTGCTGGG GGCTAGCTACAACGA GCCGTGGG 


3303 


524 


CACCCAGC A CGGCAUCC 


461 


GGATGCCG GGCTAGCTACAACGA GCTGGGTG 


3304 


529 


AGCACGGC A UCCGGCUG 


462 


CAGCCGGA GGCTAGCTACAACGA GCCGTGCT 


3305 
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721 


CGCUCAAC A UCCUGGUG 




CACCAGGA GGCTAGCTACAACGA GTTGAGCG 


3306 


770 


UGCCCCCC A CCCCUUCC 


~~ §15 


GGAAGGGG GGCTAGCTACAACGA GGGGGGCA 


3307 


782 


CUUCCUGC A UCGCUACU 


~~ 529 


AGTAGCGA GGCTAGCTACAACGA GCAGGAAG 


3308 


811 


UGUCCAGC A CAUACCGG 


— — 


CCGGTATG GGCTAGCTACAACGA GCTGGACA 


3309 


813 


UCCAGCAC A UACCGGGA 


~~ 539 


TCCC3GTA GGCTAGCTACAACGA GTGCTGGA 


3310 


850 


UGCCCUAC A CCCAGGGC 


7 


GCCCTGGG GGCTAGCTACAACGA GTAGGGCA 


3311 


880 


AGCUGGGC A CCGACCUG 





CAGGTCGG GGCTAGCTACAACGA GCCCAGCT 


3312 


895 


UGGUAAGC A UCCCCCAU 


_. 

_J 


ATGGGGGA GGCTAGCTACAACGA GCTTACCA 


3313 


902 


CAUCCCCC A UGGCCCCA 


562 


TGGGGCCA GGCTAGCTACAACGA GGGGGATG 


3314 


91S 


CCAACGUC A CUGUGCGU 




ACGCACAG GGCTAGCTACAACGA GACGTTGG 


3315 


931 


GUGCCAAC A UUGCUGCC 




571 


GGCAGCAA GGCTAGCTACAACGA GTTGGCAC 


3316 


940 


UUGCUGCC A UCACUGAA 


574 


TTCAGTGA GGCTAGCTACAACGA GGCAGCAA 


3317 


943 


CUGCCAUC A CUGAAUCA 


575 


TGATTCAG GGCTAGCTACAACGA GATGGCAG 


3318 


964 


AGUUCUUC A UCAACGGC 




GCCGTTGA GGCTAGCTACAACGA GAAGAACT 


3319 


988 


GGGAAGGC A UCCUGGGG 


— — 

586 


CCCCAGGA GGCTAGCTACAACGA GCCTTCCC 


3320 


1070 


GCAGACCC A CGUUCCCA 


610 


TGGGAACG GGCTAGCTACAACGA GGGTCTGC 


3321 


1156 


GAGGGAGC A UGAUCAUU 


638 


AATGATCA GGCTAGCTACAACGA GCTCCCTC 


3322 


1162 


GCAUGAUC A UUGGAGGU 


639 


ACCTCCAA GGCTAGCTACAACGA GATCATGC 


3323 


1178 


UAUCGACC A CUCGCUGU 


641 


ACAGCGAG GGCTAGCTACAACGA GGTCGATA 


3324 


1189 


CGCUGUAC A CAGGCAGU 




ACTGCCTG GGCTAGCTACAACGA GTACAGCG 


3325 


1209 


UGGUAUAC A CCCAUCCG 


— 

649 


CGGATGGG GGCTAGCTACAACGA GTATACCA 


3326 


1213 


AUACACCC A UCCGGCGG 


652 


CCGCCGGA GGCTAGCTACAACGA GGGTGTAT 


3327 


1243 


AGGUGAUC A UUGUGCGG 


654 


CCGCACAA GGCTAGCTACAACGA GATCACCT 


3328 


1312 


ACAAGAGC A UUGUGGAC 


663 


GTCCACAA GGCTAGCTACAACGA GCTCTTGT 


3329 


1327 


ACAGUGGC A CCACCAAC 


665 


GTTGGTGG GGCTAGCTACAACGA GCCACTGT 


3330 


1330 


GUGGCACC A CCAACCUU 


S67 


AAGGTTGG GGCTAGCTACAACGA GGTGCCAC 


3331 


137B 


UCAAAUCC A UCAAGGCA 


679 


TGCCTTGA GGCTAGCTACAACGA GGATTTGA 


3332 


1395 


CCUCCUCC A CGGAGAAG 


.. 681 


CTTCTCCG GGCTAGCTACAACGA GGAGGAGG 


3333 


1456 


AAGCAGGC A CCACCCCU 


698 


AGGGGTGG GGCTAGCTACAACGA GCCTGCTT 


3334 


1459 


CAGGCACC A CCCCUUGG 


700 


CCAAGGGG GGCTAGCTACAACGA GGTGCCTG 


3335 


1471 


CUUGGAAC A UUUUCCCA 


705 


TGGGAAAA GGCTAGCTACAACGA GTTCCAAG 


3336 


1483 


UCCCAGUC A UCUCACUC 


709 


GAGTGAGA GGCTAGCTACAACGA GACTGGGA 


3337 


1488 


GUCAUCUC A CUCUACCU 




AGGTAGAG GGCTAGCTACAACGA GAGATGAC 


3338 


1528 


CCUUCCGC A UCACCAUC 


— 

723 


GATGGTGA GGCTAGCTACAACGA GCGGAAGG 


3339 


1531 


UCCGCAUC A CCAUCCUU 


724 


AAGGATGG GGCTAGCTACAACGA GATGCGGA 


3340 


1534 


GCAUCACC A UCCUUCCG 


. 72S ... 


CGGAAGGA GGCTAGCTACAACGA GGTGATGC 


3341 


1576 


AUGUGGCC A CGUCCCAA 


737 


TTGGGACG GGCTAGCTACAACGA GGCCACAT 


3342 


1606 


AGUUUGCC A UCUCACAG 




CTGTGAGA GGCTAGCTACAACGA GGCAAACT 


3343 


1611 


GCCAUCUC A CAGUCAUC 


— ~~i — 


GATGACTG GGCTAGCTACAACGA GAGATGGC 


3344 


1617 


UCACAGUC A UCCACGGG 





CCCGTGGA GGCTAGCTACAACGA GACTGTGA 


3345 


1621 


AGUCAUCC A CGGGCACU 


— _ — 
. 750 


AGTGCCCG GGCTAGCTACAACGA GGATGACT 


3346 


1627 


CCACGGGC A CUGUUAUG 




CATAACAG GGCTAGCTACAACGA GCCCGTGG 


3347 


1648 


CUGUUAUC A UGGAGGGC 


— 7iT~ 


GCCCTCCA GGCTAGCTACAACGA GATAACAG 


3343 


1715 


CGCUUGCC A UGUGCACG 


765 


CGTGCACA GGCTAGCTACAACGA GGCAAGCG 


3349 


1721 


CCAUGUGC A CGAUGAGU 


766 


ACTCATCG GGCTAGCTACAACGA GCACATGG 


3350 


1762 


CUUUUGUC A CCUUGGAC 


772 


GTCCAAGG GGCTAGCTACAACGA GACAAAAG 


3351 


1771 


CCUUGGAC A UGGAAGAC 


775 


GTCTTCCA GGCTAGCTACAACGA GTCCAAGG 


3352 


1792 


GCUACAAC A UUCCACAG 


779 


CTGTGGAA GGCTAGCTACAACGA GTTGTAGC 


3353 


1797 


AACAUUCC A CAGACAGA 




TCTGTCTG GGCTAGCTACAACGA GGAATGTT 


3354 


1819 


CAACCCUC A UGACCAUA 


788 


TATGGTCA GGCTAGCTACAACGA GAGGGTTG 


3355 


1825 


UCAUGACC A UAGCCUAU 


790 


ATAGGCTA GGCTAGCTACAACGA GGTCATGA 


3356 
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1837 


CCUAUGUC A UGGCUGCC 


793 


GGCAGCCA G ^™^|™^^ G ^ GGCAGCCA 




1846 


UGGCUGCC A UCUGCGCC 


796 


GGCGCAGA CAACCA GAAGAGGG 


335B 


1861 


CCCUCUUC A UGCUGCCA 


802 


TGGCAGCA G CAACGA GGCAGCAT 


~3359 


1869 


AUGCUGCC A CUCUGCCU 


805 


AGGCAGAG GGCTAGCTA ^GA GAGGCAGA 


~~ 3360 


1879 


UCUGCCUC A UGGUGUGU 


810 


ACACACCA GGCTAGC ™^^ GCTGCTGG 


3361 


1922 


CCAGCAGC A UGAUGACU 


822 


AGTCATCA GGCTAGCTACAA — GTCATCAG 


~ 3362 


1942 


CUGAUGAC A UCUCCCUG 


825 


CAGGGAGA GGCTAG ^™ C , A ^ CGA GGQCCTCC 


3363 


1968 


GGAGGCCC A UGGGCAGA 


833 


TCTGCCCA GGCTAGCTACAA 


~ 3364 


1998 


CCUGGACC A CACCUCCG 


840 


CGGAGGTG GGCTAGCTACAACGA GGTCCAGG 


336S 


2000 


UGGACCAC A CCUCCGUG 


841 


CACGGAGG GGCTAGCTACAACGA GTGGTCCA 


3366 


2013 


CGUGGUUC A CUUUGGUC 


845 


GACCAAAG GGCTAGCTACAACGA GAACCACG 


— 3367 


2022 


CUUUGGUC A CAAGUAGG 


847 


CCTACTTG GGCTAGCTACAACGA GACCAAAG 


336g 


2035 


UAGGAGAC A CAGAUGGC 


849 


GCCATCTG GGCTAGCTACAACGA GTCTCCTA 


33g g 


2044 


CAGAUGGC A CCUGUGGC 


851 


GCCACAGG GGCTAGCTACAACGA GCCATCTG 


3370 


2059 


GCCAGAGC A CCUCAGGA 


856 


TCCTGAGG GGCTAGCTACAACGA G ^^ GG ^ 


33?1 


2076 


CCCUCCCC A CCCACCAA 


866 


TTGGTGGG GGCTAGCTA -■ 


— 3372 


2080 


CCCCACCC A CCAAAUGC 


869 


GCATTTGG GGCTAGCTACAACGA GGGIGGLHj 


3373 




AAAGAAGC A CUCUGCUG 


885 


CAGCAGAG GGCTAGCTACAACGA GCTTCTTT 


3374 


2201 


UCUUGGUC A CCUCAAAU 


891 


ATTTGAGG GGCTAGCTACAACGA GACCAAGA 


3375 


2260 


CUUUGUCC A CCAUUCCU 


906 


AGGAATGG GGCTAGCTACAACGA GGACAAAG 




2263 


UGUCCACC A UUCCUUUA 


908 


TAAAGGAA GGCTAGCTACAACGA GGTGGACA 


3T7I 


2322 


GUACUGGC A UCACACGC 


922 


GCGTGTGA GGCTAGCTACAACGA GCCAGTAC 


3378 


2325 


CUGGCAUC A CACGCAGG 


923 


CCTGCGTG GGCTAGCTACAACGA GATGCCAG 


— 3379 — 


2327 


GGCAUCAC A CGCAGGUU 


924 


AACCTGCG GGCTAGCTACAACGA GTGATGCC 


3380 


2421 


GAGGAUGC A CAGUUUGC 


945 
954 


GCAAACTG GGCTAGCTACAACGA GCATCCTC 
TGCACCAA GGCTAGCTACAACGA GTTAGGCT 


3381 

— 3382 


2470 
11 


AGCCUAAC A UUGGUGCA 
ACGCGUCC G CAGCCCGC 


960 


GCGGGCTG GGCTAGCTACAACGA GGAL'GCGT 


3383 


18 


CGCAGCCC G CCCGGGAG 


961 


CTCCCGGG GGCTAGCTACAACGA GGGC1GCU 


3384 


29 


CGGGAGCU G CGAGCCGC 


962 


GCGGCTCG GGCTAGCTACAACGA AGCTCCCG 


3385 


36 


UGCGAGCC G CGAGCUGG 


964 


CCAGCTCG GGCTAGCTACAACGA GGCTCGCA 


— 3386 — 


69 


CAGCCAAC G CAGCCGCA 


967 


TGCGGCTG GGCTAGCTACAACGA GTTGGCTG 


— 33l7 


75 


ACGCAGCC G CAGGAGCC 


968 


GGCTCCTG GGCTAGCTACAACGA GGCTGCGT 


3388 — 


94 


GAGCCCUU G CCCCUGCC 


969 


GGCAGGGG GGCTAGCTACAACGA AAGGGCTC 


3389 — 


100 


UUGCCCCU G CCCGCGCC 


970 


GGCGCGGG GGCTAGCTACAACGA AGGGGCAA 


3390 


104 


CCCUGCCC G CGCCGCCG 


971 


CGGCGGCG GGCTAGCTACAACGA GGGCAGGG 


3391 


106 


CUGCCCGC G CCGCCGCC 


972 


GGCGGCGG GGCTAGCTACAACGA GCGGGCAG 


3392 


109 


CCCGCGCC G CCGCCCGC 


973 


GCGGGCGG GGCTAGC f ACAACGA 


3393 


112 


GCGCCGCC G CCCGCCGG 


974 


CCGGCGGG GGCTAGCTACAA gggcC-QCG 




116 


CGCCGCCC G CCGGGGGG 


975 


CCCCCCGG GGCTA 


3~f9l 


137 


GGGAAGCC G CCACCGGC 


976 


GCCGGTGG GGCTAGCTACAACGA GGCliCLiu 


3396 


148 


ACCGGCCC G CCAUGCCC 


977 


GGGCATGG GGCTAGCTACAACGA GGGCCGGT 


— 3397 — 


153 


CCCGCCAU G CCCGCCCC 


978 


GGGGCGGG GGCTAGCTACAACGA ATGGCGGG 


3398 


157 


CCAUGCCC G CCCCUCCC 


979 




3399 




CCAGCCCC G CCGGGAGC 


980 


GCTCCCGG GGCTAGCTACAACGA GGGGCTGG 


340O 


183 


GGGAGCCC G CGCCCGCU 


981 


AGCGGGCG GGCTAGCTACAACGA GGGCTCCC 


3401 


185 


GAGCCCGC G CCCGCUGC 


982 


GCAGCGGG GGCTAGCTACAACGA GCGGGCTC 


3402 


189 


CCGCGCCC G CUGCCCAG 


983 


CTGGGCAG GGCTAGCTACAACGA GGGCGCGG 


3403 


192 


CGCCCGCU G CCCAGGCU 


984 


AGCCTGGG GGCTAGCTACAACGA AGCGGGCG 


3404 


205 


GGCUGGCC G CCGCCGUG 


985 


CACGGCGG GGCTAGCTACAACGA GGCCAGCC 


3405 


208 


UGGCCGCC G CCGUGCCG 


986 


CGGCACGG GGCTAGCTACAACGA GGCGGCCA 


3406 


213 


GCCGCCGU G CCGAUGUA | 987 


TACATCGG GGCTAGCTACAACGA ACGGCGGC 


3407 
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G CUCCCGUG 



CACGGGAG GGCTAGCTACAACGA AGGGGAGA 



GCUCCCGU G CUCUGCGG 



CCGCAGAG GGCTAGCTACAACGA ACGGGAGC 



IGCUCU G CGGAUCUC 



GAGATCCG GGCTAGCTACAACGA AGAGCACG 



CCCUGACC G CUCUCCAC 



GTGGAGAG GGCTAGCTACAACGA GGTCAGGG 



AGGGCCCU G CAGGCCCU 



AGGGCCTG GGCTAGCTACAACGA AGGGCCCT 



GUCCUGAU G CCCCCAAG 



CTTGGGGG GGCTAGCTACAACGA ATCAGGAC 



GGGCAGGC G CCAGGGAC 



GTCCCTGG GGCTAGCTACAACGA GCCTGCCC 



GGGCCAGU G CGAGCCCA 



TGGGCTCG GGCTAGCTACAACGA ACTGGCCC 



CAAGCCCU G CCCUGGCU 



AGCCAGGG GGCTAGCTACAACGA AGGGCTTG 



UGGCUCCU G CUGUGGAU 



ATCCACAG GGCTAGCTACAACGA AGGAGCCA 



GGAUGGGC G CGGGAGUG 



CACTCCCG GGCTAGCTACAACGA GCCCATCC 



GCGGGAGU G CUGCCUGC 



GCAGGCAG GGCTAGCTACAACGA ACTCCCGC 



GGAGUGCU G CCUGCCCA 



TGGGCAGG GGCTAGCTACAACGA AGCACTCC 



508 
537 



UGCUGCCU G CCCACGGC 



GCCGTGGG GGCTAGCTACAACGA AGGCAGCA 



AUCCGGCU G CCCCUGCG 



CGCAGGGG GGCTAGCTACAACGA AGCCGGAT 



CUGCCCCU G CGCAGCGG 



CCGCTGCG GGCTAGCTACAACGA AGGGGCAG 



GCCCCUGC G CAGCGGCC 



GGCCGCTG GGCTAGCTACAACGA GCAGGGGC 



UGGGGGGC G CCCCCCUG 



CAGGGGGG GGCTAGCTACAACGA GCCCCCCA 



576 
582 
708 



J G CGGCUGCC 



GGCAGCCG GGCTAGCTACAACGA AGCCCCAG 



CUGCGGCU G CCCCGGGA 



TCCCGGGG GGCTAGCTACAACGA AGCCGCAG 



AGCCCCCC G CAGACGCU 



AGCGTCTG GGCTAGCTACAACGA GGGGGGCT 



CCGCAGAC G CUCAACAU 



ATGTTGAG GGCTAGCTACAACGA GTCTGCGG 



751 
760 



GUAACUUU G CAGUGGGU 



ACCCACTG GGCTAGCTACAACGA AAAGTTAC 



CAGUGGGU G CUGCCCCC 



GGGGGCAG GGCTAGCTACAACGA ACCCACTG 



UGGGUGCU G CCCCCCAC 



GTGGGGGG GGCTAGCTACAACGA AGCACCCA 



CCCUUCCU G CAUCGCUA 



TAGCGATG GGCTAGCTACAACGA AGGAAGGG 



CCUGCAUC G CUACUACC 



GGTAGTAG GGCTAGCTACAACGA GATGCAGG 



GUGUAUGU G CCCUACAC 



GTGTAGGG GGCTAGCTACAACGA ACATACAC 



GUCACUGU G CGUGCCAA 



TTGGCACG GGCTAGCTACAACGA ACAGTGAC 



CUGUGCGU G CCAACAUU 



AATGTTGG GGCTAGCTACAACGA ACGCACAG 



CCAACAUU G CUGCCAUC 



GATGGCAG GGCTAGCTACAACGA AATGTTGG 



; CCAUCACU 



AGTGATGG GGCTAGCTACAACGA AGCAATGT 



1006 
1015 
1092 



1286 
1344 
1366 
1442 
1526 
1542 
1554 
1603 
1699 
1708 
1712 
1719 



UGGCCUAU G CUGAGAUU 



AATCTCAG GGCTAGCTACAACGA ATAGGCCA 



CUGAGAUU G CCAGGCCU 
UUCUCCCU G CAGCUUUG 
UUUGUGGU G CUGGCUUC 
UCUGAAGU G CUGGCCUC 



AUCAUUGU G CGGGUGGA 
AAUGGACU G CAAGGAGU 
CUUCGUUU G CCCAAGAA 
UUGAAGCU G CAGUCAAA 
GCUGGUGU G CUGGCAAG 
GUCCUUCC G CAUCACCA 
AUCCUUCC G CAGCAAUA 
CAAUACCU G CGGCCAGU 
ACAAGUUU G CCAUCUCA 
UUGGCUUU G CUGUCAGC 
CUGUCAGC G CUUGCCAU 
CAGCGCUU G CCAUGUGC 
UGCCAUGU G CACGAUGA 



GGCTAGCTACAACGA AATCTCAG 
CAAAGCTG GGCTAGCTACAACGA AGGGAGAA 
GAAGCCAG GGCTAGCTACAACGA ACCACAAA 
GAGGCCAG GGCTAGCTACAACGA ACTTCAGA 
GTGTACAG GGCTAGCTACAACGA GAGTGGTC 
TCCACCCG GGCTAGCTACAACGA ACAATGAT 
ACTCCTTG GGCTAGCTACAACGA AGTCCATT 
TTCTTGGG GGCTAGCTACAACGA AAACGAAG 
TTTGACTG GGCTAGCTACAACGA AGCTTCAA 
CTTGCCAG GGCTAGCTACAACGA ACACCAGC 
TGGTGATG GGCTAGCTACAACGA GGAAGGAC 
TATTGCTG GGCTAGCTACAACGA GGAAGGAT 
ACTGGCCG GGCTAGCTACAACGA AGGTATTG 
TGAGATGG GGCTAGCTACAACGA AAACTTGT 
GCTGACAG GGCTAGCTACAACGA AAAGCCAA 
ATGGCAAG GGCTAGCTACAACGA GCTGACAG 
GCACATGG GGCTAGCTACAACGA AAGCGCTG 
TCATCGTG GGCTAGCTACAACGA ACATGGCA 
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1843 


UCAUGGCU G CCAUCUGC 


1074 






1850 


UGCCAUCU G CGCCCUCU 


1075 


AGAGGGCG GGCTAGCTACAACGA AGATGGCA 


3460 


1852 


CCAUCUGC G CCCUCUUC 


1076 


GAAGAGGG GGCTAGCTACAACGA GCAGATGG 


3461 


1863 


CUCUUCAU G CUGCCACU 


1077 


AGTGGCAG GGC ™^^^ AGCATGAA 


3462 


1866 


UUCAUGCU G CCACUCUG 


1078 


CAGAGTGG °° G ™^ CAACGA AGAGTGG C 


3463 


1874 


GCCACUCU G CCUCAUGG 


1079 




3464 


1895 


UCAGUGGC G CUGCCUCC 


_ 1080 


GGAGGCAG GGCTAGCTACAACGA GCCACTGA 




1898 


GUGGCGCU G CCUCCGCU 


1081 


AGCGGAGG GGCTAGCTACAACGA AGCGCCAC 


3466 


1904 


CUGCCUCC G CUGCCUGC 


. 1082 


GCAGGCAG GGCT ^ G ^^^g^ AGCGGAGG 


3^gg 


1907 


CCUCCGCU G CCUGCGCC 


1083 


GGCGCAGG A AACGA AGGCAGCG 




1911 


CGCUGCCU G CGCCAGCA 


1084 


TGCTGGCG GGCTA ACAACGA GCAGGCAG 


34g9 


1913 


CUGCCUGC G CCAGCAGC 


1085 


GCTGCTGG GGCTAG CTACRACGA AAAGTCAT 


~ 3470 


1933 


AUGACUUU G CUGAUGAC 


1088 


GTCATCAG GGCT A GG ™ GA ^ CGA ftGGGAGAT 


3471 


1950 


AUCUCCCU G CUGAAGUG 


1091 


CAC,ITCAG GGCTAGC ™ A ^ CGA ATTTGGTG 


~ 3472 


2087 


CACCAAAU G CCUCUGCC 


1094 


GGCAGAGG GGCTAGCTACAACGA 


3473 


2093 


AUGCCUCU G CCUUGAUG 


1095 


CATCAAGG GGCTAGCTACAACGA AGAGGCA I 


3474 


2179 


AGCACUCU G CUGGCGGG 


1097 


CCCGCCAG GGCTAGCTACAACGA AGAGTGCT 


3475 


2227 


GAAAUUCU G CUGCUUGA 




TCAAGCAG GGCTAGCTACAACGA AGAATTTC 


3476 


2230 


AUUCUGCU G CUUGAAAC 


1099 


CTTTCAAG GGCTAGCTACAACGA AGCAGAAT 


3477 


2329 


CAUCACAC G CAGGUUAC 


1102 


GTAACCTG GGCTAGCTACAACGA GTGTGATG 


3478 


2393 


GUUUCCCU G CUGGCCAA 


1103 


TTGGCCAG GGCTAGCTACAACGA AGGGAAAC 


3479 


2419 


GAGAGGAU G CACAGUUU 


1104 


AAACTGTG GGCTAGCTACAACGA ATCCTCTC 


3480 


2428 


CACAGUUU G CUAUUUGC 


1105 


GCAAATAG GGCTAGCTACAACGA AAACTGTG 


3481 


2435 


UGCUAUUU G CUUUAGAG 


1106 | CTCTAAAG GGCTAGCTACAACGA AAATAGCA 


3482 


2476 


ACAUUGGU G CAAAGAUU 


1107 1 AATCTTTG GGCTAGCTACAACGA ACCAATGT 


— 3483 


2485 


CAAAGAUU G CCUCUUGA 


1108 


TCAAGAGG GGCTAGCTACAACGA AATCTTTG 


3484 


219 


GUGCCGAU G UAGCGGGC 


1110 


GCCCGCTA GGCTAGCTACAACGA ATCGGCAC 


3485 


483 


CUCCUGCU G UGGAUGGG 


1111 


CCCATCCA GGCTAGCTACAACGA AGCAGGAG 


3486 


634 


GCAGCUUU G UGGAGAUG 


1112 


CATCTCCA GGCTAGCTACAACGA AAAGCTGC 


3487 


804 


AGGCAGCU G UCCAGCAC 


1113 


GTGCTGGA GGCTAGCTACAACGA AGCTGCCT 


34813 


835 


GGAAGGGU G UGUAUGUG 


1114 


CACATACA GGCTAGCTACAACGA ACCCTTCC 


3489 


837 


AAGGGUGU G UAUGUGCC 


1115 


GGCACATA GGCTAGCTACAACGA ACACCCTT 


3490 


841 


GUGUGUAU G UGCCCUAC 


1116 


GTAGGGCA GGCTAGCTACAACGA ATACACAC 




919 


ACGUCACU G UGCGUGCC 


1117 


GGCACGCA GGCTAGCTACAACGA AGTGACGT 


3492 


110.0 


GCAGCUUU G UGGUGCUG 


1118 


CAGCACCA GGCTAGCTACAACGA AAAGCTGC 


3493 


1144 


UGGCCUCU G UCGGAGGG 


1119 


CCCTCCGA GGCTAGCTACAACGA AGAGGCCA 




1185 


CACUCGCU G UACACAGG 


1120 


CCTGTGTA GGCTAGCTACAACGA AGCGAGTG 


— Jiii — 


124S 


UGAUCAUU G UGCGGGUG 


1121 


CACCCGCA GGCTAGCTACAACGA AATGATCA 


3496 


1315 


AGAGCAUU G UGGACAGU 


1122 


ACTGTCCA GGCTAGCTACAACGA AATGCTCT 


3497 


1356 


AAGAAAGU G UUUGAAGC 


1123 


GCTTCAAA GGCTAGCTACAACGA ACTTTCTT 


3498 


1440 


CAGCUGGU G UGCUGGCA 


1124 


TGCCAGCA GGCTAGCTACAACGA ACCAGCTG 


3499 


1570 


UGGAAGAU G UGGCCACG 


1125 


CGTGGCCA GGCTAGCTACAACGA ATCTTCCA 


3500 


1592 


AGACGACU G UUACAAGU 


1126 


ACTTGTAA GGCTAGCTACAACGA AGTCGTCT 


3501 




CGGGCACU G UUAUGGGA 


1127 


TCCCATAA GGCTAGCTACAACGA AGTGCCCG 


3502 


1642 


UGGGAGCU G UUAUCAUG 


1128 


CATGATAA GGCTAGCTACAACGA AGCTCCCA 


3503 


1666 


UCUACGUU G UCUUUGAU 


1129 


ATCAAAGA GGCTAGCTACAACGA AACGTAGA 


3504 


1702 


GCUUUGCU G UCAGCGCU 


1130 


AGCGCTGA GGCTAGCTACAACGA AGCAAAGC 


3505 


1717 


CUUGCCAU G UGCACGAU 


1131 


ATCGTGCA GGCTAGCTACAACGA ATGGCAAG 


3506 


1759 


GCCCUUUU G UCACCUUG 


1132 


CAAGGTGA GGCTAGCTACAACGA AAAAGGGC 


3507 


1781 


GGAAGACU G UGGCUACA 


113 3 TGTAGCCA GGCTAGCTACAACGA AGTCTTCC 


3508 


1834 


UAGCCUAU G UCAUGGCU 


1134 1 AGCCATGA GGCTAGCTACAACGA ATAGGCTA 


3509 
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1884 


CUCAUGGU G UGUCAGUG 


- 


CACTGAC A < r . a .G [A 'AACGA ACCATGAG 


3510 


1886 


CAUGGUGU G UCAGUGGC 


ii36 - 


GCCACTGA GGCTAGCTACAACGA ACACCATG 


3511 


2048 


UGGCACCU G UGGCCAGA 


? - 


TCTGGCCA GGCTAGCTACAACGA AGGTGCCA 


3512 


2139 


CAGGGACU G UACCUGUA 


— — ■-- 


TACAGGTA GGCTAGCTACAACGA AGTCCCTG 


3513 


2145 


CUGUACCU G UAGGAAAC 


_ 

1139 - 


GTTTCCTA GGCTAGCTACAACGA AGGTACAG 


3514 


2256 


GAACCUUU G UCCACCAU 


X1 * - 


ATGGTGGA GGCTAGCTACAACGA AAAGGTTC 


3515 


2346 


CUTJGGCGU G UGUCCCUG 




CAGGGACA GGCTAGCTACAACGA ACGCCAAG 


3516 


2348 


UGGCGUGU G UCCCUGUG 





CACAGGGA GGCTAGCTACAACGA ACACGCCA 


3517 


2354 


GUGUCCCU G UGGUACCC 




GGGTACCA GGCTAGCTACAACGA AGGGACAC 


3518 


2385 


CCAAGCUU G UUUCCCUG 




1 44 


CAGGGAAA GGCTAGCTACAACGA AAGCTTGG 


3519 


2453 


CAGGGACU G UAUAAACA 




TGTTTATA GGCTAGCTACAACGA AGTCCCTG 


3520 


14 


CGUCCGCA G CCCGCCCG 




CGGGCGGG GGCTAGCTACAACGA TGCGGACG 


3521 


26 


GCCCGGGA G CUGCGAGC 




__L_ — 


GCTCGCAG GGCTAGCTACAACGA TCCCGGGC 


3522 


33 


AGCUGCGA G CCGCGAGC 


1148 


GCTCGCGG GGCTAGCTACAACGA TCGCAGCT 


3523 


40 


AGCCGCGA G CUGGAUUA 


1149 


TAATCCAG GGCTAGCTACAACGA TCGCGGCT 


3524 


51 


GGAUUAUG G UGGCCUGA 


1150 


TCAGGCCA GGCTAGCTACAACGA CATAATCC 


3525 


54 


UUAUGGUG G CCUGAGCA 


1151 


TGCTCAGG GGCTAGCTACAACGA CACCATAA 


3526 


60 


UGGCCUGA G CAGCCAAC 


1152 - 


GTTGGCTG GGCTAGCTACAACGA TCAGGCCA 


3527 


63 


CCUGAGCA G CCAACGCA 


1153 


TGCGTTGG GGCTAGCTACAACGA TGCTCAGG 


3528 


72 


CCAACGCA G CCGCAGGA 


1154 


TCCTGCGG GGCTAGCTACAACGA TGCGTTGG 


3529 


81 


CCGCAGGA G CCCGGAGC 




GCTCCGGG GGCTAGCTACAACGA TCCTGCGG 


3530 


88 


AGCCCGGA G CCCUUGCC 


1156 


GGCAAGGG GGCTAGCTACAACGA TCCGGGCT 


3531 


134 


CCAGGGAA G CCGCCACC 


1157 


GGTGGCGG GGCTAGCTACAACGA TTCCCTGG 


3532 


144 


CGCCACCG G CCCGCCAU 


1158 


ATGGCGGG GGCTAGCTACAACGA CGGTGGCG 


3533 


167 


CCCUCCCA G CCCCGCCG 


1159 


CGGCGGGG GGCTAGCTACAACGA TGGGAGGG 


3534 


179 


CGCCGGGA G CCCGCGCC 


1160 


GGCGCGGG GGCTAGCTACAACGA TCCCGGCG 


3535 


198 


CUGCCCAG G CUGGCCGC 


1161 


GCGGCCAG GGCTAGCTACAACGA CTGGGCAG 


3536 


202 


CCAGGCUG G CCGCCGCC 


1162 


GGCGGCGG GGCTAGCTACAACGA CAGCCTGG 


3537 


211 


CCGCCGCC G UGCCGAUG 


1163 
1164 


CATCGGCA GGCTAGCTACAACGA GGCGGCGG 
GGAGCCCG GGCTAGCTACAACGA TACATCGG 


3538 
3539 


222 
226 


CCGAUGUA G CGGGCUCC 
UGUAGCGG G CUCCGGAU 


1165 


ATCCGGAG GGCTAGCTACAACGA CCGCTACA 


3540 


239 


GGAUCCCA G CCUCUCCC 


1166 


GGGAGAGG GGCTAGCTACAACGA TGGGATCC 


3541 


256 


CUGCUCCC G UGCUCUGC 


... 1167 


GCAGAGCA GGCTAGCTACAACGA GGGAGCAG 


3542 


290 


UCUCCACA G CCCGGACC 


. 1163 


GGTCCGGG GGCTAGCTACAACGA TGTGGAGA 


3543 


304 


ACCCGGGG G CUGGCCCA 


1169 


TGGGCCAG GGCTAGCTACAACGA CCCCGGGT 


3544 


308 


GGGGGCUG G CCCAGGGC 


1170 


GCCCTGGG GGCTAGCTACAACGA CAGCCCCC 


3545 


315 


GGCCCAGG G CCCUGCAG 


1171 


CTGCAGGG GGCTAGCTACAACGA CCTGGGCC 


3546 


324 


CCCUGCAG G CCCUGGCG 


1172 


CGCCAGGG GGCTAGCTACAACGA CTGCAGGG 


3547 


330 


AGGCCCUG G CGUCCUGA 





TCAGGACG GGCTAGCTACAACGA CAGGGCCT 


3548 


332 


GCCCUGGC G UCCUGAUG 




CATCAGGA GGCTAGCTACAACGA GCCAGGGC 


3549 


348 


GCCCCCAA G CUCCCUCU 


^ 4 


AGAGGGAG GGCTAGCTACAACGA TTGGGGGC 


3550 


365 


CCUGAGAA G CCACCAGC 


76~ 


GCTGGTGG GGCTAGCTACAACGA TTCTCAGG 


3551 


372 


AGCCACCA G CACCACCC 


~~ TvrT 


GGGTGGTG GGCTAGCTACAACGA TGGTGGCT 


3552 




ACUUGGGG G CAGGCGCC 


1178 


GGCGCCTG GGCTAGCTACAACGA CCCCAAGT 


3553 


395 


GGGGGCAG G CGCCAGGG 


1179 


CCCTGGCG GGCTAGCTACAACGA CTGCCCCC 


3554 


410 


GGACGGAC G UGGGCCAG 


1180 


CTGGCCCA GGCTAGCTACAACGA GTCCGTCC 


3555 




GGACGUGG G CCAGUGCG 


1181 


CGCACTGG GGCTAGCTACAACGA CCACGTCC 


3556 


418 


GUGGGCCA G UGCGAGCC 


1182 


GGCTCGCA GGCTAGCTACAACGA TGGCCCAC 


3557 


424 


CAGUGCGA G CCCAGAGG 


1133 


CCTCTGGG GGCTAGCTACAACGA TCGCACTG 


3558 


433 


CCCAGAGG G CCCGAAGG 


1184 


CCTTCGGG GGCTAGCTACAACGA CCTCTGGG 


.3559 


441 


GCCCGAAG G CCGGGGCC 


1185 


GGCCCCGG GGCTAGCTACAACGA CTTCGGGC 


3560 
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447 


AGGCCGGG G CCCACCAU 


1185 


ATGGTGGG GGCTAGCTACAACGA CCCGGCCT 


3561 




CCACCAUG G CCCAAGCC 


1187 


GGCTTGGG GGCTAGCTACAACGA CATGGTGG 


3562 


463 


UGGCCCAA G CCCUGCCC 


1188 


GGGCAGGG GGCTAGCTACAACGA TTGGGCCA 


3563 




CUGCCCUG G CUCCUGCU 


1189 


AGCAGGAG GGCTAGCTACAACGA CAGGGCAG 


3564 




GUGGAUGG G CGCGGGAG 


1190 


CTCCCGCG GGCTAGCTACAACGA CCATCCAC 


3565 




GCGCGGGA G UGCUGCCU 


1191 


AGGCAGCA GGCTAGCTACAACGA TCCCGCGC 


3566 


— 


UGCCCACG G CACCCAGC 


1192 


GCTGGGTG GGCTAGCTACAACGA CGTGGGCA 


3567 


522 


GGCACCCA G CACGGCAU 


1193 


ATGCCGTG GGCTAGCTACAACGA TGGGTGCC 


3568 


527 


CCAGCACG G CAUCCGGC 


1194 


GCCGGATG GGCTAGCTACAACGA CGTGCTGG 


3569 


s34 


GGCAUCCG G CUGCCCCU 


1195 


AGGGGCAG GGCTAGCTACAACGA CGGATGCC 


3570 


548 


CCUGCGCA G CGGCCUGG 


1196 




3571 




GCGCAGCG G CCUGGGGG 


1197 


CCCCCAGG GGCTAGCTACAACGA CGCTGCGC 


3572 


^ 


CCUGGGGG G CGCCCCCC 


1198 


GGGGGGCG GGCTAGCTACAACGA CCCCCAGG 


3573 




CCCCUGGG G CUGCGGCU 


1199 


AGCCGCAG GGCTAGCTACAACGA CCCAGGGG 


3574 




GGGCUGCG G CUGCCCCG 


1200 


CGGGGCAG GGCTAGCTACAACGA CGCAGCCC 


3575 




GACGAAGA G CCCGAGGA 


1201 


TCCTCGGG GGCTAGCTACAACGA TCTTCGTC 


3575 


— 


CCCGAGGA G CCCGGCCG 


1202 


CGGCCGGG GGCTAGCTACAACGA TCCTCGGG 


3577 


— 


GGAGCCCG G CCGGAGGG 


1203 


CCCTCCGG GGCTAGCTACAACGA CGGGCTCC 


3578 




CCGGAGGG G CAGCUUUG 


1204 


CAAAGCTG GGCTAGCTACAACGA CCCTCCGG 


3579 




GAGGGGCA G CUUUGUGG 


1205 


CCACAAAG GGCTAGCTACAACGA TGCCCCTC 


3580 




UGGAGAUG G UGGACAAC 


1206 


GTTGTCCA GGCTAGCTACAACGA CATCTCCA 


3581 




CCUGAGGG G CAA3UCG3 


1207 


CCGACTTG GGCTAGCTACAACGA CCCTCAGG 


3582 


gg3 


AGGGGCAA G UCGGGGCA 


1208 


TGCCCCGA GGCTAGCTACAACGA TTGCCCCT 


3583 


- sg9 


AAGUCGGG G CAGGGCUA 


1209 


TAGCCCTG GGCTAGCTACAACGA CCCGACTT 


3584 


g74 


GGGGCAGG G CUACUACG 


1210 


CGTAGTAG GGCTAGCTACAACGA CCTGCCCC 


3585 




GCUACUAC G UGGAGAUG 


1211 


CATCTCCA GGCTAGCTACAACGA GTAGTAGC 


3586 


694 


AGAUGACC G UGGGCAGC 


1212 


GCTGCCCA GGCTAGCTACAACGA GGTCATCT 


3587 


S98 


GACCGUGG G CAGCCCCC 


1213 


GGGGGCTG GGCTAGCTACAACGA CCACGGTC 


3588 




CGUGGGCA G CCCCCCGC 


1214 


GCGGGGGG GGCTAGCTACAACGA TGCCCACG 


3589 




ACAUCCUG G UGGAUACA 


1215 


TGTATCCA GGCTAGCTACAACGA CAGGATGT 


3590 


?37 


GGAUACAG G CAGCAGUA 


1216 


TACTGCTG GGCTAGCTACAACGA CTGTATCC 


3591 


74Q 


UACAGGCA G CAGUAACU 


1217 


AGTTACTG GGCTAGCTACAACGA TGCCTGTA 


3592 




AGGCAGCA G UAACUUUG 


1218 


CAAAGTTA GGCTAGCTACAACGA TGCTGCCT 


3593 


754 


ACUUUGCA G UGGGUGCU 


1219 


AGCACCCA GGCTAGCTACAACGA TGCAAAGT 


3594 




UGCAGUGG G UGCUGCCC 


1220 


GGGCAGCA GGCTAGCTACAACGA CCACTGCA 


3595 


798 


UACCAGAG G CAGCUGUC 


1221 


GACAGCTG GGCTAGCTACAACGA CTCTGGTA 


3596 


— 


CAGAGGCA G CUGUCCAG 


1222 


CTGGACAG GGCTAGCTACAACGA TGCCTCTG 


3597 




GCUGUCCA G CACAUACC 


1223 


GGTATGTG GGCTAGCTACAACGA TGGACAGC 


3598 


■ g33 


CCGGAAGG G UGUGUAUG 


1224 


CATACACA GGCTAGCTACAACGA CCTTCCGG 


3599 


857 


CACCCAGG G CAAGUGGG 


1225 


CCCACTTG GGCTAGCTACAACGA CCTGGGTG 


3600 




CAGGGCAA G UGGGAAGG 


1226 


CCTTCCCA GGCTAGCTACAACGA TTGCCCTG 


3601 


■ g 73 


GAAGGGGA G CUGGGCAC 


1227 


GTGCCCAG GGCTAGCTACAACGA TCCCCTTC 


3602 


~~87sT 


GGAGCUGG G CACCGACC 


1228 


GGTCGGTG GGCTAGCTACAACGA CCAGCTCC 


3603 


889 


CCGACCUG G UAAGCAUC 


1229 


GATGCTTA GGCTAGCTACAACGA CAGGTCGG 


3604 


893 


CCUGGUAA G CAUCCCCC 


1230 


GGGGGATG GGCTAGCTACAACGA TTACCAGG 




905 


CCCCCAUG G CCCCAACG 


1231 


CGTTGGGG GGCTAGCTACAACGA CATGGGGG 


3606 


913 


GCCCCAAC G UCACUGUG 


1232 


CACAGTGA GGCTAGCTACAACGA GTTGGGGC 


3607 


923 


CACUGUGC G UGCCAACA 


1233 


TGTTGGCA GGCTAGCTACAACGA GCACAGTG 


3608 


957 


UCAGACAA G UUCUUCAU 


1234 


ATGAAGAA GGCTAGCTACAACGA TTGTCTGA 


3609 


971 


CAUCAACG G CUCCAACU 


1235 


AGTTGGAG GGCTAGCTACAACGA CGTTGATG 


3610 


986 


CUGGGAAG G CAUCCUGG 


1236 


CCAGGATG GGCTAGCTACAACGA CTTCCCAG 


3611 
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AUCCUGGG G CUGGCCUA 



TAGGCCAG GGCTAGCTACAACGA CCCAGGAT 



UGGGGCUG G CCUAUGCU 



AGCATAGG GGCTAGCTACAACGA CAGCCCCA 



AUUGCCAG G CCUGACGA 



TCGTCAGG GGCTAGCTACAACGA CTGGCAAT 



UCCCUGGA G CCUUUCUU 



AAGAAAGG GGCTAGCTACAACGA TCCAGGGA 



ACUCUCUG G UAAAGCAG 



CTGCTTTA GGCTAGCTACAACGA CAGAGAGT 



CUGGUAAA G CAGACCCA 



TGGGTCTG GGCTAGCTACAACGA TTTACCAG 



AGACCCAC G UUCCCAAC 



GTTGGGAA GGCTAGCTACAACGA GTGGGTCT 



UCCCUGCA G CUUUGUGG 



CCACAAAG GGCTAGCTACAACGA TGCAGGGA 



GCUUUGUG G UGCUGGCU 



AGCCAGCA GGCTAGCTACAACGA CACAAAGC 



UGGUCCUG G CUUCCCCC 



GGGGGAAG GGCTAGCTACAACGA CAGCACCA 



CUCAACCA G UCUGAAGU 



ACTTCAGA GGCTAGCTACAACGA TGGTTGAG 



AGUCUGAA G UGCUGGCC 



GGCCAGCA GGCTAGCTACAACGA TTCAGACT 



AAGUGCUG G CCUCUGUC 



GACAGAGG GGCTAGCTACAACGA CAGCACTT 



CGGAGGGA G CAUGAUCA 



TGATCATG GGCTAGCTACAACGA TCCCTCCG 



CAUUGGAG G UAUCGACC 



GGTCGATA GGCTAGCTACAACGA CTCCAATG 



GUACACAG G CAGUCUCU 



AGAGACTG GGCTAGCTACAACGA CTGTGTAC 



CACAGGCA G UCUCUGGU 



ACCAGAGA GGCTAGCTACAACGA TGCCTGTG 



AGUCUCUG G UAUACACC 



GGTGTATA GGCTAGCTACAACGA CAGAGACT 



CCCAUCCG G CGGGAGUG 



CACTCCCG GGCTAGCTACAACGA CGGATGGG 



CGGCGGGA G UGGUAUUA 



TAATACCA GGC 5 TP '_*JGA TCCCGCCG 



CGGGAGUG G UAUUAUGA 



TCATAATA GGCTAGCTACAACGA CACTCCCG 



AUUAUGAG G UGAUCAUU 



AATGATCA GGCTAGCTACAACGA CTCATAAT 



UUGUGCGG G UGGAGAUC 



GATCTCCA GGCTAGCTACAACGA CCGCACAA 



UGCAAGGA G UACAACUA 



TAGTTGTA GGCTAGCTACAACGA TCCTTGCA 



UGACAAGA G CAUUGUGG 



CCACAATG GGCTAGCTACAACGA TCTTGTCA 



UGUGGACA G UGGCACCA 



TGGTGCCA GGCTAGCTACAACGA TGTCCACA 



GGACAGUG G CACCACCA 



TGGTGGTG GGCTAGCTACAACGA CACTGTCC 



CAACCUUC G UUUGCCCA 



TGGGCAAA GGCTAGCTACAACGA GAAGGTTG 



CCAAGAAA G UGUUUGAA 



TTCAAACA GGCTAGCTACAACGA TTTCTTGG 



UGUUUGAA G CUGCAGUC 



GACTGCAG GGCTAGCTACAACGA TTCAAACA 



AAGCUGCA G UCAAAUCC 



GGATTTGA GGCTAGCTACAACGA TGCAGCTT 



CCAUCAAG G CAGCCUCC 



GGAGGCTG GGCTAGCTACAACGA CTTGATGG 



UCAAGGCA G CCUCCUCC 



GGAGGAGG GGCTAGCTACAACGA TGCCTTGA 



ACGGAGAA G UUCCCUGA 



TCAGGGAA GGCTAGCTACAACGA TTCTCCGT 



CCCUGAUG G UUUCUGGC 



GCCAGAAA GGCTAGCTACAACGA CATCAGGG 



GGUUUCUG G CUAGGAGA 



TCTCCTAG GGCTAGCTACAACGA CAGAAACC 



CUAGGAGA G CAGCUGGU 



ACCAGCTG GGCTAGCTACAACGA TCTCCTAG 



GGAGAGCA G CUGGUGUG 



CACACCAG GGCTAGCTACAACGA TGCTCTCC 



AGCAGCUG G UGUGCUGG 



CCAGCACA GGCTAGCTACAACGA CAGCTGCT 



GUGUGCUG G CAAGCAGG 



CCTGCTTG GGCTAGCTACAACGA CAGCACAC 



GCUGGCAA G CAGGCACC 



GGTGCCTG GGCTAGCTACAACGA TTGCCAGC 



GCAAGCAG G CACCACCC 



GGGTGGTG GGCTAGCTACAACGA CTGCTTGC 



UUUUCCCA G UCAUCUCA 



TGAGATGA GGCTAGCTACAACGA TGGGAAAA 



CCUAAUGG G UGAGGUUA 



TAACCTCA GGCTAGCTACAACGA CCATTAGG 



UGGGUGAG G UUACCAAC 



GTTGGTAA GGCTAGCTACAACGA CTCACCCA 



ACCAACCA G UCCUUCCG 



CGGAAGGA GGCTAGCTACAACGA TGGTTGGT 



CUUCCGCA G CAAUACCU 



AGGTATTG GGCTAGCTACAACGA TGCGGAAG 



UACCUGCG G CCAGUGGA 



TCCACTGG GGCTAGCTACAACGA CGCAGGTA 



UGCGGCCA G UGGAAGAU 



ATCTTCCA GGCTAGCTACAACGA TGGCCGCA 



AAGAUGUG G CCACGUCC 



GGACGTGG GGCTAGCTACAACGA CACATCTT 



GUGGCCAC G UCCCAAGA 



TCTTGGGA GGCTAGCTACAACGA GTGGCCAC 
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UGUUACAA G UUUGCCAU 



ATGGCAAA GGCTAGCTACAACGA TTGTAACA 



AUCUCACA G UCAUCCAC 



GTGGATGA GGCTAGCTACAACGA TGTGAGAT 



AUCCACGG G CACUGUUA 



TAACAGTG GGCTAGCTACAACGA CCGTGGAT 



UUAUGGGA G CUGUUAUC 



GATAACAG GGCTAGCTACAACGA TCCCATAA 



CAUGGAGG G CUUCUACG 



CGTAGAAG GGCTAGCTACAACGA CCTCCATG 



GCUUCUAC G UUGUCUUU 



AAAGACAA GGCTAGCTACAACGA GTAGAAGC 



UUGAUCGG G CCCGAAAA 



TTTTCGGG GGCTAGCTACAACGA CCGATCAA 



ACGAAUUG G CUUUGCUG 



CAGCAAAG GGCTAGCTACAACGA CAATTCGT 



UGCUGUCA G CGCUUGCC 



GGCAAGCG GGCTAGCTACAACGA TGACAGCA 



CACGAUGA G UUCAGGAC 



GTCCTGAA GGCTAGCTACAACGA TCATCGTG 



UCAGGACG G CAGCGGUG 



CACCGCTG GGCTAGCTACAACGA CGTCCTGA 



GGACGGCA G CGGUGGAA 



TTCCACCG GGCTAGCTACAACGA TGCCGTCC 



CGGCAGCG G UGGAAGGC 



GCCTTCCA GGCTAGCTACAACGA CGCTGCCG, 



GGUGGAAG G CCCUUUUG 



CAAAAGGG GGCTAGCTACAACGA CTTCCACC 



AGACUGUG G CUACAACA 



TGTTGTAG GGCTAGCTACAACGA CACAGTCT 



ACAGAUGA G UCAACCCU 



AGGGTTGA GGCTAGCTACAACGA TCATCTGT 



UGACCAUA G CCUAUGUC 



GACATAGG GGCTAGCTACAACGA TATGGTCA 



AUGUCAUG G CUGCCAUC 



GATGGCAG GGCTAGCTACAACGA CATGACAT 



GCCUCAUG G UGUGUCAG 



CTGACACA GGCTAGCTACAACGA CATGAGGC 



GUGUGUCA G UGGCGCUG 



CAGCGCCA GGCTAGCTACAACGA TGACACAC 



UGUCAGUG G CGCUGCCU 



AGGCAGCG GGCTAGCTACAACGA CACTGACA 



CUGCGCCA G CAGCAUGA 



TCATGCTG GGCTAGCTACAACGA TGGCGCAG 



CGCCAGCA G CAUGAUGA 



TCATCATG GGCTAGCTACAACGA TGCTGGCG 



CUGCUGAA G UGAGGAGG 



CCTCCTCA GGCTAGCTACAACGA TTCAGCAG 



GUGAGGAG G CCCAUGGG 



CCCATGGG GGCTAGCTACAACGA CTCCTCAC 



GCCCAUGG G CAGAAGAU 



ATCTTCTG GGCTAGCTACAACGA CCATGGGC 



ACACCUCC G UGGUUCAC 



GTGAACCA GGCTAGCTACAACGA GGAGGTGT 



CCUCCGUG G UUCACUUU 



AAAGTGAA GGCTAGCTACAACGA CACGGAGG 



UCACUUUG G UCACAAGU 



ACTTGTGA GGCTAGCTACAACGA CAAAGTGA 



GGUCACAA G UAGGAGAC 



GTCTCCTA GGCTAGCTACAACGA TTGTGACC 



CACAGAUG G CACCUGUG 



CACAGGTG GGCTAGCTACAACGA CATCTGTG 



CACCUGUG G CCAGAGCA 



TGCTCTGG GGCTAGCTACAACGA CACAGGTG 



UGGCCAGA C 



CTGAGGTG GGCTAGCTACAACGA TCTGGCCA 



AGGAAAAG G CUGGCAAG 



CTTGCCAG GGCTAGCTACAACGA CTTTTCCT 



AAAGGCUG G CAAGGUGG 



CCACCTTG GGCTAGCTACAACGA CAGCCTTT 



CUGGCAAG G UGGGUUCC 



GGAACCCA GGCTAGCTACAACGA CTTGCCAG 



3696 
3697 



CAAGGUGG G UUCCAGGG 



CCCTGGAA GGCTAGCTACAACGA CCACCTTG 



3699 
3700 
3701 



AGAAAGAA G CACUCUGC 



GCAGAGTG GGCTAGCTACAACGA TTCTTTCT 



CUCUGCUG G CGGGAAUA 



TATTCCCG GGCTAGCTACAACGA CAGCAGAG 



UACUCUUG G UCACCUCA 



TGAGGTGA GGCTAGCTACAACGA CAAGAGTA 



AAAUUUAA G UCGGGAAA 



TTTCCCGA GGCTAGCTACAACGA TTAAATTT 



AAACUUCA G CCCUGAAC 



GTTCAGGG GGCTAGCTACAACGA TGAAGTTT 



AACCCAAA G UAUUCUUC 



GAAGAATA GGCTAGCTACAACGA TTTGGGTT 



UUUUCUUA G UUUCAGAA 



UUUCAGAA G UACUGGCA 



TT CTGAAA GGCTAGCTACAACGA TAAGAAAA 
TGCCAGTA GGCTAGCTACAACGA TTCTGAAA 



AAGUACUG G CAUCACAC 



ACACGCAG G UUACCUUG 



GTGTGATG GGCTAGCTACAACGA CAGTACTT 



CAAGGTAA GGCTAGCTACAACGA CTGCGTGT 



3708 
3709 
3710 



3 G CGUGUGUC 
; G UGUGUCCC 



GACACACG GGCTAGCTACAACGA CAAGGTAA 



GGGACACA GGCTAGCTACAACGA GCCAAGGT 



UCCCUGUG G UACCCUGG 



CCAGGGTA GGCTAGCTACAACGA CACAGGGA 



GUACCCUG G CAGAGAAG 



CTTCTCTG GGCTAGCTACAACGA CAGGGTAC 



3712 
3713 
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2381 


GAGACCAA G CUUGUUUC 


1339 


GAAACAAG GGCTAGCTACAACGA TTGGTCTC 


3714 


"2397^ 


CCCUGCUG G CCAAAGUC 


1340 


GACTTTGG GGCTAGCTACAACGA CAGCAGGG 


3715 


~T403~ 


UGGCCAAA G UCAGUAGG 


1341 


CCTACTGA GGCTAGCTACAACGA TTTGGCCA 


3716 


2407 


CAAAGUCA G UAGGAGAG 


^3 


CTCTCCTA GGCTAGCTACAACGA TGACTTTG 


3717 




GAUGCACA G UUUGCUAU 




ATAGCAAA GGCTAGCTACAACGA TGTGCATC 


3718 


2 ^ g3 


AUAAACAA G CCUAACAU 


1344 


ATGTTAGG GGCTAGCTACAACGA TTGTTTAT 


3719 


1 ~ 


UAACAUUG G UGCAAAGA 


1345 


TCTTTGCA GGCTAGCTACAACGA CAATGTTA 


3720 




CGAGCUGG A UUAUGGUG 


1346 


CACCATAA GGCTAGCTACAACGA CCAGCTCG 


3721 


— — 


AGCAGCCA A CGCAGCCG 


1347 


CGGCTGCG GGCTAGCTACAACGA TGGCTGCT 


3722 


■ 


CCGGGGGG A CCAGGGAA 


1343 


TTCCCTGG GGCTAGCTACAACGA CCCCCCGG 


3723 





CCGUGCCG A UGUAGCGG 


1349 


CCGCTACA GGCTAGCTACAACGA CGGCACGG 


3724 


233 




135Q 


GGCTGGGA GGCTAGCTACAACGA CCGGAGCC 


3725 


267 


CUCUGCGG A UCUCCCCU 


1351 


AGGGGAGA GGCTAGCTACAACGA CCGCAGAG 


3726 


277 




1352 


GAGAGCGG GGCTAGCTACAACGA CAGGGGAG 


3727 


296 


CAGCCCGG A CCCGGGGG 


1353 


CCCCCGGG GGCTAGCTACAACGA CCGGGCTG 


3728 


33 8 


GCGUCCUG ^ g^ggg^g 


1354 


TGGGGGCA GGCTAGCTACAACGA CAGGACGC 


3729 


383 ... 




l355 


CCCCCAAG GGCTAGCTACAACGA CTGGGTGG 


3730 


404 


CCACCCA CGGACGUG 

CGCCAGGG A CGGACGU 


135g 


CACGTCCG GGCTAGCTACAACGA CCCTGGCG 


3731 


408 


AGGGACGG A CGU 


i357 


GGCCCACG GGCTAGCTACAACGA CCGTCCCT 


3732 


487 


UGCUGUGG - g^g^gg^ 


x358 


CGCGCCCA GGCTAGCTACAACGA CCACAGCA 


3733 


592 


CCCGGGAG gg^g^ 




TTCGTCGG GGCTAGCTACAACGA CTCCCGGG 


3734 


596 


GGAGACCG - C — -— -g 


13g0 


GCTCTTCG GGCTAGCTACAACGA CGGTCTCC 


3735 


640 




1361 


GTCCACCA GGCTAGCTACAACGA CTCCACAA 


3736 


647 


UUGUGGAG ^gg^ 

GAUGGUGG A CAACCUGA 


i3£2 


TCAGGTTG GGCTAGCTACAACGA CCACCATC 


3737 


650 


GGUGGACA A CCUGAGGG 




CCCTCAGG GGCTAGCTACAACGA TGTCCACC 


3738 


688 


ACGUGGAG A UGACCGUG 




CACGGTCA GGCTAGCTACAACGA CTCCACGT 


3739 


691 




i3gs 


GCCCACGG GGCTAGCTACAACGA CATCTCCA 


3740 


712 


UGGAGAUG ^ ggggg^c 


13g6 


GTTGAGCG GGCTAGCTACAACGA CTGCGGGG 


3741 


719 


CCCCGCAG A ^gg^g 

GACGCUCA A CAUC U 


13g7 


CCAGGATG GGCTAGCTACAACGA TGAGCGTC 


3742 


731 


CGUGGUGG A UACAGGCA 


1368 


TGCCTGTA GGCTAGCTACAACGA CCACCAGG 


3743 


746 


CAGCAGUA A CUUUGCAG 


i3gg 


CTGCAAAG GGCTAGCTACAACGA TACTGCTG 


3744 


821 


AUACCGGG A CCUCCG A 


i37o 


TCCGGAGG GGCTAGCTACAACGA CCCGGTAT 


3745 


884 


GGGCACCG A CCU__ 


1371 


TTACCAGG GGCTAGCTACAACGA CGGTGCCC 


3746 


911 




■ 1372 


CAGTGACG GGCTAGCTACAACGA TGGGGCCA 


3747 


929 


UGG UGCCA ^ CAUUGCUG 


1373 


CAGCAATG GGCTAGCTACAACGA TGGCACGC 


3748 


94 8 


' U A A UCAGACAA 


1374 


TTGTCTGA GGCTAGCTACAACGA TCAGTGAT 


3749 


953 


UGAAUCAG A CAAGUUCU 


13?5 


AGAACTTG GGCTAGCTACAACGA CTGATTCA 


3750 




CUUCAUCA A CGGCUCCA 


137g 


TGGAGCCG GGCTAGCTACAACGA TGATGAAG 


3751 


977 




13?7 


CTTCCCAG GGCTAGCTACAACGA TGGAGCCG 


3752 


1012 


A^GCUGAG A UUGCCAGG 


137B . 


CCTGGCAA GGCTAGCTACAACGA CTCAGCAT 


3753 


1025 


CAGGCCUG A CGACUCCC 


13?9 


GGGAGTCG GGCTAGCTACAACGA CAGGCCTG 


3754 




GCCUGACG A CUCCCUGG 


138Q 


CCAGGGAG GGCTAGCTACAACGA CGTCAGGC 


3755 






1381 


CCAGAGAG GGCTAGCTACAACGA CAAAGAAA 


3756 


10S6 


UAAAGCAG A CCCACGUU 


1382 


AACGTGGG GGCTAGCTACAACGA CTGCTTTA 


3757 


1079 


CGUUCCCA A CCUCUUCU 


1383 


AGAAGAGG GGCTAGCTACAACGA TGGGAACG 


3758 


1121 


CCCCCUCA A CCAGUCUG 


1384 


CAGACTGG GGCTAGCTACAACGA TGAGGGGG 


3759 


1159 


GGAGCAUG A UCAUUGGA 


1385 


TCCAATGA GGCTAGCTACAACGA CATGCTCC 


3760 


1175 


AGGUAUCG A CCACUCGC 


1386 


GCGAGTGG GGCTAGCTACAACGA CGATACCT 


3761 


1240 


AUGAGGUG A UCAUUGUG 


1387 


CACAATGA GGCTAGCTACAACGA CACCTCAT 


3762 


1258 


GGGUGGAG A UCAAUGGA 


1388 


TCCATTGA GGCTAGCTACAACGA CTCCACCC 


3763 


1262 


GGAGAUCA A UGGACAGG 


1389 


CCTGTCCA GGCTAGCTACAACGA TGATCTCC 


3764 
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1266 


AUCAAUGG A CAGGAUCU 




AGATCCTG GGCTAGCTACAACGA CCATTGAT | 37 6 5 


1271 


UGGACAGG A UCUGAAAA 


- 


TTTTCAGA GGCTAGCTACAACGA CCTGlCCA 




1279 


AUCUGAAA A UGGACUGC 


i392 - 


GCAGTCCA GGCTAGCTACAACGA TTTCAGAT 


3767 


1283 


GAAAAUGG A CUGCAAGG 


i393 - 


CCTTGCAG GGCTAGCTACAACGA CCATTTTC 


37g3 


1298 


GGAGUACA A CUAUGACA 


94~~ 


TGTCATAG GGCTAGCTACAACGA TGTACTL'C 


37g9 


1304 


CAACUAUG A CAAGAGCA 




TGCTCTTG GGCTAGCTACAACGA CATAGTTG 


377Q 




1319 


CAUUGUGG A CAGUGGCA 


— 

1396 


TGCCACTG GGCTAGCTACAACGA CCACAATG 




1334 


CACCACCA A CCUUCGUU 




AACGAAGG GGCTAGCTACAACGA TGGTGGTG 


~ 3772 


1374 


GCAGUCAA A UCCAUCAA 


— 

221 — 


TTGATGGA GGCTAGCTACAACGA TTGACTGC 


3773 


1412 


GUUCCCUG A UGGUUUCU 




AGAAACCA GGCTAGCTACAACGA CAGGGAAC 


~~ 3774 


1469 


CCCUUGGA A CAUUUTjCC 




_140 


GGAAAATG GGCTAGCTACAACGA TCCAAGGG 


~ 3775 


1498 


UCUACCUA A UGGGUGAG 


14 1 


CTCACCCA GGCTAGCTACAACGA TAGGTAGA 


3776 


1514 


GGUUACCA A CCAGUCCU 


1402 


AGGACTGG GGCTAGCTACAACGA TGGTAACC 




1548 


CCGCAGCA A UACCUGCG 


1403 


CGCAGGTA GGCTAGCTACAACGA TGCTGCGG 


3778 


1568 


AGUGGAAG A UGUGGCCA 


4 4 


TGGCCACA GGCTAGCTACAACGA CTTCCACT 


~ 3779 




1585 


GUCCCAAG A CGACUGUU 


1405 ■■ 


AACAGTCG GGCTAGCTACAACGA CTTGGGAC 





15S9 


CCAAGACG A CUGUUACA 


1406 


TGTAACAG GGCTAGCTACAACGA CGTCTTGG 




1673 


UGUCUUUG A UCGGGCCC 


1407 


GGGCCCGA GGCTAGCTACAACGA CAAAGACA 


782 


1686 


GCCCGAAA A CGAAUUGG 


1408 


CCAATTCG GGCTAGCTACAACGA TTTCGGGC 


783 




1690 


GAAAACGA A UUGGCUUU 


1409 


AAAGCCAA GGCTAGCTACAACGA TCGTTTTC 


— 221 — 


1724 


UGUGCACG A UGAGUUCA 


1410 


TGAACTCA GGCTAGCTACAACGA CGTGCACA 


— — 


1735 


AGUUCAGG A CGGCAGCG 


1411 


CGCTGCCG GGCTAGCTACAACGA CCTGAACT 


— 21 


1769 


CACCUUGG A CAUGGAAG 


1412 


CTTCCATG GGCTAGCTACAACGA CCAAGGTG 




1778 


CAUGGAAG A CUGUGGCU 


1413 


AGCCACAG GGCTAGCTACAACGA CTTCCATG 


3788 


1790 


UGGCUACA A CAUUCCAC 


1414 


GTGGAATG GGCTAGCTACAACGA TGTAGCCA 


3789 

, 


1801 


UUCCACAG A CAGAUGAG 


— 1415 ■■ 


CTCATCTG GGCTAGCTACAACGA CTGTGGAA 




1805 


ACAGACAG A UGAGUCAA 


1416 


TTGACTCA GGCTAGCTACAACGA CTGTCTGT 


3791 


1813 


AUGAGUCA A CCCUCAUG 


1417 


CATGAGGG GGCTAGCTACAACGA TGACTCAT 


21 


1822 


CCCUCAUG A CCAUAGCC 


1418 


GGCTATGG GGCTAGCTACAACGA CATGAGGG 




1925 


GCAGCAUG A UGACUUUG 


1419 


CAAAGTCA GGCTAGCTACAACGA CATGCTGC 


3794 


1928 


GCAUGAUG A CUUUGCUG 


1420 


CAGCAAAG GGCTAGCTACAACGA CATCATGC 


3795 


1937 


CUUUGCUG A UGACAUCU 




AGATGTCA GGCTAGCTACAACGA CAGCAAAG 


3796 


1940 


UGCUGAUG A CAUCUCCC 


1422 


GGGAGATG GGCTAGCTACAACGA CATCAGCA 


3797 


1979 


GGCAGAAG A UAGAGAUU 


1423 


AATCTCTA GGCTAGCTACAACGA CTTCTGCC 




1985 


AGAUAGAG A UUCCCCUG 


14 4 


CAGGGGAA GGCTAGCTACAACGA CTCTATCT 


— J7T9" — 


1995 


UCCCCUGG A CCACACCU 


1425 


AGGTGTGG GGCTAGCTACAACGA CCAGGGGA 


3800 


2033 


AGUAGGAG A CACAGAUG 


1426 


CATCTGTG GGCTAGCTACAACGA CTCCTACT 


3801 


2039 


AGACACAG A UGGCACCU 


1427 


AGGTGCCA GGCTAGCTACAACGA CTGTGTCT 


3802 


2067 


ACCUCAGG A CCCUCCCC 




GGGGAGGG GGCTAGCTACAACGA CCTGAGGT 


3803 


2085 


CCCACCAA A UGCCUCUG 


-21 


CAGA3GCA GGCTAGCTACAACGA TTGGTGG3 
CTTCTCCA GGCTAGCTACAACGA CAAGGCAG 


— 3804 

3805 


2099 
2136 


CUGCCUUG A UGGAGAAG 
UUCCAGGG A CUGUACCU 




AGGTACAG GGCTAGCTACAACGA CCCTGGAA 


3806 


2152 


UGUAGGAA A CAGAAAAG 


1432 


CTTTTCTG GGCTAGCTACAACGA TTCCTACA 


3807 




UGGCGGGA A UACUCUUG 


1433 


CAAGAGTA GGCTAGCTACAACGA TCCCGCCA 


3808 


2208 


CACCUCAA A UUUAAGUC 


1434 


GACTTAAA GGCTAGCTACAACGA TTGAGGTG 


3809 


2222 


GUCGGGAA A UUCUGCUG 


1435 


CAGCAGAA GGCTAGCTACAACGA TTCCCGAC 


3810 


2237 


UGCUUGAA A CUUCAGCC 


1436 


GGCTGAAG GGCTAGCTACAACGA TTCAAGCA 


3811 


2250 


AGCCCUGA A CCUUUGUC 


1437 


GACAAAGG GGCTAGCTACAACGA TCAGGGCT 


3812 


2273 


UCCUUUAA A UUCUCCAA 


1438 


TTGGAGAA GGCTAGCTACAACGA TTAAAGGA 


3813 
3814 
3815 


2281 
2376 


AUUCUCCA A CCCAAAGU 
GAGAAGAG A CCAAGCUU 


1439 
1440 


ACTTTGGG GGCTAGCTACAACGA TGGAGAAT 
AAGCTTGG GGCTAGCTACAACGA CTCTTCTC 
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2417 
2444 
2450 
2459 
2468 
2482 
2494 
2507 



AGGAGAGG A UGCACAGU 
C UUUAGAG A CAGGGACU 
A GACAGGG A CUGUAUAA 
CUGUAUAA A CAAGCCUA 
CAAGCCUA A CAUUGGUG 
GUGCAAAG A UUGCCUCU 
CCUCUUGA A UUAAAAAA 
AAAAAAAA A CUAGAAAA 



ACTGTGCA GGCTAGCTACAACGA CCTCTCCT 
AGTC CCTG GGCTAGCTACAACGA CTCTAAAG 
TTATACAG GGCTAGCTACAACGA CCCTGTCT 
TAGGCTTG GGCTAGCTACAACGA TTATACAG 
CACCAATG GGCTAGCTACAACGA TAGGCTTG 
AGAGGCAA GGCTAGCTACAACGA CTTTGCAC 
TTTTTTAA GGCTAGCTACAACGA TCAAGAGG 
GGCTAGCTACAACGA TTTTTTTT 



Input Sequence = AF190725. Cut Site = G/ . 

Stem Length = 8 . Core Sequence = GGCTAGCTACAACGA 

AF190725 (Homo sapiens beta-site APP cleaving enzyme (BACE) mRNA; 2526 bp) 
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3535 




3537 




3539 








3543 






























3558 








GGUCGGUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CCAGCUCC | 


GAUGCUUA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGGUCGG 1 


GGGGGAUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUACCAGG 1 


CGUUGGGG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAUGGGGG 


CACAGUGA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG GUUGGGGC j 


UGUUGGCA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG GCACAGUG | 


AUGAAGAA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUGUCUGA 


AGUUGGAG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CGUUGAUG 1 


CCAGGAUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUUCCCAG 


UAGGCCAG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CCCAGGAU 1 


AGCAUAGG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGCCCCA 


UCGUCAGG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUGGCAAU j 


AAGAAAGG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCCAGGGA j 


CUGCUUUA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGAGAGU j 


UGGGUCUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUUACCAG 1 


GUUGGGAA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG GUGGGUCU ; 


CCACAAAG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGCAGGGA 1 


AGCCAGCA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CACAAAGC j 


GGGGGAAG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGCACCA 


ACUUCAGA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGGUUGAG 


GGCCAGCA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUCAGACU 


GACAGAGG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGCACUU 


UGAUCAUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCCCUCCG 


GGUCGAUA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUCCAAUG 


AGAGACUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUGUGUAC 


ACCAGAGA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGCCUGUG 


GGUGUAUA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGAGACU 


CACUCCCG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CGGAUGGG 


UAAUACCA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCCCGCCG 


UCAUAAUA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CACUCCCG 


AAUGAUCA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUCAUAAU 


GAUCUCCA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CCGCACAA 


UAGUUGUA GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCCUUGCA 


CCACAAUG GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCUUGUCA 




1229 










1234 


1235 








1239 


1240 




1242 


1243 
































1259 




1261 


GGAGCUGG G CACCGACC 1 


CCGACCUG G UAAGCAUC 


CCUGGUAA G CAUCCCCC 


CCCCCAUG G CCCCAACG 


GCCCCAAC G UCACUGUG 


CACUGUGC G UGCCAACA 


UCAGACAA G UUCUUCAU 


CAUCAACG G CUCCAACU 


CUGGGAAG G CAUCCUGG 


AUCCUGGG G CUGGCCUA 


UGGGGCUG G CCUAUGCU 


AUUGCCAG G CCUGACGA 


E 
< 

CJ 


ACUCUCUG G UAAAGCAG 


CUGGUAAA G CAGACCCA 


S 

•J 

3 
O 

8 
< 


UCCCUGCA G CUUUGUGG 


GCUUUGUG G UGCUGGCU 


UGGUGCUG G CUUCCCCC 


CUCAACCA G UCUGAAGU 


AGUCUGAA G UGCUGGCC 


AAGUGCUG G CCUCUGUC 


CGGAGGGA G CAUGAUCA 


CAUUGGAG G UAUCGACC 


8 
1 

< 


CACAGGCA G UCUCUGGU 


AGUCUCUG G UAUACACC 


CCCAUCCG G CGGGAGUG 


CGGCGGGA G UGGUAUUA 


CGGGAGUG G UAUUAUGA 


AUUAUGAG G UGAUCAUU 


UUGUGCGG G UGGAGAUC 


UGCAAGGA G UACAACUA 


j UGACAAGA G CAUUGUGG 






















0001 










1072 






















1203 


1218 




1227 




1252 


1293 
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3903 






3906 




3908 


3909 


3910 




3913 


3914 














3921 






3926 










3931 


3932 


3933 




1 3935 1 


CUUCUGCC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG AUGGGCCU 1 


UCUUCUGC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAUGGGCC J 


UCUAUCUU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGCCCAUG 1 


AUCUCUAU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUCUGCCC | 


GGGAAUCU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UAUCUUCU 1 


AGGGGAAU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCUAUCUU 


GUGUGGUC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG AGGGGAAU 


GGUGUGGU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CAGGGGAA 1 


§ % 

v> % 

<C < 

§ 3 
\ k 

t t 

B t 
B | 
g \ 

8 h 
u c 

5 I 

§ E 

it 


UGUGUCUC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UACUUGUG j 


CUGUGUCU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUACUUGU 1 


AUCUGUGU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UCCUACUU 


GGUGCCAU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGUGUCUC 


ACAGGUGC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG AUCUGUGU 


GCUCUGGC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG ACAGGUGC 


GAGGUGCU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGGCCACA 


\ GGAGGGUC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGAGGUGC 


GGGAGGGU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUGAGGUG 


| UCCUUCUC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG AUCAAGGC 


8 3 

% D 
lb 

5 t 
ft 

B c 

a 

1 * 
@ £ 

B = 

B t 

B I 


1 £ 

1 c 
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1 UUGCCAGC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUUUCCUU 


| CACCUUGC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG AGCCUUUU 


GAACCCAC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UUGCCAGC 


| CUGGAACC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG ACCUUGCC 


CCUGGAAC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CACCUUGC 


UACAGUCC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UGGAACCC 


| GUACAGUC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUGGAACC 


| GGUACAGU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CCUGGAAC 


UCUGUUUC GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG UACAGGUA 


j UUCUGUUU GGAGGAAACUCC CU UCAAGGACAUCGUCCGGG CUACAGGU 










1709 
























j 1722 ! 
























1736 
























AGGCCCAU G GGCAGAAG 


GGCCCAUG G GCAGAAGA 


CAUGGGCA G AAGAUAGA 


GGGCAGAA G AUAGAGAU 


AGAAGAUA G AGAUUCCC 


AAGAUAGA G AUUCCCCU 


AUUCCCCU G GACCACAC 


UUCCCCUG G ACCACACC 


ACCUCCGU G GUUCACUU 


CACAAGUA G GAGACACA 


ACAAGUAG G AGACACAG 


l 

u 

1 


GAGACACA G AUGGCACC 


1 ACACAGAU G GCACCUGU 


GCACCUGU G GCCAGAGC 


UGUGGCCA G AGCACCUC 


GCACCUCA G GACCCUCC 


\ 

I 

1 


GCCUUGAU G GAGAAGGA 


CCUUGAUG G AGAAGGAA 


AUGGAGAA G GAAAAGGC 


I i 

I 8 
I 8 

I 8 
? 5 


AAAAGGCU G GCAAGGUG 


(J 

6 

z 

1 

b 

1 


GGCAAGGU G GGUUCCAG 


GCAAGGUG G GUUCCAGG 


GGGUUCCA G GGACUGUA 


GGUUCCAG G GACUGUAC 


GUUCCAGG G ACUGUACC 


UACCUGUA G GAAACAGA 


| ACCUGUAG G AAACAGAA 


1 1970 1 






1978 




1984 


1993 




200B 






2032 


2038 ! 




2050 






2066 


















2133 






2148 
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Table 24: Human Phospholamban (PLN) Hammerhead Ribozyme and Target 
Sequence 



Pos 


Substrate 


Seq ID 


Ribozyme 


Rz Seq 
ID 


16 


AGAAAACU C CCCAGCUA 


1 


UAGCUGGG CUGAUGAG X CGAA AGUUUUCU 


1137 


24 


CCCCAGCU A AACACCCG 


2 


CGGGUGUU CUGAUGAG X CGAA AGCUGGGG 


1138 


34 


ACACCCGU A AGACUUCA 


3 


UGAAGUCU CUGAUGAG X CGAA ACGGGUGU 


1139 


40 


GUAAGACU U CAUACAAC 


4 


GUUGUAUG CUGAUGAG X CGAA AGUCUUAC 


1140 


41 


UAAGACUU C AUACAACA 


5 


UGUUGUAU CUGAUGAG X CGAA AAGUCUUA 


1141 


44 


GACUUCAU A CAACACAA 


6 


UUGUGUUG CUGAUGAG X CGAA AUGAAGUC 


1142 


54 


AACACAAU A CUCUAUAC 


7 


GUAUAGAG CUGAUGAG X CGAA AUUGUGUU 


1143 


57 


ACAAUACU C UAUACUGU 


8 


ACAGUAUA CUGAUGAG X CGAA AGUAUUGU 


1144 


59 


AAUACUCU A UACUGUGA 


9 


UCACAGUA CUGAUGAG X CGAA AGAGUAUU 


1145 


61 


UACUCUAU A CUGUGAUG 


10 


CAUCACAG CUGAUGAG X CGAA AUAGAGUA 


1146 


72 


GUGAUGAU C ACAGCUGC 


11 


GCAGCUGU CUGAUGAG X CGAA AUCAUCAC 


1147 


88 


CCAAGGCU A CCUAAAAG 


12 


CUUUUAGG CUGAUGAG X CGAA AGCCUUGG 


1148 


92 


GGCUACCU A AAAGAAGA 


13 


UCUUCUUU CUGAUGAG X CGAA AGGUAGCC 


1149 


105 


AAGACAGU U AUCUCAUA 


14 


UAUGAGAU CUGAUGAG X CGAA ACUGUCUU 


1150 


10S 


AGACAGUU A UCUCAUAU 


15 


AUAUGAGA CUGAUGAG X CGAA AACUGUCU 


1151 


108 


ACAGUUAU C UCAUAUUU 


16 


AAAUAUGA CUGAUGAG X CGAA AUAACUGU 


1152 


110 


AGUUAUCU C AUAUUUGG 


17 


CCAAAUAU CUGAUGAG X CGAA AGAUAACU 


1153 


113 


UAUCUCAU A UUUGGCUG 


18 


CAGCCAAA CUGAUGAG X CGAA AUGAGAUA 


1154 


115 


UCUCAUAU U UGGCUGCC 


19 


GGCAGCCA CUGAUGAG X CGAA AUAUGAGA 


1155 


116 


CUCAUAUU U GGCUGCCA 


20 


UGGCAGCC CUGAUGAG X CGAA AAUAUGAG 


1156 


128 


UGCCAGCU U UUUAUCUU 


21 


AAGAUAAA CUGAUGAG X CGAA AGCUGGCA 


1157 


129 


GCCAGCUU U UUAUCUUU 


22 


AAAGAUAA CUGAUGAG X CGAA AAGCUGGC 


1158 


130 


CCAGCUUU U UAUCUUUC 


23 


GAAAGAUA CUGAUGAG X CGAA AAAGCUGG 


1159 


131 


CAGCUUUU U AUCUUUCU 


24 


AGAAAGAU CUGAUGAG X CGAA AAAAGCUG 


1160 


132 


AGCUUUUU A UCUUUCUC 


25 


GAGAAAGA CUGAUGAG X CGAA AAAAAGCU 


1161 


134 


CUUUUUAU C UUUCUCUC 


26 


GAGAGAAA CUGAUGAG X CGAA AUAAAAAG 


1162 


136 


UUUUAUCU U UCUCUCGA 


27 


UCGAGAGA CUGAUGAG X CGAA AGAUAAAA 


1163 


137 


UUUAUCUU U CUCUCGAC 


28 


GUCGAGAG CUGAUGAG X CGAA AAGAUAAA 


1164 


138 


UUAUCUUU C UCUCGACC 


29 


GGUCGAGA CUGAUGAG X CGAA AAAGAUAA 


1165 


140 


AUCUUUCU C UCGACCAC 


30 


GUGGUCGA CUGAUGAG X CGAA AGAAAGAU 


1166 


142 


CUUUCUCU C GACCACUU 


31 


AAGUGGUC CUGAUGAG X CGAA AGAGAAAG 


1167 








GAAGUUUU CUGAUGAG X CGAA AGUGGUCG 




151 


GACCACUU A AAACUUCA 


33 


UGAAGUUU CUGAUGAG X CGAA AAGUGGUC 


1169 


157 


UUAAAACU U CAGACUUC 


34 


GAAGUCUG CUGAUGAG X CGAA AGUUUUAA 


1170 


158 


UAAAACUU C AGACUUCC 


35 


GGAAGUCU CUGAUGAG X CGAA AAGUUUUA 


1171 




UUCAGACU U CCUGUCCU 


36 


AGGACAGG CUGAUGAG X CGAA AGUCUGAA 


1172 


155 


UCAGACUU C CUGUCCUG 


37 


CAGGACAG CUGAUGAG X CGAA AAGUCUGA 


1173 


170 


CUUCCUGU C CUGCUGGU 


38 


ACCAGCAG CUGAUGAG X CGAA ACAGGAAG 


1174 


179 


CUGCUGGU A UCAUGGAG 


39 


CUCCAUGA CUGAUGAG X CGAA ACCAGCAG 


1175 


181 


GCUGGUAU C AUGGAGAA 


40 


UUCUCCAU CUGAUGAG X CGAA AUACCAGC 


1176 




GAGAAAGU C CAAUACCU 


41 


AGGUAUUG CUGAUGAG X CGAA ACUUUCUC 


1177 


198 


AGUCCAAU A CCUCACUC 


42 


GAGUGAGG CUGAUGAG X CGAA AUUGGACU 


1178 
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CAAUACCU C ACUCGCUC 



GAGCGAGU CUGAUGAG X CGAA AGGUAUUG 



ACCUCACU C GCUCAGCU 



AGCUGAGC CUGAUGAG X CGAA AGUGAGGU 



CACUCGCU C AGCUAUAA 



UUAUAGCU CUGAUGAG X CGAA AGCGAGUG 



GCUCAGCU A UAAGAAGA 



UCUUCUUA CUGAUGAG X CGAA AGCUGAGC 



UCAGCUAU A AGAAGAGC 



GCUCUUCU CUGAUGAG X CGAA AUAGCUGA 



AAGAGCCU C AACCAUUG 



CAAUGGUU CUGAUGAG X CGAA AGGCUCUU 



UCAACCAU U GAAAUGCC 



GGCAUUUC CUGAUGAG X CGAA AUGGUUGA 



AAAUGCCU C AACAAGCA 



UGCUUGUU CUGAUGAG X CGAA AGGCAUUU 



AAGCACGU C AAAAGCUA 



UAGCUUUU CUGAUGAG X CGAA ACGUGCUU 



CAAAAGCU A CAGAAUCU 



AGAUUCUG CUGAUGAG X CGAA AGCUUUUG 



UACAGAAU C UAUUUAUC 



GAUAAAUA CUGAUGAG X CGAA AUUCUGUA 



CAGAAUCU A UUUAUCAA 



UUGAUAAA CUGAUGAG X CGAA AGAUUCUG 



GAAUCUAU U UAUCAAUU 



AAUUGAUA CUGAUGAG X CGAA AUAGAUUC 



AAUCUAUU U AUCAAUUU 



AAAUUGAU CUGAUGAG X CGAA AAUAGAUU 



AUCUAUUU A UCAAUUUC 



GAAAUUGA CUGAUGAG X CGAA AAAUAGAU 



CUAUUUAU C AAUUUCUG 



CAGAAAUU CUGAUGAG X CGAA AUAAAUAG 



UUAUCAAU U UCUGUCUC 



i CUGAUGAG X CGAA AUUGAUAA 



UAUCAAUU U CUGUCUCA 



UGAGACAG CUGAUGAG X CGAA AAUUGAUA 



AUCAAUUU C UGUCUCAU 



AUGAGACA CUGAUGAG X CGAA AAAUUGAU 



AUUUCUGU C UCAUCUUA 



UAAGAUGA CUGAUGAG X CGAA ACAGAAAU 



UUCUGUCU C AUCUUAAU 



AUUAAGAU CUGAUGAG X CGAA AGACAGAA 



UGUCUCAU C UUAAUAUG 



CAUAUUAA CUGAUGAG X CGAA AUGAGACA 



UCUCAUCU U AAUAUGUC 



GACAUAUU CUGAUGAG X CGAA AGAUGAGA 



CUCAUCUU A AUAUGUCU 



AGACAUAU CUGAUGAG X CGAA AAGAUGAG 



AUCUUAAU A UGUCUCUU 



AAGAGACA CUGAUGAG X CGAA AUUAAGAU' 



UAAUAUGU C UCUUGCUG 



CAGCAAGA CUGAUGAG X CGAA ACAUAUUA 



AUAUGUCU C UUGCUGAU 



AUCAGCAA CUGAUGAG X CGAA AGACAUAU 



AUGUCUCU U GCUGAUCU 



AGAUCAGC CUGAUGAG X CGAA AGAGACAU 



UUGCUGAU C UGUAUCAU 



AUGAUACA CUGAUGAG X CGAA AUCAGCAA 



UGAUCUGU A UCAUCGUG 



CACGAUGA CUGAUGAG X CGAA ACAGAUCA 



AUCUGUAU C AUCGUGAU 



AUCACGAU CUGAUGAG X CGAA AUACAGAU 



UGUAUCAU C GUGAUGCU 



AGCAUCAC CUGAUGAG X CGAA AUGAUACA 



GUGAUGCU U CUCUGAAG 



CUUCAGAG CUGAUGAG X CGAA AGCAUCAC 



UGAUGCUU C UCUGAAGU 



ACUUCAGA CUGAUGAG X CGAA AAGCAUCA 



AUGCUUCU C UGAAGUUC 



GAACUUCA CUGAUGAG X CGAA AGAAGCAU 



UCUGAAGU U CUGCUACA 



UGUAGCAG CUGAUGAG X CGAA ACUUCAGA 



CUGAAGUU C UGCUACAA 



UUGUAGCA CUGAUGAG X CGAA AACUUCAG 



GUUCUGCU A CAACCUCU 



AGAGGUUG CUGAUGAG X CGAA AGCAGAAC 



UACAACCU C UAGAUCUG 



CAGAUCUA CUGAUGAG X CGAA AGGUUGUA 



CAACCUCU A GAUCUGCA 



UGCAGAUC CUGAUGAG X CGAA AGAGGUUG 



CUCUAGAU C UGCAGCUU 



AAGCUGCA CUGAUGAG X CGAA AUCUAGAG 



CUGCAGCU U GCCACAUC 



GAUGUGGC CUGAUGAG X CGAA AGCUGCAG 



UGCCACAU C AGCUUAAA 



UUUAAGCU CUGAUGAG X CGAA AUGUGGCA 



CAUCAGCU U AAAAUCUG 



CAGAUUUU CUGAUGAG X CGAA AGCUGAUG 



AUCAGCUU A AAAUCUGU 



ACAGAUUU CUGAUGAG X CGAA AAGCUGAU 



CUUAAAAU C UGUCAUCC 



GGAUGACA CUGAUGAG X CGAA AUUUUAAG 



AAAUCUGU C AUCCCAUG 



CAUGGGAU CUGAUGAG X CGAA ACAGAUUU 
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UCUGUCAU C CCAUGCAG 
AAAACAAU A UUGUAUAA 
AACAAUAU U GUAUAACA 
AAUAUUGU A UAACAGAC 
UAUUGUAU A ACAGACCA 
AGACCACU U CCUGAGUA 
GACCACUU C CUGAGUAG 
UCCUGAGU A GAAGAGUU 
AGAAGAGU U UCUUUGUG 
GAAGAGUU U CUUUGUGA 
AAGAGUUU C UUUGUGAA 
GAGUUUCU U UGUGAAAA 
AGUUUCUU U GUGAAAAG 



CUGCAUGG CUGAUGAG X CGAA AUGACAGA 

UUAUACAA CUGAUGAG X CGAA AUUGUUUU 

UGUUAUAC CUGAUGAG X CGAA AUAUUGUU 

GUCUGUUA CUGAUGAG X CGAA ACAAUAUU 

UGGUCUGU CUGAUGAG X CGAA AUACAAUA 

UACUCAGG CUGAUGAG X CGAA AGUGGUCU 

CUACUCAG CUGAUGAG X CGAA AAGUGGUC 

AACUCUUC CUGAUGAG X CGAA ACUCAGGA 

CACAAAGA CUGAUGAG X CGAA ACUCUUCU 

UCACAAAG CUGAUGAG X CGAA AACUCUUC 

UUCACAAA CUGAUGAG X CGAA AAACUCUU 

UUUUCACA CUGAUGAG X CGAA AGAAACUC 

CUUUUCAC CUGAUGAG X CGAA AAGAAACU 

UUAAUCUU CUGAUGAG X CGAA ACCUUUUC 



GUCAAGAU U AAGACUAA 



UUAGUCUU CUGAUGAG X CGAA AUCUUGAC 



UCAAGAUU A AGACUAAA 



UUUAGUCU CUGAUGAG X CGAA AAUCUUGA 



UUAAGACU A AAACUUAU 



AUAAGUUU CUGAUGAG X CGAA AGUCUUAA 



CUAAAACU U AUUGUUAC 



GUAACAAU CUGAUGAG X CGAA AGUUUUAG 



UAAAACUU A UUGUUACC 



GGUAACAA CUGAUGAG X CGAA AAGUUUUA 



AAACUUAU U GUUACCAU 



AUGGUAAC CUGAUGAG X CGAA AUAAGUUU 



CUUAUUGU U ACCAUAUG 



CAUAUGGU CUGAUGAG X CGAA ACAAUAAG 



UUAUUGUU A CCAUAUGU 



ACAUAUGG CUGAUGAG X CGAA AACAAUAA 



GUUACCAU A UGUAUUCA 



UGAAUACA CUGAUGAG X CGAA AUGGUAAC 



CCAUAUGU A UUCAUCUG 



CAGAUGAA CUGAUGAG X CGAA ACAUAUGG 



AUAUGUAU U CAUCUGUU 



AACAGAUG CUGAUGAG X CGAA AUACAUAU 



UAUGUAUU C AUCUGUUG 



CAACAGAU CUGAUGAG X CGAA AAUACAUA 



GUAUUCAU C UGUUGGAU 



AUCCAACA CUGAUGAG X CGAA AUGAAUAC 



UCAUCUGU U GGAUCUUG 



CAAGAUCC CUGAUGAG X CGAA ACAGAUGA 



UGUUGGAU C UUGUAAAC 



GUUUACAA CUGAUGAG X CGAA AUCCAACA 



UUGGAUCU U GUAAACAU 



AUGUUUAC CUGAUGAG X CGAA AGAUCCAA 



GAUCUUGU A AACAUGAA 



UUCAUGUU CUGAUGAG X CGAA ACAAGAUC 



UGAAAAUA CUGAUGAG X CGAA AGCCCUUU 



AAGGGCUU U AUUUUCAA 



UUGAAAAU CUGAUGAG X CGAA AAGCCCUU 



AGGGCUUU A UUUUCAAA 



UUUGAAAA CUGAUGAG X CGAA AAAGCCCU 



GGCUUUAU U UUCAAAAA 



UUUUUGAA CUGAUGAG X CGAA AUAAAGCC 



GCUUUAUU U UCAAAAAU 



AUUUUUGA CUGAUGAG X CGAA AAUAAAGC 



555 

559 
560 



587 
591 



CUUUAUUU U CAAAAAUU 
UUUAUUUU C AAAAAUUA 
UCAAAAAU U AACUUCAA 
CAAAAAUU A ACUUCAAA 
AAUUAACU U CAAAAUAA 
AUUAACUU C AAAAUAAG 
UUCAAAAU A AGUGUAUA 
AUAAGUGU A UAAAAUGC 
AAGUGUAU A AAAUGCAA 
GCAACUGU U GAUUUCCU 
CUGUUGAU U UCCUCAAC 



UAAUUUUU CUGAUGAG X CGAA AAAAUAAA 
UUGAAGUU CUGAUGAG X CGAA AUUUUUGA 
UUUGAAGU CUGAUGAG X CGAA AAUUUUUG - 
UUAUUUUG CUGAUGAG X CGAA AGUUAAUU 
CUUAUUUU CUGAUGAG X CGAA AAGUUAAU 
UAUACACU CUGAUGAG X CGAA AUUUUGAA 
GCAUUUUA CUGAUGAG X CGAA ACACUUAU 
UUGCAUUU CUGAUGAG X CGAA AUACACUU 
AGGAAAUC CUGAUGAG X CGAA ACAGUUGC 
GUUGAGGA CUGAUGAG X CGAA AUCAACAG 
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UGUUGAUU U CCUCAACA 



UGUUGAGG CUGAUGAG X CGAA AAUCAACA 



GUUGAUUU C CUCAACAU 



AUGUUGAG CUGAUGAG X CGAA AAAUCAAC 



GAUUUCCU C AACAUGGC 



GCCAUGUU CUGAUGAG X CGAA AGGAAAUC 



ACAUGGCU C ACAAAUUU 



AAAUUUGU CUGAUGAG X CGAA AGCCAUGU 



UCACAAAU U UCUAUCCC 



GGGAUAGA CUGAUGAG X CGAA AUUUGUGA 



CACAAAUU U CUAUCCCA 



UGGGAUAG CUGAUGAG X CGAA AAUUUGUG 



ACAAAUUU C UAUCCCAA 



UUGGGAUA CUGAUGAG X CGAA AAAUUUGU 



AAAUUUCU A UCCCAAAU 



AUUUGGGA CUGAUGAG X CGAA AGAAAUUU 



AUUUCUAU C CCAAAUCU 



AGAUUUGG CUGAUGAG X CGAA AUAGAAAU 



UCCCAAAU C UUUUCUGA 



UCAGAAAA CUGAUGAG X CGAA AUUUGGGA 



CCAAAUCU U UUCUGAAG 



CUUCAGAA CUGAUGAG X CGAA AGAUUUGG 



CAAAUCUU U UCUGAAGA 



UCUUCAGA CUGAUGAG X CGAA AAGAUUUG 



AAAUCUUU U CUGAAGAU 



AUCUUCAG CUGAUGAG X CGAA AAAGAUUU 



AAUCUUUU C UGAAGAUG 



CAUCUUCA CUGAUGAG X CGAA AAAAGAUU 



UGAAGAGU U UAGUUUUA 



UAAAACUA CUGAUGAG X CGAA ACUCUUCA 



GAAGAGUU U AGUUUUAA 



UUAAAACU CUGAUGAG X CGAA AACUCUUC 



AAGAGUUU A GUUUUAAA 



UUUAAAAC CUGAUGAG X CGAA AAACUCUU 



AGUUUAGU U UUAAAACU 



AGUUUUAA CUGAUGAG X CGAA ACUAAACU 



GUUUAGUU U UAAAACUG 



CAGUUUUA CUGAUGAG X CGAA AACUAAAC 



UUUAGUUU U AAAACUGC 



GCAGUUUU CUGAUGAG X CGAA AAACUAAA 



UUAGUUUU A AAACUGCA 



UGCAGUUU CUGAUGAG X CGAA AAAACUAA 



CAACAAGU U CACUUCAU 



AUGAAGUG CUGAUGAG X CGAA ACUUGUUG 



AACAAGUU C ACUUCAUA 



UAUGAAGU CUGAUGAG X CGAA AACUUGUU 



AGUUCACU U CAUAUAUA 



UAUAUAUG CUGAUGAG X CGAA AGUGAACU 



GUUCACUU C AUAUAUAA 



UUAUAUAU CUGAUGAG X CGAA AAGUGAAC 



CACUUCAU A UAUAAAGC 



GCUUUAUA CUGAUGAG X CGAA AUGAAGUG 



CUUCAUAU A UAAAGCAU 



AUGCUUUA CUGAUGAG X CGAA AUAUGAAG 



UCAUAUAU A AAGCAUUA 



UAAUGCUU CUGAUGAG X CGAA AUAUAUGA 



UAAAGCAU U AUUUUUAC 



GUAAAAAU CUGAUGAG X CGAA AUGCUUUA 



AAAGCAUU A UUUUUACU 



AGUAAAAA CUGAUGAG X CGAA AAUGCUUU 



AGCAUUAU U UUUACUCU 



AGAGUAAA CUGAUGAG X CGAA AUAAUGCU 



GCAUUAUU U UUACUCUU 



AAGAGUAA CUGAUGAG X CGAA AAUAAUGC 



CAUUAUUU U UACUCUUU 



AAAGAGUA CUGAUGAG X CGAA AAAUAAUG 



AUUAUUUU U ACUCUUUU 



AAAAGAGU CUGAUGAG X CGAA AAAAUAAU 



UUAUUUUU A CUCUUUUG 



CAAAAGAG CUGAUGAG X CGAA AAAAAUAA 



UUUUUACU C UUUUGAGG 



CCUCAAAA CUGAUGAG X CGAA AGUAAAAA 



UUUACUCU U UUGAGGUG 



CACCUCAA CUGAUGAG X CGAA AGAGUAAA 



UUACUCUU U UGAGGUGA 



UCACCUCA CUGAUGAG X CGAA AAGAGUAA 



UACUCUUU U GAGGUGAA 



UUCACCUC CUGAUGAG X CGAA AAAGAGUA 



AGGUGAAU A UAAUUUAU 



AUAAAUUA CUGAUGAG X CGAA AUUCACCU 



GUGAAUAU A AUUUAUAU 



AUAUAAAU CUGAUGAG X CGAA AUAUUCAC 



AAUAUAAU U UAUAUUAC 



GUAAUAUA CUGAUGAG X CGAA AUUAUAUU 



AUAUAAUU U AUAUUACA 



UGUAAUAU CUGAUGAG X CGAA AAUUAUAU 



UAUAAUUU A UAUUACAA 



UUGUAAUA CUGAUGAG X CGAA AAAUUAUA 



UAAUUUAU A UUACAAUG 



CAUUGUAA CUGAUGAG X CGAA AUAAAUUA 



AUUUAUAU U ACAAUGUA 



UACAUUGU CUGAUGAG X CGAA AUAUAAAU 



UUUAUAUU A CAAUGUAA 



UUACAUUG CUGAUGAG X CGAA AAUAUAAA 
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UACAAUGU A AAAGCUUC 
UAAAAGCU U CUUUAAUA 
AAAAGCUU C UUUAAUAC 
AAGCUUCU U UAAUACUA 
AGCUUCUU U AAUACUAA 
GCUUCUUU A AUACUAAG 
UCUUUAAU A CUAAGUAU 
UUAAUACU A AGUAUUUU 
UACUAAGU A UUUUUCAG 
CUAAGUAU U UUUCAGGU 
UAAGUAUU U UUCAGGUC 
AAGUAUUU U UCAGGUCU 
AGUAUUUU U CAGGUCUU 
GUAUUUUU C AGGUCUUC 
UUUCAGGU C UUCACCAA 
UCAGGUCU U CACCAAGU 
CAGGUCUU C ACCAAGUA 
CACCAAGU A UCAAAGUA 
CCAAGUAU C AAAGUAAU 
AUCAAAGU A AUAACACA 
AAAGUAAU A ACACAAAU 
UGAAGUGU C AUUAUUCA 
AGUGUCAU U AUUCAAAA 
GUGUCAUU A UUCAAAAU 
GUCAUUAU U CAAAAUAG 
UCAUUAUU C AAAAUAGU 
UUCAAAAU A GUCCACUG 
AAAAUAGU C CACUGACU 
CACUGACU C 



GAAGCUUU CUGAUGAG X CGAA ACAUUGUA 
UAUUAAAG CUGAUGAG X CGAA AGCUUUUA 
GUAUUAAA CUGAUGAG X CGAA AAGCUUUU 
UAGUAUUA CUGAUGAG X CGAA AGAAGCUU 
UUAGUAUU CUGAUGAG X CGAA AAGAAGCU 
CUUAGUAU CUGAUGAG X CGAA AAAGAAGC 
AUACUUAG CUGAUGAG X CGAA AUUAAAGA 
AAAAUACU CUGAUGAG X CGAA AGUAUUAA 
CUGAAAAA CUGAUGAG X CGAA ACUUAGUA 
ACCUGAAA CUGAUGAG X CGAA AUACUUAG 
GACCUGAA CUGAUGAG X CGAA AAUACUUA 
AGACCUGA CUGAUGAG X CGAA AAAUACUU 
AAGACCUG CUGAUGAG X CGAA AAAAUACU 
GAAGACCU CUGAUGAG X CGAA AAAAAUAC 
UUGGUGAA CUGAUGAG X CGAA ACCUGAAA 
ACUUGGUG CUGAUGAG X CGAA AGACCUGA 
UACUUGGU CUGAUGAG X CGAA AAGACCUG 
UACUUUGA CUGAUGAG X CGAA ACUUGGUG 
AUUACUUU CUGAUGAG X CGAA AUACUUGG 
UGUGUUAU CUGAUGAG X CGAA ACUUUGAU 
AUUUGUGU CUGAUGAG X CGAA AUUACUUU 
UGAAUAAU CUGAUGAG X CGAA ACACUUCA 
UUUUGAAU CUGAUGAG X CGAA AUGACACU 
AUUUUGAA CUGAUGAG X CGAA AAUGACAC 
CUAUUUUG CUGAUGAG X CGAA AUAAUGAC 
ACUAUUUU CUGAUGAG X CGAA AAUAAUGA 
CAGUGGAC CUGAUGAG X CGAA AUUUUGAA 
AGUCAGUG CUGAUGAG X CGAA ACUAUUUU 
GAUGUGAG CUGAUGAG X CGAA AGUCAGUG 



UGACUCCU C ACAUCUGU 



ACAGAUGU CUGAUGAG X CGAA 



CCUCACAU C UGUUAUCU 



AGAUAACA CUGAUGAG X CGAA AUGUGAGG 



ACAUCUGU U AUCUUAUU 



AAUAAGAU CUGAUGAG X CGAA ACAGAUGU 



847 
849 



CAUCUGUU A UCUUAUUA 



UAAUAAGA CUGAUGAG X CGAA AACAGAUG 



UCUGUUAU C UUAUUAUA 



UAUAAUAA CUGAUGAG X CGAA AUAACAGA 



UGUUAUCU U AUUAUAAA 



UUUAUAAU CUGAUGAG X CGAA AGAUAACA 



GUUAUCUU A UUAUAAAG 



CUUUAUAA CUGAUGAG X CGAA AAGAUAAC 



UAUCUUAU U AUAAAGAA 



UUCUUUAU CUGAUGAG X CGAA AUAAGAUA 



AUCUUAUU A UAAAGAAC 



GUUCUUUA CUGAUGAG X CGAA AAUAAGAU 



CUUAUUAU A AAGAACUA 



UAGUUCUU CUGAUGAG X CGAA AUAAUAAG 



AAAGAACU A UUUGUAGU 
AGAACUAU U UGUAGUAA 
GAACUAUU U GUAGUAAC 
CUAUUUGU A GUAACUAU 
UUUGUAGU A ACUAUCAG 
UAGUAACU A UCAGAAUC 
GUAACUAU C AGAAUCUA 
AUCAGAAU C UACAUUCU 



ACUACAAA C ! 



X CGAA AGUUCUUU 



UUACUACA CUGAUGAG X CGAA AUAGUUCU 

GUUACUAC CUGAUGAG X CGAA AAUAGUUC 

AUAGUUAC CUGAUGAG X CGAA ACAAAUAG 

CUGAUAGU CUGAUGAG X CGAA ACUACAAA 

GAUUCUGA CUGAUGAG X CGAA AGUUACUA 

UAGAUUCU CUGAUGAG X CGAA AUAGUUAC 

AGAAUGUA CUGAUGAG X CGAA AUUCUGAU 
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CAGAAUCU A CAUUCUAA 
AUCUACAU U CUAAAACA 
UCUACAUU C UAAAACAG 
UACAUUCU A AAACAGAA 
ACAGAAAU U GUAUUUUU 
GAAAUUGU A UUUUUUCU 
AAUUGUAU U UUUUCUAU 
AUUGUAUU U UUUCUAUG 
UUGUAUUU U UUCUAUGC 
UGUAUUUU U UCUAUGCC 
U CUAUGCCA 



UUAGAAUG CUGAUGAG X CGAA AGAUUCUG 
UGUUUUAG CUGAUGAG X CGAA AUGUAGAU 
CUGUUUUA CUGAUGAG X CGAA AAUGUAGA 
UUCUGUUU CUGAUGAG X CGAA AGAAUGUA 
AAAAAUAC CUGAUGAG X CGAA AUUUCUGU 
AGAAAAAA CUGAUGAG X CGAA ACAAUUUC 
AUAGAAAA CUGAUGAG X CGAA AUACAAUU 
CAUAGAAA CUGAUGAG X CGAA AAUACAAU 
GCAUAGAA CUGAUGAG X CGAA AAAUACAA 
GGCAUAGA CUGAUGAG X CGAA AAAAUACA 
UGGCAUAG CUGAUGAG X CGAA 



UAUUUUUU C UAUGCCAC 



GUGGCAUA CUGAUGAG X CGAA AAAAAAUA 



UUUUUUCU A UGCCACAU 



AUGUGGCA CUGAUGAG X CGAA AGAAAAAA 



UGCCACAU U AACAUCUU 



AAGAUGUU CUGAUGAG X CGAA AUGUGGCA 



GCCACAUU A ACAUCUUU 



AAAGAUGU CUGAUGAG X CGAA AAUGUGGC 



AUUAACAU C UUUUAAAG 



CUUUAAAA CUGAUGAG X CGAA AUGUUAAU 



UAACAUCU U UUAAAGUU 



AACUUUAA CUGAUGAG X CGAA AGAUGUUA 



AACAUCUU U UAAAGUUG 



CAACUUUA CUGAUGAG X CGAA AAGAUGUU 



ACAUCUUU U AAAGUUGA 



UCAACUUU CUGAUGAG X CGAA AAAGAUGU 



CAUCUUUU A AAGUUGAU 



AUCAACUU CUGAUGAG X CGAA AAAAGAUG 



UUUAAAGU U GAUGAGAA 



UUCUCAUC CUGAUGAG X CGAA ACUUUAAA 



AUGAGAAU C AAGUAUGG 



CCAUACUU CUGAUGAG X CGAA AUUCUCAU 



AAUCAAGU A UGGAAAAG 



CUUUUCCA CUGAUGAG X CGAA ACUUGAUU 



GGAAAAGU A AGGCCAUA 



UAUGGCCU CUGAUGAG X CGAA ACUUUUCC 



AAGGCCAU A CUCUUACA 



UGUAAGAG CUGAUGAG X CGAA AUGGCCUU 



GCCAUACU C UUACAUAA 



UUAUGUAA CUGAUGAG -X CGAA AGUAUGGC 



CAUACUCU U ACAUAAUA 



UAUUAUGU CUGAUGAG X CGAA AGAGUAUG 



AUACUCUU A CAUAAUAA 



UUAUUAUG CUGAUGAG X CGAA AAGAGUAU 



UCUUACAU A AUAAAAUU 



AAUUUUAU CUGAUGAG X CGAA AUGUAAGA 



UACAUAAU A AAAUUCCU 



AGGAAUUU CUGAUGAG X CGAA AUUAUGUA 



1001 
1005 
1008 
1009 
1010 
1011 
1012 
1013 
1021 
1029 



U CCUUUUAA 
AUAAAAUU C CUUUUAAG 
AAAUUCCU U UUAAGUAA 
AAUUCCUU U UAAGUAAU 
AUUCCUUU U AAGUAAUU 
UUCCUUUU A AGUAAUUU 
UUUUAAGU A AUUUUUUC 
UAAGUAAU U UUUUCAAA 
AAGUAAUU U UUUCAAAG 
AGUAAUUU U UUCAAAGA 
GUAAUUUU U UCAAAGAA 
UAAUUUUU U CAAAGAAU 
AAUUUUUU C AAAGAAUC 
CAAAGAAU C ACAGAAUU 
CACAGAAU U CUAGUACA 
ACAGAAUU C UAGUACAU 
AGAAUUCU A GUACAUGU 



UUAAAAGG CUGAUGAG X CGAA AUUUUAUU 
CUUAAAAG CUGAUGAG X CGAA AAUUUUAU 
UUACUUAA CUGAUGAG X CGAA AGGAAUUU 
AUUACUUA CUGAUGAG X CGAA AAGGAAUU 
AAUUACUU CUGAUGAG X CGAA AAAGGAAU 
AAAUUACU CUGAUGAG X CGAA AAAAGGAA 
GAAAAAAU CUGAUGAG X CGAA ACUUAAAA 
UUUGAAAA CUGAUGAG X, CGAA AUUACUUA 
CUUUGAAA CUGAUGAG X CGAA AAUUACUU 
UCUUUGAA CUGAUGAG X CGAA AAAUUACU 
UUCUUUGA CUGAUGAG X CGAA AAAAUUAC 
AUUCUUUG CUGAUGAG X CGAA AAAAAUUA 
GAUUCUUU CUGAUGAG X CGAA AAAAAAUU 
AAUUCUGU CUGAUGAG X CGAA AUUCUUUG 
UGUACUAG CUGAUGAG X CGAA AUUCUGUG 
AUGUACUA CUGAUGAG X CGAA AAUUCUGU 
ACAUGUAC CUGAUGAG X CGAA AGAAUUCU 
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1049 
1052 
1056 
1060 
1061 
1063 
1070 
1075 
1096 
1097 
1100 
1115 
1116 
1119 
1122 



AUUCUAGU A CAUGUAGG 
GUACAUGU A GGUAAAUC 
AUGUAGGU A AAUCAUAA 
AGGUAAAU C AUAAAUCU 
UAAAUCAU A AAUCUGUU 
UCAUAAAU C UGUUCUAA 
AAAUCUGU U CUAAGACA 
AAUCUGUU C UAAGACAU 
UCUGUUCU A AGACAUAU 
UAAGACAU A UGAUCAAC 
CAUAUGAU C AACAGAUG 
CUGGUGGU U AAUAUGUG 
UGGUGGUU A AUAUGUGA 
UGGUUAAU A UGUGACAG 
AGUGAGAU U AGUCAUAU 
GUGAGAUU A GUCAUAUC 
AGAUUAGU C AUAUCACU 
UUAGUCAU A UCACUAAU 
AGUCAUAU C ACUAAUAU 
AUAUCACU A AUAUACUA 
UCACUAAU A UACUAACA 
ACUAAUAU A CUAACAAC 
AAUAUACU A ACAACAGA 
AACAGAAU C UAAUCUUC 
CAGAAUCU A AUCUUCAU 
AAUCUAAU C UUCAUUUA 
UCUAAUCU U CAUUUAAG 
CUAAUCUU C AUUUAAGG 
AUCUUCAU U UAAGGCAC 
UCUUCAUU U AAGGCACU 
CUUCAUUU A AGGCACUG 
GGCACUGU A GUGAAUUA 



1201 
1202 
1207 
1210 
1212 
1214 
1216 
1217 



UAGUGAAU U 
AGUGAAUU A UCUGAGCU 
UGAAUUAU C UGAGCUAG 
UCUGAGCU A GAGUUACC 
GCUAGAGU U ACCUAGCU 
CUAGAGUU A CCUAGCUU 
AGUUACCU A GCUUACCA 
ACCUAGCU U ACCAUACU 
CCUAGCUU A CCAUACUA 
CUUACCAU A CUAUAUCU 
ACCAUACU A UAUCUUUG 
CAUACUAU A UCUUUGGA 
UACUAUAU C UUUGGAAU 
CUAUAUCU U UGGAAUCA 
UAUAUCUU U GGAAUCAU 



CCUACAUG CUGAUGAG X CGAA ACUAGAAU 
GAUUUACC CUGAUGAG X CGAA ACAUGUAC 
UUAUGAUU CUGAUGAG X CGAA ACCUACAU 
AGAUUUAU CUGAUGAG X CGAA AUUUACCU 
AACAGAUU CUGAUGAG X CGAA AUGAUUUA 
UUAGAACA CUGAUGAG X CGAA AUUUAUGA 
UGUCUUAG CUGAUGAG X CGAA ACAGAUUU 
AUGUCUUA CUGAUGAG X CGAA AACAGAUU 
AUAUGUCU CUGAUGAG X CGAA AGAACAGA 
GUUGAUCA CUGAUGAG X CGAA AUGUCUUA 
CAUCUGUU CUGAUGAG X CGAA AUCAUAUG 
CACAUAUU CUGAUGAG X CGAA ACCACCAG 
UCACAUAU CUGAUGAG X CGAA AACCACCA 
CUGUCACA CUGAUGAG X CGAA AUUAACCA 
AUAUGACU CUGAUGAG X CGAA AUCUCACU 
GAUAUGAC CUGAUGAG X CGAA AAUCUCAC 
AGUGAUAU CUGAUGAG X CGAA- ACUAAUCU 
AUUAGUGA CUGAUGAG X CGAA AUGACUAA 
AUAUUAGU CUGAUGAG X CGAA AUAUGACU 
UAGUAUAU CUGAUGAG X CGAA AGUGAUAU 
UGUUAGUA CUGAUGAG X CGAA AUUAGUGA 
GUUGUUAG CUGAUGAG X CGAA AUAUUAGU 
UCUGUUGU CUGAUGAG X CGAA AGUAUAUU 
GAAGAUUA CUGAUGAG X CGAA AUUCUGUU 
AUGAAGAU CUGAUGAG X CGAA AGAUUCUG 
UAAAUGAA CUGAUGAG X CGAA AUUAGAUU 
CUUAAAUG CUGAUGAG X CGAA AGAUUAGA 
CCUUAAAU CUGAUGAG X CGAA AAGAUUAG 
GUGCCUUA CUGAUGAG X CGAA AUGAAGAU 
AGUGCCUU CUGAUGAG X CGAA AAUGAAGA 
CAGUGCCU CUGAUGAG X CGAA AAAUGAAG 
UAAUUCAC CUGAUGAG X CGAA ACAGUGCC 
GCUCAGAU CUGAUGAG X CGAA AUUCACUA 
AGCUCAGA CUGAUGAG X CGAA AAUUCACU 
CUAGCUCA CUGAUGAG X CGAA AUAAUUCA 
GGUAACUC CUGAUGAG X CGAA AGCUCAGA 
AGCUAGGU CUGAUGAG X CGAA ACUCUAGC 
AAGCUAGG CUGAUGAG X CGAA AACUCUAG 
UGGUAAGC CUGAUGAG X CGAA AGGUAACU 
AGUAUGGU CUGAUGAG X CGAA AGCUAGGU 
UAGUAUGG CUGAUGAG X CGAA AAGCUAGG 
AGAUAUAG CUGAUGAG X CGAA AUGGUAAG 
CAAAGAUA CUGAUGAG X CGAA AGUAUGGU 
UCCAAAGA CUGAUGAG X CGAA AUAGUAUG 
AUUCCAAA CUGAUGAG X CGAA AUAUAGUA 
UGAUUCCA CUGAUGAG X CGAA AGAUAUAG 
AUGAUUCC CUGAUGAG X CGAA AAGAUAUA 
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1240 
1241 
1250 
1251 
1252 
1259 
12S2 
1264 
12S5 
1269 
1270 
1277 
1282 
1302 



1327 
1329 



1332 
1334 



1363 
1365 
1367 
1369 
1375 
1377 
1378 
1382 
1385 
1386 
1388 



UUUGGAAU C AUGAAACC 
UGAAACCU U AAGACUUC 
GAAACCUU A AGACUUCA 
UUAAGACU U CAGAAUGA 
UAAGACUU C AGAAUGAU 
AGAAUGAU U UUGCAGGU 
GAAUGAUU U UGCAGGUU 
AAUGAUUU U GCAGGUUG 
UUGCAGGU U GUCUUCCA 
CAGGUUGU C UUCCAUUC 
GGUUGUCU U CCAUUCCA 
GUUGUCUU C CAUUCCAG 
UCUUCCAU U. CCAGCCUA 
CUUCCAUU C CAGCCUAA 
UCCAGCCU A ACAUCCAA 
CCUAACAU C CAAUGCAG 
AGGAAAAU A AAAGAUUU 
UAAAAGAU U UCCAGUGA 
AAAAGAUU U CCAGUGAC 
AAAGAUUU C CAGUGACA 
AGAAAAAU A UAUUAUCU 
AAAAAUAU A UUAUCUCA 
AAAUAUAU U AUCUCAAG 
AAUAUAUU A UCUCAAGU 
UAUAUUAU C UCAAGUAU 
UAUUAUCU C AAGUAUUU 
UCUCAAGU A UUUUUUAA 
UCAAGUAU U UUUUAAAA 
CAAGUAUU U UUUAAAAA 
AAGUAUUU U UUAAAAAU 
AGUAUUUU U UAAAAAUA 
GUAUUUUU U AAAAAUAU 
UAUUUUUU A AAAAUAUA 
UUAAAAAU A UAUGAAUU 
AAAAAUAU A UGAAUUCU 
AUAUGAAU U CUCUCUCC 
UAUGAAUU C UCUCUCCA 
UGAAUUCU C UCUCCAAA 
AAUUCUCU C UCCAAAUA 
UUCUCUCU C CAAAUAUU 
CUCCAAAU A UUAACUAA 
CCAAAUAU U AACUAAUU 
CAAAUAUU A ACUAAUUA 
UAUUAACU A AUUAUUAG 
UAACUAAU U AUUAGAUU 
AACUAAUU A UUAGAUUA 
CUAAUUAU U AGAUUAUA 



GGUUUCAU CUGAUGAG X CGAA AUUCCAAA 
GAAGUCUU CUGAUGAG X CGAA AGGUUUCA 
UGAAGUCU CUGAUGAG X CGAA AAGGUUUC 
UCAUUCUG CUGAUGAG X CGAA AGUCUUAA 
AUCAUUCU CUGAUGAG X CGAA AAGUCUUA 
ACCUGCAA CUGAUGAG X CGAA AUCAUUCU 
AACCUGCA CUGAUGAG X CGAA AAUCAUUC 
CAACCUGC CUGAUGAG X CGAA AAAUCAUU 
UGGAAGAC CUGAUGAG X CGAA ACCUGCAA 
GAAUGGAA CUGAUGAG X CGAA ACAACCUG 
UGGAAUGG CUGAUGAG X CGAA AGACAACC 
CUGGAAUG CUGAUGAG X CGAA AAGACAAC 
UAGGCUGG CUGAUGAG X CGAA AUGGAAGA 
UUAGGCUG CUGAUGAG X CGAA AAUGGAAG 
UUGGAUGU CUGAUGAG X CGAA AGGCUGGA 
CUGCAUUG CUGAUGAG X CGAA AUGUUAGG 
AAAUCUUU CUGAUGAG X CGAA AUUUUCCU 
UCACUGGA CUGAUGAG X CGAA AUCUUUUA 
GUCACUGG CUGAUGAG X CGAA AAUCUUUU 
UGUCACUG CUGAUGAG X CGAA AAAUCUUU 
AGAUAAUA CUGAUGAG X CGAA AUUUUUCU 
UGAGAUAA CUGAUGAG X CGAA AUAUUUUU 
CUUGAGAU CUGAUGAG X CGAA AUAUAUUU 
ACUUGAGA CUGAUGAG X CGAA AAUAUAUU 
AUACUUGA CUGAUGAG X CGAA AUAAUAUA 
AAAUACUU CUGAUGAG X CGAA AGAUAAUA 
UUAAAAAA CUGAUGAG X CGAA ACUUGAGA 
UUUUAAAA CUGAUGAG X CGAA AUACUUGA 
UUUUUAAA CUGAUGAG X CGAA AAUACUUG 
AUUUUUAA CUGAUGAG X CGAA AAAUACUU 
UAUUUUUA 



X CGAA AAAAUACU 
AUAUUUUU CUGAUGAG X CGAA AAAAAUAC 
UAUAUUUU CUGAUGAG X CGAA AAAAAAUA 
AAUUCAUA CUGAUGAG X CGAA AUUUUUAA 
AGAAUUCA CUGAUGAG X CGAA AUAUUUUU 
GGAGAGAG CUGAUGAG X CGAA AUUCAUAU 
UGGAGAGA CUGAUGAG X CGAA AAUUCAUA 
UUUGGAGA CUGAUGAG X CGAA AGAAUUCA 
UAUUUGGA CUGAUGAG X CGAA AGAGAAUU 
AAUAUUUG CUGAUGAG X CGAA AGAGAGAA 
UUAGUUAA CUGAUGAG X CGAA AUUUGGAG 
AAUUAGUU CUGAUGAG X CGAA AUAUUUGG 
UAAUUAGU CUGAUGAG X CGAA AAUAUUUG 
CUAAUAAU CUGAUGAG X CGAA AGUUAAUA 
AAUCUAAU CUGAUGAG X CGAA AUUAGUUA 
UAAUCUAA CUGAUGAG X CGAA AAUUAGUU 
UAUAAUCU CUGAUGAG X CGAA AUAAUUAG 
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UAAUUAUU A GAUUAUAU 



AUAUAAUC CUGAUGAG X CGAA AAUAAUUA 



UAUUAGAU U AUAUUUUG 



CAAAAUAU CUGAUGAG X CGAA AUCUAAUA 



AUUAGAUU A UAUUUUGA 



UCAAAAUA CUGAUGAG X CGAA AAUCUAAU 



UAGAUUAU A UUUUGAAA 



UUUCAAAA CUGAUGAG X CGAA AUAAUCUA 



GAUUAUAU U UUGAAAUG 



CAUUUCAA CUGAUGAG X CGAA AUAUAAUC 



AUUAUAUU U UGAAAUGA 



UCAUUUCA CUGAUGAG X CGAA AAUAUAAU 



UUAUAUUU U GAAAUGAA 



UUCAUUUC CUGAUGAG X CGAA AAAUAUAA 



AAUGAACU U GUUGGCCC 



GGGCCAAC CUGAUGAG X CGAA AGUUCAUU 



GAACUUGU U GGCCCAUC 



GAUGGGCC CUGAUGAG X CGAA ACAAGUUC 



UGGCCCAU C UAUUACAU 



AUGUAAUA CUGAUGAG X CGAA AUGGGCCA 



GCCCAUCU A UUACAUCU 



iGAUGUAA CUGAUGAG X CGAA AGAUGGGC 



CCAUCUAU U ACAUCUAC 



GUAGAUGU CUGAUGAG X CGAA AUAGAUGG 



CAUCUAUU A CAUCUACA 



UGUAGAUG CUGAUGAG X CGAA AAUAGAUG 



UAUUACAU C UACAGCUG 



CAGCUGUA CUGAUGAG X CGAA AUGUAAUA 



UUACAUCU A CAGCUGAC 



GUCAGCUG CUGAUGAG X CGAA AGAUGUAA 



CUGACCCU U GAACAUGG 



CCAUGUUC CUGAUGAG X CGAA AGGGUCAG 



AUGGGGGU U AGGGGAGC 



GCUCCCCU CUGAUGAG X CGAA ACCCCCAU 



UGGGGGUU A GGGGAGCU 



AGCUCCCC CUGAUGAG X CGAA AACCCCCA 



CUGACAAU U CGUGGGUC 



GACCCACG CUGAUGAG X CGAA AUUGUCAG 



UGACAAUU C GUGGGUCC 



GGACCCAC CUGAUGAG X CGAA AAUUGUCA 



UCGUGGGU C CGCAAAAU 



AUUUUGCG CUGAUGAG X CGAA ACCCACGA 



CGCAAAAU C UUAACUAC 



GUAGUUAA CUGAUGAG X CGAA AUUUUGCG 



CAAAAUCU U AACUACCU 



AGGUAGUU CUGAUGAG X CGAA AGAUUUUG 



AAAAUCUU A ACUACCUA 



UAGGUAGU CUGAUGAG X CGAA AAGAUUUU 



UCUUAACU A CCUAAUAG 



CUAUUAGG CUGAUGAG X CGAA AGUUAAGA 



AACUACCU A AUAGCCUA 



UAGGCUAU CUGAUGAG X CGAA AGGUAGUU 



UACCUAAU A GCCUACUA 



UAGUAGGC CUGAUGAG X CGAA AUUAGGUA 



AAUAGCCU A CUAUUGAC 



GUCAAUAG CUGAUGAG X CGAA AGGCUAUU 



AGCCUACU A UUGACCAU 



AUGGUCAA CUGAUGAG X CGAA AGUAGGCU 



CCUACUAU U GACCAUAA 



UUAUGGUC CUGAUGAG X CGAA AUAGUAGG 



UUGACCAU A AACCUUAC 



GUAAGGUU CUGAUGAG X CGAA AUGGUCAA 



AUAAACCU U ACUGAUAA 



UUAUCAGU CUGAUGAG X CGAA AGGUUUAU 



UAAACCUU A CUGAUAAC 



GUUAUCAG CUGAUGAG X CGAA AAGGUUUA 



UUACUGAU A ACAUAAAC 



GUUUAUGU CUGAUGAG X CGAA AUCAGUAA 



GAUAACAU A AACAGUAA 



UUACUGUU CUGAUGAG X CGAA AUGUUAUC 



UAAACAGU A AAUUAACA 



UGUUAAUU CUGAUGAG X CGAA ACUGUUUA 



CAGUAAAU U AACACAUA 



UAUGUGUU CUGAUGAG X CGAA AUUUACUG 



AGUAAAUU A ACACAUAU 



AUAUGUGU CUGAUGAG X CGAA AAUUUACU 



UAACACAU A UUUUGCGU 



ACGCAAAA CUGAUGAG X CGAA AUGUGUUA 



ACACAUAU U UUGCGUGU 



ACACGCAA CUGAUGAG X CGAA AUAUGUGU 



CACAUAUU U UGCGUGUU 



AACACGCA CUGAUGAG X CGAA AAUAUGUG 



ACAUAUUU U GCGUGUUA 



UAACACGC CUGAUGAG X CGAA AAAUAUGU 



UUGCGUGU U AUAUGUAU 



AUACAUAU CUGAUGAG X CGAA ACACGCAA 



UGCGUGUU A UAUGUAUU 



AAUACAUA CUGAUGAG X CGAA AACACGCA 



CGUGUUAU A UGUAUUAU 



AUAAUACA CUGAUGAG X CGAA AUAACACG 



UUAUAUGU A UUAUACAC 



GUGUAUAA CUGAUGAG X CGAA ACAUAUAA 



AUAUGUAU U AUACACUA 



UAGUGUAU CUGAUGAG X CGAA AUACAUAU 
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Table 24 



UAUGUAUU A UACACUAU 



AUAGUGUA CUGAUGAG X CGAA AAUACAUA 



UGUAUUAU A CACUAUAU 



AUAUAGUG CUGAUGAG X CGAA AUAAUACA 



UAUACACU A UAUUCCUA 



UAGGAAUA CUGAUGAG X CGAA AGUGUAUA 



UACACUAU A UUCCUACA 



UGUAGGAA CUGAUGAG X CGAA AUAGUGUA 



CACUAUAU U CCUACAAU 



AUUGUAGG CUGAUGAG X CGAA AUAUAGUG 



ACUAUAUU C CUACAAUA 



UAUUGUAG CUGAUGAG X CGAA AAUAUAGU 



AUAUUCCU A CAAUAAAG 



CUUUAUUG CUGAUGAG X CGAA AGGAAUAU 



CCUACAAU A AAGUAAGC 



GCUUACUU CUGAUGAG X CGAA AUUGUAGG 



AAUAAAGU A AGCUAGAG 



CUCUAGCU CUGAUGAG X CGAA ACUUUAUU 



AGUAAGCU A GAGAAAAU 



AUUUUCUC CUGAUGAG X CGAA AGCUUACU 



GAAAAUGU U AUUUAGAA 



UUCUAAAU CUGAUGAG X CGAA ACAUUUUC 



AAAAUGUU A UUUAGAAA 



UUUCUAAA CUGAUGAG X CGAA AACAUUUU 



AAUGUUAU U UAGAAAAU 



AUUUUCUA CUGAUGAG X CGAA AUAACAUU 



AUGUUAUU U AGAAAAUC 



GAUUUUCU CUGAUGAG X CGAA AAUAACAU 



UGUUAUUU A GAAAAUCA 



UGAUUUUC CUGAUGAG X CGAA AAAUAACA 



Input Sequence = PLN. Cut Site = UH/. 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II) 
PLN (Homo sapiens phospholamban (PLN) mRNA.; 1635 bp) 
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Table 25: Human Phospholamban (PLN) NCH Ribozyme and Target Sequence 



Ribozyme 



CAGAAAAC U AGCUAAAC 



GUUUAGCU CUGAUGAG X CGAA IUUUUCUG 



GAAAACUC C CUAAACAC 



GUGUUUAG CUGAUGAG X CGAA IAGUUUUC 



AAAACUCC C UAAACACC 



GGUGUUUA CUGAUGAG X CGAA IGAGUUUU 



AAACUCCC C AAACACCC 



GGGUGUUU CUGAUGAG X CGAA IGGAGUUU 



AACUCCCC A AACACCCG 



CGGGUGUU CUGAUGAG X CGAA IGGGAGUU 



UCCCCAGC U ACCCGUAA 



UUACGGGU CUGAUGAG X CGAA I CUGGGGA 



AGCUAAAC A UAAGACUU 



AAGUCUUA CUGAUGAG X CGAA IUUUAGCU 



CUAAACAC C AGACUUCA 



UGAAGUCU CUGAUGAG X CGAA IUGUUUAG 



UAAACACC C GACUUCAU 



AUGAAGUC CUGAUGAG X CGAA IGUGUUUA 



CGUAAGAC U ACAACACA 



UGUGUUGU CUGAUGAG X CGAA IUCUUACG 



AAGACUUC A ACACAAUA 



UAUUGUGU CUGAUGAG X CGAA IAAGUCUU 



CUUCAUAC A AAUACUCU 



AGAGUAUU CUGAUGAG X CGAA I UAUGAAG 



CAUACAAC A ACUCUAUA 



UAUAGAGU CUGAUGAG X CGAA IUUGUAUG 



UACAACAC A UCUAUACU 



AGUAUAGA CUGAUGAG X CGAA IUGUUGUA 



CACAAUAC U ACUGUGAU 



AUCACAGU CUGAUGAG X CGAA IUAUUGUG 



CAAUACUC U UGUGAUGA 



UCAUCACA CUGAUGAG X CGAA IAGUAUUG 



CUCUAUAC U UGAUCACA 



UGUGAUCA CUGAUGAG X CGAA I UAUAGAG 



UGAUGAUC A UGCCAAGG 



CCUUGGCA CUGAUGAG X CGAA IAUCAUCA 



AUGAUCAC A CCAAGGCU 



AGCCUUGG CUGAUGAG X CGAA IUGAUCAU 



AUCACAGC U AGGCUACC 



GGUAGCCU CUGAUGAG X CGAA ICUGUGAU 



ACAGCUGC C CUACCUAA 



UUAGGUAG CUGAUGAG X CGAA ICAGCUGU 



CAGCUGCC A UACCUAAA 



UUUAGGUA CUGAUGAG X CGAA IGCAGCUG 



GCCAAGGC U 
AAGGCUAC C AGAAGACA 
AGGCUACC U GAAGACAG 
AAGAAGAC A UCUCAUAU 
CAGUUAUC U UUUGGCUG 
GUUAUCUC A UGGCUGCC 
UAUUUGGC U GCUUUUUA 
UUGGCUGC C UUUUAUCU 
UGGCUGCC A UUUAUCUU 
CUGCCAGC U AUCUUUCU 
UUUUUAUC U CUCGACCA 
UAUCUUUC U ACCACUUA 
UCUUUCUC U CACUUAAA 
CUCUCGAC C AAAACUUC 
UCUCGACC A AAACUUCA 
UCGACCAC U ACUUCAGA 
CUUAAAAC U ACUUCCUG 
AAAACUUC A UCCUGUCC 
CUUCAGAC U GUCCUGCU 
CAGACUUC C CUGCUGGU 
AGACUUCC U UGCUGGUA 
UUCCUGUC C GGUAUCAU 



CUUCUUUU CUGAUGAG X CGAA ICCUUGGC 

UGUCUUCU CUGAUGAG X CGAA IUAGCCUU 

CUGUCUUC CUGAUGAG X CGAA IGUAGCCU 

AUAUGAGA CUGAUGAG X CGAA IUCUUCUU 

CAGCCAAA CUGAUGAG X CGAA IAUAACUG 

GGCAGCCA CUGAUGAG X CGAA IAGAUAAC 

UAAAAAGC CUGAUGAG X CGAA ICCAAAUA 

AGAUAAAA CUGAUGAG X CGAA ICAGCCAA 

AAGAUAAA CUGAUGAG X CGAA IGCAGCCA 

AGAAAGAU CUGAUGAG X CGAA ICUGGCAG 

UGGUCGAG CUGAUGAG X CGAA IAUAAAAA 

UAAGUGGU CUGAUGAG X CGAA I AAAGAUA 

UUUAAGUG CUGAUGAG X CGAA I AGAAAGA 

GAAGUUUU CUGAUGAG X CGAA IUCGAGAG 

UGAAGUUU CUGAUGAG X CGAA IGUCGAGA 

UCUGAAGU CUGAUGAG X CGAA IUGGUCGA 

CAGGAAGU CUGAUGAG X CGAA IUUUUAAG 

GGACAGGA CUGAUGAG X CGAA IAAGUUUU 

AGCAGGAC CUGAUGAG X CGAA IUCUGAAG 

ACCAGCAG CUGAUGAG X CGAA IAAGUCUG 

UACCAGCA CUGAUGAG X CGAA IGAAGUCU 

AUGAUACC CUGAUGAG X CGAA I ACAGGAA 
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UCCUGUCC U GUAUCAUG 



CAUGAUAC CUGAUGAG X CGAA IGACAGGA 



UGUCCUGC U UCAUGGAG 



CUCCAUGA CUGAUGAG X CGAA ICAGGACA 



CUGGUAUC A GAAAGUCC 



GGACUUUC CUGAUGAG X CGAA IAUACCAG 



AGAAAGUC C CCUCACUC 



GAGUGAGG CUGAUGAG X CGAA IACUUUCU 



GAAAGUCC A CUCACUCG 



CGAGUGAG CUGAUGAG X CGAA IGACUUUC 



UCCAAUAC C UCGCUCAG 



CUGAGCGA CUGAUGAG X C 



CCAAUACC U CGCUCAGC 



GCUGAGCG CUGAUGAG X CGAA IGUAUUGG 



AAUACCUC A CUCAGCUA 



UAGCUGAG CUGAUGAG X CGAA IAGGUAUU 



UACCUCAC U CAGCUAUA 



UAUAGCUG CUGAUGAG X CGAA IUGAGGUA 



UCACUCGC U UAUAAGAA 



UUCUUAUA CUGAUGAG X CGAA I CGAGUGA 



ACUCGCUC A UAAGAAGA 



UCUUCUUA CUGAUGAG X CGAA IAGCGAGU 



CGCUCAGC U GAAGAGCC 



GGCUCUUC CUGAUGAG X CGAA ICUGAGCG 



AGAAGAGC C CCAUUGAA 



UUCAAUGG CUGAUGAG X CGAA ICUCUUCU 



GAAGAGCC U CAUUGAAA 



UUUCAAUG CUGAUGAG X CGAA IGCUCUUC 



AGAGCCUC A UUGAAAUG 



CAUUUCAA CUGAUGAG X CGAA IAGGCUCU 



GCCUCAAC C AAAUGCCU 



AGGCAUUU CUGAUGAG X CGAA IUUGAGGC 



CCUCAACC A AAUGCCUC 



GAGGCAUU CUGAUGAG X CGAA I GUUGAGG 



UGAAAUGC C CAAGCACG 



CGUGCUUG CUGAUGAG X CGAA ICAUUUCA 



GAAAUGCC U AAGCACGU 



ACGUGCUU CUGAUGAG X CGAA IGCAUUUC 



AAUGCCUC A GCACGUCA 



UGACGUGC CUGAUGAG X CGAA IAGGCAUU 



GCCUCAAC A CGUCAAAA 



UUUUGACG CUGAUGAG X CGAA IUUGAGGC 



CAACAAGC A AAAAGCUA 



UAGCUUUU CUGAUGAG X CGAA ICUUGUUG 



AGCACGUC A CUACAGAA 



UUCUGUAG CUGAUGAG X CGAA IACGUGCU 



UCAAAAGC U AAUCUAUU 



AAUAGAUU CUGAUGAG X CGAA ICUUUUGA 



l CUAUUUAU 



AUAAAUAG CUGAUGAG X CGAA IUAGCUUU 



ACAGAAUC U AUCAAUUU 



AAAUUGAU CUGAUGAG X CGAA IAUUCUGU 



UAUUUAUC A CUGUCUCA 



UGAGACAG CUGAUGAG X CGAA IAUAAAUA 



UCAAUUUC U CAUCUUAA 



UUAAGAUG CUGAUGAG X CGAA IAAAUUGA 



UUUCUGUC U UUAAUAUG 



CAUAUUAA CUGAUGAG X CGAA IACAGAAA 



UCUGUCUC A AAUAUGUC 



GACAUAUU CUGAUGAG X CGAA IAGACAGA 



GUCUCAUC U AUGUCUCU 



AGAGACAU CUGAUGAG X CGAA IAUGAGAC 



AAUAUGUC U CUGAUCUG 



CAGAUCAG CUGAUGAG X CGAA IACAUAUU 



UAUGUCUC U GAUCUGUA 



UACAGAUC CUGAUGAG X CGAA IAGACAUA 



UCUCUUGC U UGUAUCAU 



AUGAUACA CUGAUGAG X CGAA ICAAGAGA 



UGCUGAUC U CAUCGUGA 



UCACGAUG CUGAUGAG X CGAA IAUCAGCA 



UCUGUAUC A GAUGCUUC 



GAAGCAUC CUGAUGAG X CGAA I AUACAG A 



CGUGAUGC U UGAAGUUC 



GAACUUCA CUGAUGAG X CGAA ICAUCACG 



GAUGCUUC U AGUUCUGC 



GCAGAACU CUGAUGAG X CGAA IAAGCAUC 



UGCUUCUC U UUCUGCUA 



UAGCAGAA CUGAUGAG X CGAA IAGAAGCA 



UGAAGUUC U CAACCUCU 



AGAGGUUG CUGAUGAG X CGAA IAACUUCA 



AGUUCUGC U CCUCUAGA 



UCUAGAGG CUGAUGAG X CGAA ICAGAACU 



UCUGCUAC A CUAGAUCU 



AGAUCUAG CUGAUGAG X CGAA I UAGCAGA 



GCUACAAC C GAUCUGCA 



UGCAGAUC CUGAUGAG X CGAA IUUGUAGC 



CUACAACC U AUCUGCAG 



CUGCAGAU CUGAUGAG X CGAA IGUUGUAG 



ACAACCUC U CUGCAGCU 



AGCUGCAG CUGAUGAG X CGAA IAGGUUGU 



UCUAGAUC U CUUGCCAC 



GUGGCAAG CUGAUGAG X CGAA IAUCUAGA 



AGAUCUGC A GCCACAUC 



GAUGUGGC CUGAUGAG X CGAA ICAGAUCU 
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UCUGCAGC U ACAUCAGC 



GCUGAUGU CUGAUGAG X CGAA ICUGCAGA 



CAGCUUGC C CAGCUUAA 



UUAAGCUG CUGAUGAG X CGAA ICAAGCUG 



AGCUUGCC A AGCUUAAA 



UUUAAGCU CUGAUGAG X CGAA IGCAAGCU 



CUUGCCAC A CUUAAAAU 



AUUUUAAG CUGAUGAG X CGAA I UGGCAAG 



GCCACAUC A AAAAUCUG 



CAGAUUUU CUGAUGAG X CGAA IAUGUGGC 



ACAUCAGC U AUCUGUCA 



UGACAGAU CUGAUGAG X CGAA ICUGAUGU 



UUAAAAUC U UCCCAUGC 



GCAUGGGA CUGAUGAG X ( 



AAUCUGUC A AUGCAGAC 



GUCUGCAU CUGAUGAG X CGAA IACAGAUU 



CUGUCAUC C CAGACAGG 



CCUGUCUG CUGAUGAG X CGAA IAUGACAG 



UGUCAUCC C AGACAGGA 



UCCUGUCU CUGAUGAG X CGAA I GAUGACA 



,VCCC A GACAGGAA 



UUCCUGUC CUGAUGAG X CGAA I GGAUGAC 



UGUUUUCC CUGAUGAG X CGAA I CAUGGGA 



AUGCAGAC A AACAAUAU 



AUAUUGUU CUGAUGAG X CGAA IUCUGCAU 



AGGAAAAC A UGUAUAAC 



GUUAUACA CUGAUGAG X CGAA IUUUUCCU 



UGUAUAAC A ACUUCCUG 



CAGGAAGU CUGAUGAG X CGAA IUUAUACA 



UAACAGAC C CCUGAGUA 



UACUCAGG CUGAUGAG X CGAA IUCUGUUA 



AACAGACC A CUGAGUAG 



CUACUCAG CUGAUGAG X CGAA IGUCUGUU 



CAGACCAC U GAGUAGAA 



UUCUACUC CUGAUGAG X CGAA IUGGUCUG 



ACCACUUC C UAGAAGAG 



CUCUUCUA CUGAUGAG X CGAA IAAGUGGU 



CCACUUCC U AGAAGAGU 



ACUCUUCU CUGAUGAG X CGAA IGAAGUGG 



AGAGUUUC U GAAAAGGU 



ACCUUUUC CUGAUGAG X CGAA IAAACUCU 



AAAAGGUC A UAAGACUA 



UAGUCUUA CUGAUGAG X CGAA IACCUUUU 



AUUAAGAC U CUUAUUGU 



ACAAUAAG CUGAUGAG X CGAA IUCUUAAU 



ACUAAAAC U GUUACCAU 



AUGGUAAC CUGAUGAG X CGAA IUUUUAGU 



AUUGUUAC C GUAUUCAU 



AUGAAUAC CUGAUGAG X CGAA IUAACAAU 



UUGUUACC A UAUUCAUC 



GAUGAAUA CUGAUGAG X CGAA IGUAACAA 



AUGUAUUC A UUGGAUCU 



AGAUCCAA CUGAUGAG X CGAA IAAUACAU 



UAUUCAUC U GAUCUUGU 



ACAAGAUC CUGAUGAG X CGAA IAUGAAUA 



GUUGGAUC U AACAUGAA 



UUCAUGUU CUGAUGAG X CGAA IAUCCA 



UUGUAAAC A AAGGGCUU 



AAGCCCUU CUGAUGAG X CGAA IUUUACAA 



AAAAGGGC U UUUCAAAA 



UUUUGAAA CUGAUGAG X CGAA ICCCUUUU 



UUAUUUUC A UUAACUUC 



GAAGUUAA CUGAUGAG X CGAA IAAAAUAA 



AAAUUAAC U AAUAAGUG 



CACUUAUU CUGAUGAG X CGAA IUUAAUUU 



UUAACUUC A AAGUGUAU 



AUACACUU CUGAUGAG X CGAA IAAGUUAA 



UAAAAUGC A UUGAUUUC 



GAAAUCAA CUGAUGAG X CGAA ICAUUUUA 



AAUGCAAC U AUUUCCUC 



GAGGAAAU CUGAUGAG X CGAA IUUGCAUU 



UUGAUUUC C CAUGGCUC 



GAGCCAUG CUGAUGAG X CGAA IAAAUCAA 



UGAUUUCC U AUGGCUCA 



UGAGCCAU CUGAUGAG X CGAA IGAAAUCA 



AUUUCCUC A GGCUCACA 



UGUGAGCC CUGAUGAG X CGAA IAGGAAAU 



UCCUCAAC A UCACAAAU 



AUUUGUGA CUGAUGAG X CGAA IUUGAGGA 



AACAUGGC U AAUUUCUA 



UAGAAAUU CUGAUGAG X CGAA ICCAUGUU 



CAUGGCUC A UUUCUAUC 



GAUAGAAA CUGAUGAG X CGAA IAGCCAUG 



UGGCUCAC A UCUAUCCC 



GGGAUAGA CUGAUGAG X CGAA IUGAGCCA 



CAAAUUUC U CAAAUCUU 



AAGAUUUG CUGAUGAG X CGAA IAAAUUUG 



UUUCUAUC C UCUUUUCU 



AGAAAAGA CUGAUGAG X CGAA IAUAGAAA 



UUCUAUCC C CUUUUCUG 



CAGAAAAG CUGAUGAG X CGAA IGAUAGAA 



UCUAUCCC A UUUUCUGA 



UCAGAAAA CUGAUGAG X CGAA IGGAUAGA 
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632 
659 
662 
664 
657 
668 
671 
677 
679 
682 
693 



755 
767 
772 



877 
881 



CCCAAAUC U UGAAGAUG 
AUCUUUUC U AUGAAGAG 
UUUAAAAC U UGCCAACA 
AAAACUGC A CAACAAGU 
AACUGCAC U ACAAGUUC 
UGCACUGC C AGUUCACU 



CUGCCAAC A CACUUCAU 
ACAAGUUC A AUAUAUAA 
AAGUUCAC U AUAUAAAG 
UUCACUUC A UAAAGCAU 
UAUAAAGC A UUUUACUC 
AUUUUUAC U UGAGGUGA 
UUUUACUC U AGGUGAAU 
UAUAUUAC A AAAAGCUU 
GUAAAAGC U UAAUACUA 



J UACUAAGU 
UUUAAUAC U AUUUUUCA 
UAUUUUUC A UUCACCAA 
UUCAGGUC U CAAGUAUC 
AGGUCUUC A GUAUCAAA 
GUCUUCAC C AUCAAAGU 
UCUUCACC A UCAAAGUA 
CAAGUAUC A AAUAACAC 
GUAAUAAC A UGAAGUGU 
AAUAACAC A AAGUGUCA 
GAAGUGUC A UCAAAAUA 
CAUUAUUC A AGUCCACU 
AAAUAGUC C ACUCCUCA 
AAUAGUCC A CUCCUCAC 
UAGUCCAC U CCUCACAU 
CCACUGAC U ACAUCUGU 
ACUGACUC C AUCUGUUA 
CUGACUCC U UCUGUUAU 
GACUCCUC A UGUUAUCU 
CUCCUCAC A UUAUCUUA 
CUCACAUC U UCUUAUUA 
CUGUUAUC U AUAAAGAA 
UAAAGAAC U GUAGUAAC 
GUAGUAAC U GAAUCUAC 
UAACUAUC A CUACAUUC 
UCAGAAUC U UCUAAAAC 
GAAUCUAC A AAAACAGA 
CUACAUUC U CAGAAAUU 
UCUAAAAC A UUGUAUUU 
AUUUUUUC U CACAUUAA 
UUCUAUGC C UAACAUCU 



CAUCUUCA CUGAUGAG X CGAA IAUUUGGG 

CUCUUCAU CUGAUGAG X CGAA IAAAAGAU 

UGUUGGCA CUGAUGAG X CGAA IUUUUAAA 

ACUUGUUG CUGAUGAG X CGAA ICAGUUUU 

GAACUUGU CUGAUGAG X CGAA IUGCAGUU 

AGUGAACU CUGAUGAG X CGAA ICAGUGCA 

AAGUGAAC CUGAUGAG X CGAA IGCAGUGC 

AUGAAGUG CUGAUGAG X CGAA IUUGGCAG 

UUAUAUAU CUGAUGAG X CGAA IAACUUGU 

CUUUAUAU CUGAUGAG X CGAA IUGAACUU 

AUGCUUUA CUGAUGAG X CGAA IAAGUGAA 

GAGUAAAA CUGAUGAG X CGAA ICUUUAUA 

UCACCUCA CUGAUGAG X CGAA IUAAAAAU 

AUUCACCU CUGAUGAG X CGAA IAGUAAAA 

AAGCUUUU CUGAUGAG X CGAA IUAAUAUA 

UAGUAUUA CUGAUGAG X CGAA ICUUUUAC 

ACUUAGUA CUGAUGAG X CGAA IAAGCUUU 

UGAAAAAU CUGAUGAG X CGAA IUAUUAAA 

UUGGUGAA CUGAUGAG X CGAA IAAAAAUA 

GAUACUUG CUGAUGAG X CGAA IACCUGAA 

UUUGAUAC CUGAUGAG X CGAA IAAGACCU 

ACUUUGAU CUGAUGAG X CGAA IUGAAGAC 

UACUUUGA CUGAUGAG X CGAA I GUGAAGA 

GUGUUAUU CUGAUGAG X CGAA IAUACUUG 

ACACUUCA CUGAUGAG X CGAA IUUAUUAC . 

UGACACUU CUGAUGAG X CGAA IUGUUAUU 

UAUUUUGA CUGAUGAG X CGAA IACACUUC 

AGUGGACU CUGAUGAG X CGAA IAAUAAUG 

UGAGGAGU CUGAUGAG X CGAA IACUAUUU 

GUGAGGAG CUGAUGAG X CGAA IGACUAUU 

AUGUGAGG CUGAUGAG X CGAA IUGGACUA 

ACAGAUGU CUGAUGAG X CGAA IUCAGUGG 

UAACAGAU CUGAUGAG X CGAA IAGUCAGU 

AUAACAGA CUGAUGAG X CGAA I GAGUCAG 

AGAUAACA CUGAUGAG X CGAA IAGGAGUC 

UAAGAUAA CUGAUGAG X CGAA IUGAGGAG 

UAAUAAGA CUGAUGAG X CGAA IAUGUGAG 

UUCUUUAU CUGAUGAG X CGAA IAUAACAG 

GUUACUAC CUGAUGAG X CGAA IUUCUUUA 

GUAGAUUC CUGAUGAG X CGAA IUUACUAC 

GAAUGUAG CUGAUGAG X CGAA IAUAGUUA 

GUUUUAGA CUGAUGAG X CGAA IAUUCUGA 

UCUGUUUU CUGAUGAG X CGAA IUAGAUUC 

AAUUUCUG CUGAUGAG X CGAA IAAUGUAG 

AAAUACAA CUGAUGAG X CGAA IUUUUAGA 

UUAAUGUG CUGAUGAG X CGAA IAAAAAAU 

AGAUGUUA CUGAUGAG X CGAA ICAUAGAA 
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1125 
1127 
1135 
1139 
1142 
1148 
1153 



1181 
1186 
1195 



1205 
1209 
1215 
1224 
1231 



UCUAUGCC A AACAUCUU 
UAUGCCAC A CAUCUUUU 
ACAUUAAC A UUAAAGUU 
UUAACAUC U AAGUUGAU 
UGAGAAUC A UGGAAAAG 
AGUAAGGC C UCUUACAU 
GUAAGGCC A CUUACAUA 
GGCCAUAC U CAUAAUAA 
CCAUACUC U UAAUAAAA 
ACUCUUAC A AAAAUUCC 
UAAAAUUC C AAGUAAUU 
AAAAUUCC U AGUAAUUU 
AUUUUUUC A AUCACAGA 
AAAGAAUC A AUUCUAGU 
AGAAUCAC A UCUAGUAC 
CAGAAUUC U CAUGUAGG 
UCUAGUAC A GGUAAAUC 
GGUAAAUC A UCUGUUCU 
CAUAAAUC U UAAGACAU 
AUCUGUUC U CAUAUGAU 
UCUAAGAC A AUCAACAG 
AUAUGAUC A AUGAGAAC 
UGAUCAAC A AGAACUGG 
AUGAGAAC U GUUAAUAU 
UAUGUGAC A GAUUAGUC 
GAUUAGUC A ACUAAUAU 
GUCAUAUC A UAUACUAA 
CAUAUCAC U UACUAACA 



AUACUAAC A AAUCUAAU 
CUAACAAC A CUAAUCUU 
ACAGAAUC U UUCAUUUA 
AUCUAAUC U UUAAGGCA 
UAAUCUUC A AGGCACUG 
UUUAAGGC A AGUGAAUU 
UAAGGCAC U UGAAUUAU 
GAAUUAUC U UAGAGUUA 
AUCUGAGC U UUACCUAG 
AGAGUUAC C UUACCAUA 
GAGUUACC U UACCAUAC 
UACCUAGC U AUACUAUA 
UAGCUUAC C UAUAUCUU 
AGCUUACC A AUAUCUUU 
UACCAUAC U CUUUGGAA 
ACUAUAUC U AAUCAUGA 
UUGGAAUC A ACCUUAAG 
CAUGAAAC C GACUUCAG 



AAGAUGUU CUGAUGAG X CGAA IGCAUAGA 
AAAAGAUG CUGAUGAG X CGAA IUGGCAUA 
AACUUUAA CUGAUGAG X CGAA IUUAAUGU 
AUCAACUU CUGAUGAG X CGAA IAUGUUAA 
CUUUUCCA CUGAUGAG X CGAA IAUUCUCA 
AUGUAAGA CUGAUGAG X CGAA ICCUUACU 
UAUGUAAG CUGAUGAG X CGAA IGCCUUAC 
UUAUUAUG CUGAUGAG X CGAA IUAUGGCC 
UUUUAUUA CUGAUGAG X CGAA IAGUAUGG 
GGAAUUUU CUGAUGAG X CGAA IUAAGAGU 
AAUUACUU CUGAUGAG X CGAA IAAUUUUA 
AAAUUACU CUGAUGAG X CGAA IGAAUUUU 
UCUGUGAU CUGAUGAG X CGAA IAAAAAAU 
ACUAGAAU CUGAUGAG X CGAA IAUUCUUU 
GUACUAGA CUGAUGAG X CGAA IUGAUUCU 
CCUACAUG CUGAUGAG X CGAA IAAUUCUG 
GAUUUACC CUGAUGAG X CGAA IUACUAGA 
AGAACAGA CUGAUGAG X CGAA IAUUUACC 
AUGUCUUA CUGAUGAG X CGAA IAUUUAUG 
AUCAUAUG CUGAUGAG X CGAA IAACAGAU 
CUGUUGAU CUGAUGAG X CGAA IUCUUAGA 
GUUCUCAU CUGAUGAG X CGAA IAUCAUAU 
CCAGUUCU CUGAUGAG X CGAA IUUGAUCA 
AUAUUAAC CUGAUGAG X CGAA IUUCUCAU 
GACUAAUC CUGAUGAG X CGAA IUCACAUA 
AUAUUAGU CUGAUGAG X CGAA IACUAAUC 
UUAGUAUA CUGAUGAG X CGAA IAUAUGAC 
UGUUAGUA CUGAUGAG X CGAA IUGAUAUG 
GAUUCUGU CUGAUGAG X CGAA IUAUAUUA 
AUUAGAUU CUGAUGAG X CGAA IUUAGUAU 
AAGAUUAG CUGAUGAG X CGAA IUUGUUAG 
UAAAUGAA CUGAUGAG X CGAA IAUUCUGU 
UGCCUUAA CUGAUGAG X CGAA IAUUAGAU 
CAGUGCCU CUGAUGAG X CGAA IAAGAUUA 
AAUUCACU CUGAUGAG X CGAA ICCUUAAA 
AUAAUUCA CUGAUGAG X CGAA IUGCCUUA 
UAACUCUA CUGAUGAG X CGAA IAUAAUUC 
CUAGGUAA CUGAUGAG X CGAA ICUCAGAU 
UAUGGUAA CUGAUGAG X CGAA IUAACUCU 
GUAUGGUA CUGAUGAG X CGAA IGUAACUC 
UAUAGUAU CUGAUGAG X CGAA ICUAGGUA 
AAGAUAUA CUGAUGAG X CGAA IUAAGCUA 
AAAGAUAU CUGAUGAG X CGAA IGUAAGCU 
UUCCAAAG CUGAUGAG X CGAA IUAUGGUA 
UCAUGAUU CUGAUGAG X CGAA IAUAUAGU 
CUUAAGGU CUGAUGAG X CGAA IAUUCCAA 
CUGAAGUC CUGAUGAG X CGAA IUUUCAUG 
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1232 
1239 
1242 
1255 
1263 
12S6 
1267 
1271 
1272 
1275 
1276 
1280 
1283 
1284 
1289 



1319 

1335 

1337 

1364 

1366 

1368 

1370 

1371 

1381 

1410 

1418 

1419 " 

1420 

1423 

1429 

1432 

1435 

1438 

1442 

1443 

1450 
1467 
1471 
1483 



AUGAAACC U ACUUCAGA 
CUUAAGAC U AAUGAUUU 
AAGACUUC A GAUUUUGC 
GAUUUUGC A GUCUUCCA 
AGGUUGUC U UUCCAGCC 
UUGUCUUC C CAGCCUAA 
UGUCUUCC A AGCCUAAC 
UUCCAUUC C UAACAUCC 
UCCAUUCC A AACAUCCA 
AUUCCAGC C AUCCAAUG 
UUCCAGCC U UCCAAUGC 
AGCCUAAC A AUGCAGGC 
CUAACAUC C CAGGCAAG 
UAACAUCC A AGGCAAGG 
UCCAAUGC A AGGAAAAU 
AUGCAGGC A AAAUAAAA 
AAGAUUUC C ACAGAAAA 
AGAUUUCC A CAGAAAAA 
CCAGUGAC A AAUAUAUU 
AUAUUAUC U UAUUUUUU 
AUUAUCUC A UUUUUUAA 
AUGAAUUC U CCAAAUAU 
GAAUUCUC U AAAUAUUA 
AUUCUCUC U AUAUUAAC 
UCUCUCUC C AUUAACUA 
CUCUCUCC A UUAACUAA 
AUAUUAAC U AUUAGAUU 
AAAUGAAC U GGCCCAUC 
UUGUUGGC C UAUUACAU 
UGUUGGCC C AUUACAUC 
GUUGGCCC A UUACAUCU 
GGCCCAUC U CAUCUACA 
UCUAUUAC A CAGCUGAC 
AUUACAUC U CUGACCCU 
ACAUCUAC A ACCCUUGA 
UCUACAGC U CUUGAACA 
CAGCUGAC C AACAUGGG 
AGCUGACC C ACAUGGGG 
GCUGACCC U CAUGGGGG 
CCUUGAAC A GGUUAGGG 
AGGGGAGC U AUUCGUGG 
GAGCUGAC A GUGGGUCC 
CGUGGGUC C AAUCUUAA 
GGGUCCGC A CUUAACUA 
GCAAAAUC U UACCUAAU 
AUCUUAAC U AAUAGCCU 
UUAACUAC C AGCCUACU 



UCUGAAGU CUGAUGAG X CGAA IGUUUCAU 
AAAUCAUU CUGAUGAG X CGAA IUCUUAAG 
GCAAAAUC CUGAUGAG X CGAA IAAGUCUU 
UGGAAGAC CUGAUGAG X CGAA ICAAAAUC 
GGCUGGAA CUGAUGAG X CGAA IACAACCU 
UUAGGCUG CUGAUGAG X CGAA IAAGACAA 
GUUAGGCU CUGAUGAG X CGAA I GAAGACA 
GGAUGUUA CUGAUGAG X CGAA IAAUGGAA 
UGGAUGUU CUGAUGAG X CGAA I GAAUGGA 
CAUUGGAU CUGAUGAG X CGAA ICUGGAAU 
GCAUUGGA CUGAUGAG X CGAA I GCUGGAA 
GCCUGCAU CUGAUGAG X CGAA IUUAGGCU 
CUUGCCUG CUGAUGAG X CGAA IAUGUUAG 
CCUUGCCU CUGAUGAG X CGAA IGAUGUUA 
AUUUUCCU CUGAUGAG X CGAA ICAUUGGA 
UUUUAUUU CUGAUGAG X CGAA ICCUGCAU 
UUUUCUGU CUGAUGAG X CGAA IAAAUCUU 
UUUUUCUG CUGAUGAG X CGAA IGAAAUCU 
AAUAUAUU CUGAUGAG X CGAA IUCACUGG 
AAAAAAUA CUGAUGAG X CGAA IAUAAUAU 
UUAAAAAA CUGAUGAG X CGAA IAGAUAAU 
AUAUUUGG CUGAUGAG X CGAA IAAUUCAU 
UAAUAUUU CUGAUGAG X CGAA IAGAAUUC 
GUUAAUAU CUGAUGAG X CGAA I AGAGAAU 
UAGUUAAU CUGAUGAG X CGAA IAGAGAGA 
UUAGUUAA CUGAUGAG X CGAA I GAGAGAG 
AAUCUAAU CUGAUGAG X CGAA IUUAAUAU 
GAUGGGCC CUGAUGAG X CGAA IUUCAUUU 
AUGUAAUA CUGAUGAG X CGAA ICCAACAA 
GAUGUAAU CUGAUGAG X CGAA IGCCAACA 
AGAUGUAA CUGAUGAG X CGAA IGGCCAAC 
UGUAGAUG CUGAUGAG X CGAA IAUGGGCC 
GUCAGCUG CUGAUGAG X CGAA IUAAUAGA 
AGGGUCAG CUGAUGAG X CGAA IAUGUAAU 
UCAAGGGU CUGAUGAG X CGAA IUAGAUGU 
UGUUCAAG CUGAUGAG X CGAA ICUGUAGA 
CCCAUGUU CUGAUGAG X CGAA IUCAGCUG 
CCCCAUGU CUGAUGAG X CGAA IGUCAGCU 
CCCCCAUG CUGAUGAG X CGAA IGGUCAGC 
CCCUAACC CUGAUGAG X CGAA IUUCAAGG 
CCACGAAU CUGAUGAG X CGAA ICUCCCCU 
GGACCCAC CUGAUGAG X CGAA IUCAGCUC 
UUAAGAUU CUGAUGAG X CGAA IACCCACG 
UAGUUAAG CUGAUGAG X CGAA ICGGACCC 
AUUAGGUA CUGAUGAG X CGAA IAUUUUGC 
AGGCUAUU CUGAUGAG X CGAA IUUAAGAU 
AGUAGGCU CUGAUGAG X CGAA IUAGUUAA 
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UAACUACC U GCCUACUA 



UAGUAGGC CUGAUGAG X CGAA IGUAGUUA 



CUAAUAGC C AUUGACCA 



UGGUCAAU CUGAUGAG X CGAA ICUAUUAG 



UAAUAGCC U UUGACCAU 



AUGGUCAA CUGAUGAG X CGAA IGCUAUUA 



UAGCCUAC U ACCAUAAA 



UUUAUGGU CUGAUGAG X CGAA IUAGGCUA 



CUAUUGAC C ACCUUACU 



AGUAAGGU CUGAUGAG X CGAA IUCAAUAG 



UAUUGACC A CCUUACUG 



CAGUAAGG CUGAUGAG X CGAA IGUCAAUA 



CCAUAAAC C UGAUAACA 



UGUUAUCA CUGAUGAG X CGAA IUUUAUGG 



CAUAAACC U GAUAACAU 



AUGUUAUC CUGAUGAG X CGAA IGUUUAUG 



AACCUUAC U ACAUAAAC 



GUUUAUGU CUGAUGAG X CGAA IUAAGGUU 



CUGAUAAC A CAGUAAAU 



AUUUACUG CUGAUGAG X CGAA IUUAUCAG 



ACAUAAAC A AUUAACAC 



GUGUUAAU CUGAUGAG X CGAA IUUUAUGU 



AAAUUAAC A UUUUGCGU 



ACGCAAAA CUGAUGAG X CGAA IUUAAUUU 



AUUAACAC A UUGCGUGU 



ACACGCAA CUGAUGAG X CGAA IUGUUAAU 



UAUUAUAC A AUUCCUAC 



GUAGGAAU CUGAUGAG X CGAA IUAUAAUA 



UUAUACAC U UCCUACAA 



UUGUAGGA CUGAUGAG X CGAA IUGUAUAA 



CUAUAUUC C AUAAAGUA 



UACUUUAU CUGAUGAG X CGAA IAAUAUAG 



UAUAUUCC U UAAAGUAA 



UUACUUUA CUGAUGAG X CGAA IGAAUAUA 



AUUCCUAC A AGUAAGCU 



AGCUUACU CUGAUGAG X CGAA IUAGGAAU 



AAGUAAGC U AAAAUGUU 



AACAUUUU CUGAUGAG X CGAA ICUUACUU 



Input Sequence = PLN. Cut Site = CH/. 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II) 
PLN (Homo sapiens phospholamban (PLN) mRIMA.; 1635 bp) 
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GAUCAUCA UGAUG GCAUGCACUAUGC GCG AGUAUAGA 
GUGAUCAU UGAUG GCAUGCACUAUGC GCG ACAGUAUA 
GCUGUGAU UGAUG GCAUGCACUAUGC GCG AUCACAGU 
AGCCUUGG UGAUG GCAUGCACUAUGC GCG AGCUGUGA 
AAAGCUGG UGAUG GCAUGCACUAUGC GCG AGCCAAAU 
UAAGUGGU UGAUG GCAUGCACUAUGC GCG GAGAGAAA 
CAGCAGGA UGAUG GCAUGCACUAUGC GCG AGGAAGUC 
GAUACCAG UGAUG GCAUGCACUAUGC GCG AGGACAGG 
UAGCUGAG UGAUG GCAUGCACUAUGC GCG GAGUGAGG 
AGGCAUUU UGAUG GCAUGCACUAUGC GCG AAUGGUUG 
UGUUGAGG UGAUG GCAUGCACUAUGC GCG AUUUCAAU 
AGAUGAGA UGAUG GCAUGCACUAUGC GCG AGAAAUUG 
GCAAGAGA UGAUG GCAUGCACUAUGC GCG AUAUUAAG 
CAGAUCAG UGAUG GCAUGCACUAUGC GCG AAGAGACA 


AUACAGAU UGAUG GCAUGCACUAUGC GCG AGCAAGAG 
CGAUGAUA UGAUG GCAUGCACUAUGC GCG AGAUCAGC 
AGAAGCAU UGAUG GCAUGCACUAUGC GCG ACGAUGAU 
CAGAGAAG UGAUG GCAUGCACUAUGC GCG AUCACGAU 
CAGAACUU UGAUG GCAUGCACUAUGC GCG AGAGAAGC 
GGUUGUAG UGAUG GCAUGCACUAUGC GCG AGAACUUC 
GCAAGCUG UGAUG GCAUGCACUAUGC GCG AGAUCUAG 
UGAUGUGG UGAUG GCAUGCACUAUGC GCG AAGCUGCA 
UGGGAUGA UGAUG GCAUGCACUAUGC GCG AGAUUUUA 
CCUGUCUG UGAUG GCAUGCACUAUGC GCG AUGGGAUG 
CUGUUAUA UGAUG GCAUGCACUAUGC GCG AAUAUUGU 
CUUCUACU UGAUG GCAUGCACUAUGC GCG AGGAAGUG 
CCUUUUCA UGAUG GCAUGCACUAUGC GCG AAAGAAAC 
GACCUUUU UGAUG GCAUGCACUAUGC GCG ACAAAGAA 
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1 


UCUAUACU G UGAUGAUC 
UAUACUGU G AUGAUCAC 
ACUGUGAU G AUCACAGC 
UCACAGCU G CCAAGGCU 
AUUUGGCU G CCAGCUUU 
UUUCUCUC G ACCACUUA 
GACUUCCU G UCCUGCUG 
CCUGUCCU G CUGGUAUC 
CCUCACUC G CUCAGCUA 
CAACCAUU G AAAUGCCU 
AUUGAAAU G CCUCAACA 
CAAUUUCU G UCUCAUCU 
CUUAAUAU G UCUCUUGC 
UGUCUCUU G CUGAUCUG 


CUCUUGCU G AUCUGUAU 
GCUGAUCU G UAUCAUCG 
AUCAUCGU G AUGCUUCU 
AUCGUGAU G CUUCUCUG 
GCUUCUCU G AAGUUCUG 
GAAGUUCU G CUACAACC 
CUAGAUCU G CAGCUUGC 
UGCAGCUU G CCACAUCA 
UAAAAUCU G UCAUCCCA 
CAUCCCAU G CAGACAGG 
ACAAUAUU G UAUAACAG 
CACUUCCU G AGUAGAAG 
GUUUCUUU G UGAAAAGG 
UUCUUUGU G AAAAGGUC 


1 
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Table 28 



Table 28: Human Phospholamban (PLN) DNAzyme and Target Sequence 



Pos 


Substrate 


Seq ID 


Ribozyme 


Rz Seq ID 


44 


GACUUCAU A CAACACAA 


6 


TTGTGTTG GGCTAGCTACAACGA ATGAAGTC 


2067 


54 


AACACAAU A CUCUAUAC 


. 7 


GTATAGAG GGCTAGCTACAACGA ATTGTGTT 


2068 


59 


AAUACUCU A UACUGUGA 


9 


TCACAGTA GGCTAGCTACAACGA AGAGTATT 


2069 


61 


UACUCUAU A CUGUGAUG 


10 


CATCACAG GGCTAGCTACAACGA ATAGAGTA 


2070 


88 


CCAAGGCU A CCUAAAAG 


12 


CTTTTAGG GGCTAGCTACAACGA AGCCTTGG 


2071 


106 


AGACAGUU A UCUCAUAU 


15 


" ATATGAGA GGCTAGCTACAACGA AACTGTCT 


2072 


113 


UAUCUCAU A UUUGGCUG 


18 


CAGCCAAA GGCTAGCTACAACGA ATGAGATA 


2073 


132 


AGCUUUUU A UCUUUCUC 


25 


GAGAAAGA GGCTAGCTACAACGA AAAAAGCT 


2074 


179 


CUGCUGGU A UCAUGGAG 


39 


CTCCATGA GGCTAGCTACAACGA ACCAGCAG 


2075 


198 


AGUCCAAU A CCUCACUC 


42 


GAGTGAGG GGCTAGCTACAACGA ATTGGACT 


2076 


215 


GCUCAGCU A UAAGAAGA 


46 


TCTTCTTA GGCTAGCTACAACGA AGCTGAGC 


2077 


265 


CAAAAGCU A CAGAAUCU 


52 


AGATTCTG GGCTAGCTACAACGA AGCTTTTG 


2078 


274 


CAGAAUCU A UUUAUCAA 


54 


TTGATAAA GGCTAGCTACAACGA AGATTCTG 


2079 


278 


AUCUAUUU A UCAAUUUC 


57 


GAAATTGA GGCTAGCTACAACGA AAATAGAT 


2080 


301 


AUCUUAAU A UGUCUCUU 


67 


AAGAGACA GGCTAGCTACAACGA ATTAAGAT 


2081 


320 


UGAUCUGU A UCAUCGUG 


72 


CACGATGA GGCTAGCTACAACGA ACAGATCA 


2082 


350 


GUUCUGCU A CAACCUCU 


80 


AGAGGTTG GGCTAGCTACAACGA AGCAGAAC 


2083 


419 


AAAACAAU A UUGUAUAA 


91 


TTATACAA GGCTAGCTACAACGA ATTGTTTT 


2084 


424 


AAUAUUGU A UAACAGAC 


93 


GTCTGTTA GGCTAGCTACAACGA ACAATATT 


2085 


489 


UAAAACUU A UUGUUACC 


108 


GGTAACAA GGCTAGCTACAACGA AAGTTTTA 


2086 


495 


UUAUUGUU A CCAUAUGU 


111 1 ACATATGG GGCTAGCTACAACGA AACAATAA 


2087 


500 


GUUACCAU A UGUAUUCA 


112 


TGAATACA GGCTAGCTACAACGA ATGGTAAC 


2088 


504 


CCAUAUGU A UUCAUCUG 


113 


CAGATGAA GGCTAGCTACAACGA ACATATGG 


2089 


542 


AGGGCUUU A UUUUCAAA 


123 


TTTGAAAA GGCTAGCTACAACGA AAAGCCCT 


2090 


572 


AUAAGUGU A UAAAAUGC 


133 


GCATTTTA GGCTAGCTACAACGA ACACTTAT 


2091 


617 


AAAUUUCU A UCCCAAAU 




ATTTGGGA GGCTAGCTACAACGA AGAAATTT 


2092 


684 


CACUUCAU A UAUAAAGC 


162 


GCTTTATA GGCTAGCTACAACGA ATGAAGTG 


2093 


686 


CUUCAUAU A UAAAGCAU 




ATGCTTTA GGCTAGCTACAACGA ATATGAAG 


2094 


696 


AAAGCAUU A UUUUUACU 


166 


AGTAAAAA GGCTAGCTACAACGA AATGCTTT 


2095 


702 


UUAUUUUU A CUCUUUUG 


171 


CAAAAGAG GGCTAGCTACAACGA AAAAATAA 


2096 


719 


AGGUGAAU A UAAUUUAU 


176 


ATAAATTA GGCTAGCTACAACGA ATTCACCT 


2097 


726 


UAUAAUUU A UAUUACAA 


180 


TTGTAATA GGCTAGCTACAACGA AAATTATA 


2098 


728 


UAAUUUAU A UUACAAUG 


1B1 


CATTGTAA GGCTAGCTACAACGA ATAAATTA 


2099 


731 


UUUAUAUU A CAAUGUAA 


183 


TTACATTG GGCTAGCTACAACGA AATATAAA 


2100 


753 


UCUUUAAU A CUAAGUAU 


190 


ATACTTAG GGCTAGCTACAACGA ATTAAAGA 


2101 


760 


UACUAAGU A UUUUUCAG 


192 


CTGAAAAA GGCTAGCTACAACGA ACTTAGTA 


2102 


782 


CACCAAGU A UCAAAGUA 


201 


TACTTTGA GGCTAGCTACAACGA ACTTGGTG 


2103 


813 


GUGUCAUU A UUCAAAAU 


207 


ATTTTGAA GGCTAGCTACAACGA AATGACAC 


2104 


B47 


CAUCUGUU A UCUUAUUA 


21S 


TAATAAGA GGCTAGCTACAACGA AACAGATG 


2105 


852 


GUUAUCUU A UUAUAAAG 


219 


CTTTATAA GGCTAGCTACAACGA AAGATAAC 


2106 


855 


AUCUUAUU A UAAAGAAC 


221 


GTTCTTTA GGCTAGCTACAACGA AATAAGAT 


2107 


865 


AAAGAACU A UUUGUAGU 


223 


ACTACAAA GGCTAGCTACAACGA AGTTCTTT 


2108 


878 


UAGUAACU A UCAGAAUC 


228 


GATTCTGA GGCTAGCTACAACGA AGTTACTA 


2109 


888 


CAGAAUCU A CAUUCUAA 


231 


TTAGAATG GGCTAGCTACAACGA AGATTCTG 


2110 
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GAAAUUGU A UUUUUUCU 



AGAAAAAA GGCTAGCTACAACGA ACAATTTC 



UUUUUUCU A UGCCACAU 



ATGTGGCA GGCTAGCTACAACGA AGAAAAAA 



AAUCAAGU A UGGAAAAG 



CTTTTCCA GGCTAGCTACAACGA ACTTGATT 



AAGGCCAU A CUCUUACA 



TGTAAGAG GGCTAGCTACAACGA ATGGCCTT 



AUACUCUU A CAUAAUAA 



TTATTATG GGCTAGCTACAACGA AAGAGTAT 



AUUCUAGU A CAUGUAGG 



CCTACATG GGCTAGCTACAACGA ACTAGAAT 



UAAGACAU A UGAUCAAC 



GTTGATCA GGCTAGCTACAACGA ATGTCTTA 



UGGUUAAU A UGUGACAG 



CTGTCACA GGCTAGCTACAACGA ATTAACCA 



UUAGUCAU A UCACUAAU 



ATTAGTGA GGCTAGCTACAACGA ATGACTAA 



UCACUAAU A UACUAACA 



TGTTAGTA GGCTAGCTACAACGA ATTAGTGA 



ACUAAUAU A CUAACAAC 



GTTGTTAG GGCTAGCTACAACGA ATATTAGT 



AGUGAAUU A UCUGAGCU 



AGCTCAGA GGCTAGCTACAACGA AATTCACT 



CUAGAGUU A CCUAGCUU 



AAGCTAGG GGCTAGCTACAACGA AACTCTAG 



CCUAGCUU A CCAUACUA 



TAGTATGG GGCTAGCTACAACGA AAGCTAGG 



CUUACCAU A CUAUAUCU 



AGATATAG GGCTAGCTACAACGA ATGGTAAG 



ACCAUACU A UAUC'JUUG 



CAAAGATA GGCTAGCTACAACGA AGTATGGT 



CAUACUAU A UCUUUGGA 



TCCAAAGA GGCTAGCTACAACGA ATAGTATG 



AGAAAAAU A UAUUAUCU 



AGATAATA GGCTAGCTACAACGA ATTTTTCT 



AAAAAUAU A UUAUCUCA 



TGAGATAA GGCTAGCTACAACGA ATATTTTT 



AAUAUAUU A UCUCAAGU 



ACTTGAGA GGCTAGCTACAACGA AATATATT 



UCUCAAGU A UUUUUUAA 



TTAAAAAA GGCTAGCTACAACGA ACTTGAGA 



UUAAAAAU A UAUGAAUU 



AATTCATA GGCTAGCTACAACGA ATTTTTAA 



AAAAAUAU A UGAAUUCU 



AGAATTCA GGCTAGCTACAACGA ATATTTTT 



CUCCAAAU A UUAACUAA 



TTAGTTAA GGCTAGCTACAACGA ATTTGGAG 



AACUAAUU A UUAGAUUA 



TAATCTAA GGCTAGCTACAACGA AATTAGTT 



AUUAGAUU A UAUUUUGA 



TCAAAATA GGCTAGCTACAACGA AATCTAAT 



UAGAUUAU A UUUUGAAA 



TTTCAAAA GGCTAGCTACAACGA ATAATCTA 



GCCCAUCU A UUACAUCU 



AGATGTAA GGCTAGCTACAACGA AGATGGGC 



CAUCUAUU A CAUCUACA 



TGTAGATG GGCTAGCTACAACGA AATAGATG 



UUACAUCU A CAGCUGAC 



GTCAGCTG GGCTAGCTACAACGA AGATGTAA 



UCUUAACU A CCUAAUAG 



CTATTAGG GGCTAGCTACAACGA AGTTAAGA 



AAUAGCCU A CUAUUGAC 



GTCAATAG GGCTAGCTACAACGA AGGCTATT 



AGCCUACU A UUGACCAU 



ATGGTCAA GGCTAGCTACAACGA AGTAGGCT 



UAAACCUU A CUGAUAAC 



GTTATCAG GGCTAGCTACAACGA AAGGTTTA 



UAACACAU A UUUUGCGU 



ACGCAAAA GGCTAGCTACAACGA ATGTGTTA 



AATACATA GGCTAGCTACAACGA AACACGCA 



CGUGUUAU A UGUAUUAU 



ATAATACA GGCTAGCTACAACGA ATAACACG 



UUAUAUGU A UUAUACAC 



GTGTATAA GGCTAGCTACAACGA ACATATAA 



UAUGUAUU A UACACUAU 



ATAGTGTA GGCTAGCTACAACGA AATACATA 



UGUAUUAU A CACUAUAU 



ATATAGTG GGCTAGCTACAACGA ATAATACA 



UAUACACU A UAUUCCUA 



TAGGAATA GGCTAGCTACAACGA AGTGTATA 



UACACUAU A UUCCUACA 



TGTAGGAA GGCTAGCTACAACGA ATAGTGTA 



AUAUUCCU A CAAUAAAG 



CTTTATTG GGCTAGCTACAACGA AGGAATAT 



AAAAUGUU A UUUAGAAA 



TTTCTAAA GGCTAGCTACAACGA AACATTTT 



UCUAUACU G UGAUGAUC 



GATCATCA GGCTAGCTACAACGA AGTATAGA 



UCACAGCU G CCAAGGCU 



AGCCTTGG GGCTAGCTACAACGA AGCTGTGA 



AUUUGGCU G CCAGCUUU 



AAAGCTGG GGCTAGCTACAACGA AGCCAAAT 
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GACUUCCU G UCCUGCUG 



CAGCAGGA GGCTAGCTACAACGA AGGAAGTC 



CCUGUCCU G CUGGUAUC 



GATACCAG GGCTAGCTACAACGA AGGACAGG 



CCUCACUC G CUCAGCUA 



TAGCTGAG GGCTAGCTACAACGA GAGTGAGG 



AUUGAAAU G CCUCAACA 



TGTTGAGG GGCTAGCTACAACGA ATTTCAAT 



CAAUUUCU G UCUCAUCU 



:tagctacaacga agaaattg 



CUUAAUAU G UCUCUUGC 



GCAAGAGA GGCTAGCTACAACGA ATATTAAG 



UGUCUCUU G CUGAUCUG 



CAGATCAG GGCTAGCTACAACGA AAGAGACA 



GCUGAUCU G UAUCAUCG 



CGATGATA GGCTAGCTACAACGA AGATCAGC 



AUCGUGAU G CUUCUCUG 



CAGAGAAG GGCTAGCTACAACGA ATCACGAT 



GAAGUUCU G CUACAACC 



GGTTGTAG GGCTAGCTACAACGA AGAACTTC 



CUAGAUCU G CAGCUUGC 



GCAAGCTG GGCTAGCTACAACGA AGATCTAG 



UGCAGCUU G CCACAUCA 



TGATGTGG GGCTAGCTACAACGA AAGCTGCA 



UAAAAUCU G UCAUCCCA 



TGGGATGA GGCTAGCTACAACGA AGATTTTA 



CAUCCCAU G CAGACAGG 



CCTGTCTG GGCTAGCTACAACGA ATGGGATG 



ACAAUAUU G UAUAACAG 



CTGTTATA GGCTAGCTACAACGA AATATTGT 



GUUUCUUU G UGAAAAGG 



CCTTTTCA GGCTAGCTACAACGA AAAGAAAC 



AACUUAUU G UUACCAUA 



TATGGTAA GGCTAGCTACAACGA AATAAGTT 



UACCAUAU G UAUUCAUC 



GATGAATA GGCTAGCTACAACGA ATATGGTA 



AUUCAUCU G UUGGAUCU 



AGATCCAA GGCTAGCTACAACGA AGATGAAT 



UGGAUCUU G UAAACAUG 



CATGTTTA GGCTAGCTACAACGA AAGATCCA 



AAAUAAGU G UAUAAAAU 



ATTTTATA GGCTAGCTACAACGA ACTTATTT 



UAUAAAAU G CAACUGUU 



AACAGTTG GGCTAGCTACAACGA ATTTTATA 



AUGCAACU G UUGAUUUC 



GAAATCAA GGCTAGCTACAACGA AGTTGCAT 



UUAAAACU G CACUGCCA 



TGGCAGTG GGCTAGCTACAACGA AGTTTTAA 



ACUGCACU G CCAACAAG 



CTTGTTGG GGCTAGCTACAACGA AGTGCAGT 



AUUACAAU G UAAAAGCU 



AGCTTTTA GGCTAGCTACAACGA ATTGTAAT 



AAUGAAGU G UCAUUAUU 



AATAATGA GGCTAGCTACAACGA ACTTCATT 



UCACAUCU G UUAUCUUA 



TAAGATAA GGCTAGCTACAACGA AGATGTGA 



AACUAUUU G UAGUAACU 



AGTTACTA GGCTAGCTACAACGA AAATAGTT 



CAGAAAUU G UAUUUUTJU 



AAAAAATA GGCTAGCTACAACGA AATTTCTG 



UUUUCUAU G CCACAUUA 



TAATGTGG GGCTAGCTACAACGA ATAGAAAA 



UAGUACAU G UAGGUAAA 



TTTACCTA GGCTAGCTACAACGA ATGTACTA 



AUAAAUCU G UUCUAAGA 



TCTTAGAA GGCTAGCTACAACGA AGATTTAT 



GUUAAUAU G UGACAGUG 



CACTGTCA GGCTAGCTACAACGA ATATTAAC 



AAGGCACU G UAGUGAAU 



ATTCACTA GGCTAGCTACAACGA AGTGCCTT 



AUGAUUUU G CAGGUUGU 



ACAACCTG GGCTAGCTACAACGA AAAATCAT 



UGCAGGUU G UCUUCCAU 



ATGGAAGA GGCTAGCTACAACGA AACCTGCA 



CAUCCAAU G CAGGCAAG 



CTTGCCTG GGCTAGCTACAACGA ATTGGATG 



AUGAACUU G UUGGCCCA 



TGGGCCAA GGCTAGCTACAACGA AAGTTCAT 



GUGGGUCC G CAAAAUCU 



AGATTTTG GGCTAGCTACAACGA GGACCCAC 



CAUAUUUU G CGUGUUAU 



ATAACACG GGCTAGCTACAACGA AAAATATG 



UUUUGCGU G UUAUAUGU 



ACATATAA GGCTAGCTACAACGA ACGCAAAA 



UGUUAUAU G UAUUAUAC 



GTATAATA GGCTAGCTACAACGA ATATAACA 



GAGAAAAU G UUAUUUAG 



CTAAATAA GGCTAGCTACAACGA ATTTTCTC 



ACUCCCCA G CUAAACAC 



GTGTTTAG GGCTAGCTACAACGA TGGGGAGT 



AAACACCC G UAAGACUU 



AAGTCTTA GGCTAGCTACAACGA GGGTGTTT 



UGAUCACA G CUGCCAAG 



CTTGGCAG GGCTAGCTACAACGA TGTGATCA 
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CUGCCAAG G CUACCUAA 
AGAAGACA G UUAUCUCA 
CAUAUUUG G CUGCCAGC 
GGCUGCCA G CUUUUUAU 
UCCUGCUG G UAUCAUGG 
UGGAGAAA G UCCAAUAC 
CUCGCUCA G CUAUAAGA 
UAAGAAGA G CCUCAACC 
CUCAACAA G CACGUCAA 
ACAAGCAC G UCAAAAGC 
CGUCAAAA G CUACAGAA 
GUAUCAUC G UGAUGCUU 
UCUCUGAA G UUCUGCUA 
GAUCUGCA G CUUGCCAC 
CCACAUCA G CUUAAAAU 
CUUCCUGA G UAGAAGAG 
GUAGAAGA G UUUCUUUG 
GUGAAAAG G UCAAGAUU 
UGAAAAGG G CUUUAUUU 
CAAAAUAA G UGUAUAAA 
UCAACAUG G CUCACAAA 
GAUGAAGA G UUUAGUUU 
AGAGUUUA G UUUUAAAA 
GCCAACAA G UUCACUUC 
UAUAUAAA G CAUUAUUU 
CUUUUGAG G UGAAUAUA 
AUGUAAAA G CUUCUUUA 
AAUACUAA G UAUUUUUC 
UUUUUCAG G UCUUCACC 
UUCACCAA G UAUCAAAG 
GUAUCAAA G UAAUAACA 
CAAAUGAA G UGUCAUUA 
UCAAAAUA G UCCACUGA 
UAUUUGUA G UAACUAUC 
CUUUUAAA G UUGAUGAG 
AGAAUCAA G UAUGGAAA 
AUGGAAAA G 



TTAGGTAG GGCTAGCTACAACGA CTTGGCAG 
TGAGATAA GGCTAGCTACAACGA TGTCTTCT 
GCTGGCAG GGCTAGCTACAACGA CAAATATG 
ATAAAAAG GGCTAGCTACAACGA TGGCAGCC 
CCATGATA GGCTAGCTACAACGA CAGCAGGA 
GTATTGGA GGCTAGCTACAACGA TTTCTCCA 
TCTTATAG GGCTAGCTACAACGA TGAGCGAG 
GGTTGAGG GGCTAGCTACAACGA TCTTCTTA 
TTGACGTG GGCTAGCTACAACGA TTGTTGAG 
GCTTTTGA GGCTAGCTACAACGA GTGCTTGT 
TTCTGTAG GGCTAGCTACAACGA TTTTGACG 
AAGCATCA GGCTAGCTACAACGA GATGATAC 
TAGCAGAA GGCTAGCTACAACGA TTCAGAGA 
GTGGCAAG GGCTAGCTACAACGA TGCAGATC 
ATTTTAAG GGCTAGCTACAACGA TGATGTGG 
CTCTTCTA GGCTAGCTACAACGA TCAGGAAG 
CAAAGAAA GGCTAGCTACAACGA TCTTCTAC 
AATCTTGA GGCTAGCTACAACGA CTTTTCAC 
AAATAAAG GGCTAGCTACAACGA CCTTTTCA 
TTTATACA GGCTAGCTACAACGA TTATTTTG 
TTTGTGAG GGCTAGCTACAACGA CATGTTGA 
AAACTAAA GGCTAGCTACAACGA TCTTCATC 
TTTTAAAA GGCTAGCTACAACGA TAAACTCT 
GAAGTGAA GGCTAGCTACAACGA TTGTTGGC 
AAATAATG GGCTAGCTACAACGA TTTATATA 
TATATTCA GGCTAGCTACAACGA CTCAAAAG 
TAAAGAAG GGCTAGCTACAACGA TTTTACAT 
GAAAAATA GGCTAGCTACAACGA TTAGTATT 
GGTGAAGA GGCTAGCTACAACGA CTGAAAAA 
CTTTGATA GGCTAGCTACAACGA TTGGTGAA 
TGTTATTA GGCTAGCTACAACGA TTTGATAC 
TAATGACA GGCTAGCTACAACGA TTCATTTG 
TCAGTGGA GGCTAGCTACAACGA TATTTTGA 
GATAGTTA GGCTAGCTACAACGA TACAAATA 
CTCATCAA GGCTAGCTACAACGA TTTAAAAG 
TTTCCATA GGCTAGCTACAACGA TTGATTCT 
TGGCCTTA GGCTAGCTACAACGA TTTTCCAT 



AAAGUAAG G CCAUACUC 



GAGTATGG GGCTAGCTACAACGA CTTACTTT 



CCUUUUAA G UAAUUUUU 
GAAUUCUA G UACAUGUA 
ACAUGUAG G UAAAUCAU 
GAGAACUG G UGGUUAAU 
AACUGGUG G UUAAUAUG 
AUGUGACA G UGAGAUUA 
UGAGAUUA G UCAUAUCA 
CAUUUAAG G CACUGUAG 
GCACUGUA G UGAAUUAU 



AAAAATTA GGCTAGCTACAACGA 
TACATGTA GGCTAGCTACAACGA TAGAATTC 
ATGATTTA GGCTAGCTACAACGA CTACATGT 
ATTAACCA GGCTAGCTACAACGA CAGTTCTC 
CATATTAA GGCTAGCTACAACGA CACCAGTT 
TAATCTCA GGCTAGCTACAACGA TGTCACAT 
TGATATGA GGCTAGCTACAACGA TAATCTCA 
CTACAGTG GGCTAGCTACAACGA CTTAAATG 
ATAATTCA GGCTAGCTACAACGA TACAGTGC 
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UUAUCUGA G CUAGAGUU 



AACTCTAG GGCTAGCTACAACGA TCAGATAA 



GAGCUAGA G UUACCUAG 



CTAGGTAA GGCTAGCTACAACGA TCTAGCTC 



GUUACCUA G CUUACCAU 



ATGGTAAG GGCTAGCTACAACGA TAGGTAAC 



UUUUGCAG G UUGUCUUC 



GAAGACAA GGCTAGCTACAACGA CTGCAAAA 



CCAUUCCA G CCUAACAU 



ATGTTAGG GGCTAGCTACAACGA TGGAATGG 



CAAUGCAG G CAAGGAAA 



TTTCCTTG GGCTAGCTACAACGA CTGCATTG 



GAUUUCCA G UGACAGAA 



TTCTGTCA GGCTAGCTACAACGA TGGAAATC 



UAUCUCAA G UAUUUUUU 



AAAAAATA GGCTAGCTACAACGA TTGAGATA 



ACUUGUUG G CCCAUCUA 



TAGATGGG GGCTAGCTACAACGA CAACAAGT 



CAUCUACA G CUGACCCU 



AGGGTCAG GGCTAGCTACAACGA TGTAGATG 



ACAUGGGG G UUAGGGGA 



TCCCCTAA GGCTAGCTACAACGA CCCCATGT 



UUAGGGGA G CUGACAAU 



ATTGTCAG GGCTAGCTACAACGA TCCCCTAA 



GACAAUUC G UGGGUCCG 



CGGACCCA GGCTAGCTACAACGA GAATTGTC 



AUUCGUGG G UCCGCAAA 



TTTGCGGA GGCTAGCTACAACGA CCACGAAT 



ACCUAAUA G CCUACUAU 



ATAGTAGG GGCTAGCTACAACGA TATTAGGT 



CAUAAACA G UAAAUUAA 



TTAATTTA GGCTAGCTACAACGA TGTTTATG 



UAUUUUGC G UGUUAUAU 



ATATAACA GGCTAGCTACAACGA GCAAAATA 



ACAAUAAA G UAAGCUAG 



CTAGCTTA GGCTAGCTACAACGA TTTATTGT 



UAAAGUAA G CUAGAGAA 



TTCTCTAG GGCTAGCTACAACGA TTACTTTA 



GUCAGAAA A CUCCCCAG 



CTGGGGAG GGCTAGCTACAACGA TTTCTGAC 



CCAGCUAA A CACCCGUA 



TACGGGTG GGCTAGCTACAACGA TTAGCTGG 



AGCUAAAC A CCCGUAAG 



CTTACGGG GGCTAGCTACAACGA GTTTAGCT 



CCCGUAAG A CUUCAUAC 



GTATGAAG GGCTAGCTACAACGA CTTACGGG 



AAGACUUC A UACAACAC 



GTGTTGTA GGCTAGCTACAACGA GAAGTCTT 



UUCAUACA A CACAAUAC 



GTATTGTG GGCTAGCTACAACGA TGTATGAA 



CAUACAAC A CAAUACUC 



GAGTATTG GGCTAGCTACAACGA GTTGTATG 



ACAACACA A UACUCUAU 



ATAGAGTA GGCTAGCTACAACGA TGTGTTGT 



AUACUGUG A UGAUCACA 



TGTGATCA GGCTAGCTACAACGA CACAGTAT 



CUGUGAUG A UCACAGCU 



AGCTGTGA GGCTAGCTACAACGA CATCACAG 



UGAUGAUC A CAGCUGCC 



GGCAGCTG GGCTAGCTACAACGA GATCATCA 



AAAAGAAG A CAGUUAUC 



GATAACTG GGCTAGCTACAACGA CTTCTTTT 



GUUAUCUC A UAUUUGGC 



GCCAAATA GGCTAGCTACAACGA GAGATAAC 



UUCUCUCG A CCACUUAA 



TTAAGTGG GGCTAGCTACAACGA CGAGAGAA 



UCUCGACC A CUUAAAAC 



GTTTTAAG GGCTAGCTACAACGA GGTCGAGA 



CACUUAAA A CUUCAGAC 



GTCTGAAG GGCTAGCTACAACGA TTTAAGTG 



AACUUCAG A CUUCCUGU 



ACAGGAAG GGCTAGCTACAACGA CTGAAGTT 



TTTCTCCA GGCTAGCTACAACGA GATACCAG 



AAAGUCCA I 



GTGAGGTA GGCTAGCTACAACGA TGGACTTT 



AAUACCUC A CUCGCUCA 



TGAGCGAG GGCTAGCTACAACGA GAGGTATT 



GAGCCUCA A CCAUUGAA 



TTCAATGG GGCTAGCTACAACGA TGAGGCTC 



CCUCAACC A UUGAAAUG 
CCAUUGAA A UGCCUCAA 



CATTTCAA GGCTAGCTACAACGA GGTTGAGG 



TTGAGGCA GGCTAGCTACAACGA TTCAATGG 



AUGCCUCA A CAAGCACG 



CGTGCTTG GGCTAGCTACAACGA TGAGGCAT 



CAACAAGC A CGUCAAAA 



TTTTGACG GGCTAGCTACAACGA GCTTGTTG 



GCUACAGA A UCUAUUUA 



TAAATAGA GGCTAGCTACAACGA TCTGTAGC 



AUUUAUCA A UUUCUGUC 



UCUGUCUC A UCUUAAUA 



GACAGAAA GGCTAGCTACAACGA TGATAAAT 
TATTAAGA GGCTAGCTACAACGA GAGACAGA 
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UCAUCUUA A UAUGUCUC 



GAGACATA GGCTAGCTACAACGA TAAGATGA 



UCUUGCUG A UCUGUAUC 



GATACAGA GGCTAGCTACAACGA CAGCAAGA 



UCUGUAUC A UCGUGAUG 



CATCACGA GGCTAGCTACAACGA GATACAGA 



UCAUCGUG A UGCUUCUC 



GAGAAGCA GGCTAGCTACAACGA CACGATGA 



CUGCUACA A CCUCUAGA 



TCTAGAGG GGCTAGCTACAACGA TGTAGCAG 



ACCUCUAG A UCUGCAGC 



GCTGCAGA GGCTAGCTACAACGA CTAGAGGT 



375 
377 



AGCUUGCC A CAUCAGCU 



AGCTGATG GGCTAGCTACAACGA GGCAAGCT 



CUUGCCAC A UCAGCUUA 



TAAGCTGA GGCTAGCTACAACGA GTGGCAAG 



AGCUUAAA A UCUGUCAU 



ATGACAGA GGCTAGCTACAACGA TTTAAGCT 



AAUCUGUC A UCCCAUGC 



GCATGGGA GGCTAGCTACAACGA GACAGATT 



GUCAUCCC A UGCAGACA 



TGTCTGCA GGCTAGCTACAACGA GGGATGAC 



CCAUGCAG A CAGGAAAA 



TTTTCCTG GGCTAGCTACAACGA CTGCATGG 



ACAGGAAA A CAAUAUUG 



CAATATTG GGCTAGCTACAACGA TTTCCTGT 



GGAAAACA A UAUUGUAU 



ATACAATA GGCTAGCTACAACGA TGTTTTCC 



AUUGUAUA A CAGACCAC 



GTGGTCTG GGCTAGCTACAACGA TATACAAT 



UAUAACAG A CCACUUCC 



GGAAGTGG GGCTAGCTACAACGA CTGTTATA 



AACAGACC A CUUCCUGA 



TCAGGAAG GGCTAGCTACAACGA GGTCTGTT 



AGGUCAAG A UUAAGACU 



AGTCTTAA GGCTAGCTACAACGA CTTGACCT 



AGAUUAAG A CUAAAACU 



AGTTTTAG GGCTAGCTACAACGA CTTAATCT 



AGACUAAA A CUUAUUGU 



ACAATAAG GGCTAGCTACAACGA TTTAGTCT 



UUGUUACC A UAUGUAUU 



AATACATA GGCTAGCTACAACGA GGTAACAA 



AUGUAUUC A UCUGUUGG 



CCAACAGA GGCTAGCTACAACGA GAATACAT 



UCUGUUGG A UCUUGUAA 



TTACAAGA GGCTAGCTACAACGA CCAACAGA 



UCUUGUAA A CAUGAAAA 



TTTTCATG GGCTAGCTACAACGA TTACAAGA 



UUGUAAAC A UGAAAAGG 



CCTTTTCA GGCTAGCTACAACGA GTTTACAA 



UUUCAAAA A UUAACUUC 



GAAGTTAA GGCTAGCTACAACGA TTTTGAAA 
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AAAAAUUA A CUUCAAAA 



TTTTGAAG GGCTAGCTACAACGA TAATTTTT 



ACUUCAAA A UAAGUGUA 



TACACTTA GGCTAGCTACAACGA TTTGAAGT 



UGUAUAAA A UGCAACUG 



CAGTTGCA GGCTAGCTACAACGA TTTATACA 



AAAAUGCA A CUGUUGAU 



ATCAACAG GGCTAGCTACAACGA TGCATTTT 



AACUGUUG A UUUCCUCA 



TGAGGAAA GGCTAGCTACAACGA CAACAGTT 



UUUCCUCA A CAUGGCUC 



GAGCCATG GGCTAGCTACAACGA TGAGGAAA 



600 
607 
611 



UCCUCAAC A UGGCUCAC 



GTGAGCCA GGCTAGCTACAACGA GTTGAGGA 



CAUGGCUC A CAAAUUUC 



GAAATTTG GGCTAGCTACAACGA GAGCCATG 



GCUCACAA A UUUCUAUC 



GATAGAAA GGCTAGCTACAACGA TTGTGAGC 



UAUCCCAA A UCUUUUCU 



AGAAAAGA GGCTAGCTACAACGA TTGGGATA 



UUCUGAAG A UGAAGAGU 



ACTCTTCA GGCTAGCTACAACGA CTTCAGAA 



GUUUUAAA A CUGCACUG 



CAGTGCAG GGCTAGCTACAACGA TTTAAAAC 



AAAACUGC A CUGCCAAC 



GTTGGCAG GGCTAGCTACAACGA GCAGTTTT 



CACUGCCA A CAAGUUCA 



TGAACTTG GGCTAGCTACAACGA TGGCAGTG 



ACAAGUUC A CUUCAUAU 



ATATGAAG GGCTAGCTACAACGA GAACTTGT 



UUCACUUC A UAUAUAAA 



TTTATATA GGCTAGCTACAACGA GAAGTGAA 



UAUAAAGC A UUAUUUUU 



AAAAATAA GGCTAGCTACAACGA GCTTTATA 



UGAGGUGA A UAUAAUUU 



AAATTATA GGCTAGCTACAACGA TCACCTCA 



UGAAUAUA A UUUAUAUU 



AATATAAA GGCTAGCTACAACGA TATATTCA 



AUAUUACA A UGUAAAAG 



CTTTTACA GGCTAGCTACAACGA TGTAATAT 



CUUCUUUA A UACUAAGU 



ACTTAGTA GGCTAGCTACAACGA TAAAGAAG 
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AGGUCUUC A CCAAGUAU 



ATACTTGG GGCTAGCTACAACGA GAAGACCT 



UCAAAGUA A UAACACAA 



TTGTGTTA GGCTAGCTACAACGA TACTTTGA 



AAGUAAUA A CACAAAUG 



CATTTGTG GGCTAGCTACAACGA TATTACTT 



GUAAUAAC A CAAAUGAA 



TTCATTTG GGCTAGCTACAACGA GTTATTAC 



UAACACAA A UGAAGUGU 



ACACTTCA GGCTAGCTACAACGA TTGTGTTA 



GAAGUGUC A UUAUUCAA 



TTGAATAA GGCTAGCTACAACGA GACACTTC 



UAUUCAAA A UAGUCCAC 



GTGGACTA GGCTAGCTACAACGA TTTGAATA 



AAUAGUCC A CUGACUCC 



GGAGTCAG GGCTAGCTACAACGA GGACTATT 



GUCCACUG A CUCCUCAC 



GTGAGGAG GGCTAGCTACAACGA CAGTGGAC 



GACUCCUC A CAUCUGUU 



AACAGATG GGCTAGCTACAACGA GAGGAGTC 



CUCCUCAC A UCUGUUAU 



ATAACAGA GGCTAGCTACAACGA GTGAGGAG 



UAUAAAGA A CUAUUUGU 



ACAAATAG GGCTAGCTACAACGA TCTTTATA 



UUGUAGUA A CUAUCAGA 



TCTGATAG GGCTAGCTACAACGA TACTACAA 



CUAUCAGA A UCUACAUU 



AATGTAGA GGCTAGCTACAACGA TCTGATAG 



GAAUCUAC A UUCUAAAA 



TTTTAGAA GGCTAGCTACAACGA GTAGATTC 



AUUCUAAA A CAGAAAUU 



AATTTCTG GGCTAGCTACAACGA TTTAGAAT 



AAACAGAA A UUGUAUUU 



AAATACAA GGCTAGCTACAACGA TTCTGTTT 



UCUAUGCC A CAUUAACA 



TGTTAATG GGCTAGCTACAACGA GG CAT AG A 



UAUGCCAC A UUAACAUC 



GATGTTAA GGCTAGCTACAACGA GTGGCATA 



CCACAUUA A CAUCUUUU 



AAAAGATG GGCTAGCTACAACGA TAATGTGG 



ACAUUAAC A UCUUUUAA 



TTAAAAGA GGCTAGCTACAACGA GTTAATGT 



UAAAGUUG A UGAGAAUC 



GATTCTCA GGCTAGCTACAACGA CAACTTTA 



UGAUGAGA A UCAAGUAU 



ATACTTGA GGCTAGCTACAACGA TCTCATCA 



GUAAGGCC A UACUCUUA 



TAAGAGTA GGCTAGCTACAACGA GGCCTTAC 



ACUCUUAC A UAAUAAAA 



TTTTATTA GGCTAGCTACAACGA GTAAGAGT 



CUUACAUA A UAAAAUUC 



GAATTTTA GGCTAGCTACAACGA TATGTAAG 



AUAAUAAA A UUCCUUUU 



AAAAGGAA GGCTAGCTACAACGA TTTATTAT 



UUUAAGUA A UUUUUUCA 



TGAAAAAA GGCTAGCTACAACGA TACTTAAA 



UUCAAAGA A UCACAGAA 



TTCTGTGA GGCTAGCTACAACGA TCTTTGAA 



AAAGAAUC A CAGAAUUC 



GAATTCTG GGCTAGCTACAACGA GATTCTTT 



AUCACAGA A UUCUAGUA 



TACTAGAA GGCTAGCTACAACGA TCTGTGAT 



UCUAGUAC A UGUAGGUA 



TACCTACA GGCTAGCTACAACGA GTACTAGA 



GUAGGUAA A UCAUAAAU 



ATTTATGA GGCTAGCTACAACGA TTACCTAC 



GGUAAAUC A UAAAUCUG 



CAGATTTA GGCTAGCTACAACGA GATTTACC 



AAUCAUAA A UCUGUUCU 



AGAACAGA GGCTAGCTACAACGA TTATGATT 



GUUCUAAG A CAUAUGAU 



ATCATATG GGCTAGCTACAACGA CTTAGAAC 



UCUAAGAC A UAUGAUCA 



TGATCATA GGCTAGCTACAACGA GTCTTAGA 



GACAUAUG A UCAACAGA 



TCTGTTGA GGCTAGCTACAACGA CATATGTC 



UAUGAUCA A CAGAUGAG 



CTCATCTG GGCTAGCTACAACGA TGATCATA 



AUCAACAG A UGAGAACU 



AGTTCTCA GGCTAGCTACAACGA CTGTTGAT 



AGAUGAGA A CUGGUGGU 



ACCACCAG GGCTAGCTACAACGA TCTCATCT 



GGUGGUUA A UAUGUGAC 



GTCACATA GGCTAGCTACAACGA TAACCACC 



AAUAUGUG A CAGUGAGA 



TCTCACTG GGCTAGCTACAACGA CACATATT 



ACAGUGAG A UUAGUCAU 



ATGACTAA GGCTAGCTACAACGA CTCACTGT 



GAUUAGUC A UAUCACUA 



TAGTGATA GGCTAGCTACAACGA GACTAATC 



GUCAUAUC A CUAAUAUA 



TATATTAG GGCTAGCTACAACGA GATATGAC 



UAUCACUA A UAUACUAA 



TTAGTATA GGCTAGCTACAACGA TAGTGATA 
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1175 



1285 
1300 



1325 
1352 
1360 
1373 
1379 
1383 
1391 



1533 
1536 
1538 
1542 



AUAUACUA A CAACAGAA 
UACUAACA A CAGAAUCU 
ACAACAGA A UCUAAUCU 
AGAAUCUA A UCUUCAUU 
UAAUCUUC A UUUAAGGC 
UUUAAGGC A CUGUAGUG 
UGUAGUGA A UUAUCUGA 
AGCUUACC A UACUAUAU 
UCUUUGGA A UCAUGAAA 
UUGGAAUC A UGAAACCU 
AUCAUGAA A CCUUAAGA 
ACCUUAAG A CUUCAGAA 
ACUUCAGA A UGAUUUUG 
UCAGAAUG A UUUUGCAG 
UGUCUUCC A UUCCAGCC 
CCAGCCUA A CAUCCAAU 
AGCCUAAC A UCCAAUGC 
AACAUCCA A UGCAGGCA 
CAAGGAAA A UAAAAGAU 
AAUAAAAG A UUUCCAGU 
UUCCAGUG A CAGAAAAA 
ACAGAAAA A UALTAUUAU 
UUUUAAAA A UAUAUGAA 
AUAUAUGA A UUCUCUCU 
CUCUCCAA A UAUUAACU 
AAAUAUUA A CUAAUUAU 
AUUAACUA A UUAUUAGA 
AUUAUUAG A UUAUAUUU 
AUUUUGAA A UGAACUUG 
UGAAAUGA A CUUGUUGG 
GUUGGCCC A UCUAUUAC 
UCUAUUAC A UCUACAGC 
UACAGCUG A CCCUUGAA 
ACCCUUGA A CAUGGGGG 
CCUUGAAC A UGGGGGUU 
GGGAGCUG A CAAUUCGU 
AGCUGACA A UUCGUGGG 
UCCGCAAA A UCUUAACU 
AAAUCUUA A CUACCUAA 
ACUACCUA A UAGCCUAC 
UACUAUUG A CCAUAAAC 
UAUUGACC A UAAACCUU 
GACCAUAA A CCUUACUG 
CCUUACUG A UAACAUAA 
UACUGAUA A CAUAAACA 
CUGAUAAC A UAAACAGU 
A CAGUAAAU 



TTCTGTTG GGCTAGCTACAACGA TAGTATAT 
AGATTCTG GGCTAGCTACAACGA TGTTAGTA 
AGATTAGA GGCTAGCTACAACGA TCTGTTGT 
AATGAAGA GGCTAGCTACAACGA TAGATTCT 
GCCTTAAA GGCTAGCTACAACGA GAAGATTA 
CACTACAG GGCTAGCTACAACGA GCCTTAAA 
TCAGATAA GGCTAGCTACAACGA TCACTACA 
ATATAGTA GGCTAGCTACAACGA GGTAAGCT 
TTTCATGA GGCTAGCTACAACGA TCCAAAGA 
AGGTTTCA GGCTAGCTACAACGA GATTCCAA 
TCTTAAGG GGCTAGCTACAACGA TTCATGAT 
TTCTGAAG GGCTAGCTACAACGA CTTAAGGT 
CAAAATCA GGCTAGCTACAACGA TCTGAAGT 
CTGCAAAA GGCTAGCTACAACGA CATTCTGA 
GGCTGGAA GGCTAGCTACAACGA GGAAGACA 
ATTGGATG GGCTAGCTACAACGA TAGGCTGG 
GCATTGGA GGCTAGCTACAACGA GTTAGGCT 
TGCCTGCA GGCTAGCTACAACGA TGGATGTT 
ATCTTTTA GGCTAGCTACAACGA TTTCCTTG 
ACTGGAAA GGCTAGCTACAACGA CTTTTATT 
TTTTTCTG GGCTAGCTACAACGA CACTGGAA 
ATAATATA GGCTAGCTACAACGA TTTTCTGT 
TTCATATA GGCTAGCTACAACGA TTTTAAAA 
AGAGAGAA GGCTAGCTACAACGA TCATATAT 
AGTTAATA GGCTAGCTACAACGA TTGGAGAG 
ATAATTAG GGCTAGCTACAACGA TAATATTT 
TCTAATAA GGCTAGCTACAACGA TAGTTAAT 
AAATATAA GGCTAGCTACAACGA CTAATAAT™ 
CAAGTTCA GGCTAGCTACAACGA TTCAAAAT 
CCAACAAG GGCTAGCTACAACGA TCATTTCA 
GTAATAGA GGCTAGCTACAACGA GGGCCAAC 
GCTGTAGA GGCTAGCTACAACGA GTAATAGA 
TTCAAGGG GGCTAGCTACAACGA CAGCTGTA 
CCCCCATG GGCTAGCTACAACGA TCAAGGGT 
AACCCCCA GGCTAGCTACAACGA GTTCAAGG 
ACGAATTG GGCTAGCTACAACGA CAGCTCCC 
CCCACGAA GGCTAGCTACAACGA TGTCAGCT 
AGTTAAGA GGCTAGCTACAACGA TTTGCGGA 
TTAGGTAG GGCTAGCTACAACGA TAAGATTT 
GTAGGCTA GGCTAGCTACAACGA TAGGTAGT 
GTTTATGG GGCTAGCTACAACGA CAATAGTA 
AAGGTTTA GGCTAGCTACAACGA GGTCAATA 
CAGTAAGG GGCTAGCTACAACGA TTATGGTC 
TTATGTTA GGCTAGCTACAACGA CAGTAAGG 
TGTTTATG GGCTAGCTACAACGA TATCAGTA 
ACTGTTTA GGCTAGCTACAACGA GTTATCAG 
ATTTACTG GGCTAGCTACAACGA TTATGTTA 
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AACAGUAA A UUAACACA 



TGTGTTAA GGCTAGCTACAACGA TTACTGTT 



GUAAAUUA A CACAUAUU 



AATATGTG GGCTAGCTACAACGA TAATTTAC 



AAAUUAAC A CAUAUUUU 



AAAATATG GGCTAGCTACAACGA GTTAATTT 



AUUAACAC A UAUUUUGC 



GCAAAATA GGCTAGCTACAACGA GTGTTAAT 



UAUUAUAC A CUAUAUUC 



GAATATAG GGCTAGCTACAACGA GTATAATA 



UUCCUACA A UAAAGUAA 



TTACTTTA GGCTAGCTACAACGA TGTAGGAA 



UAGAGAAA A UGUUAUUU 



AAATAACA GGCTAGCTACAACGA TTTCTCTA 



Input Sequence = PLN. Cut Site = R/Y 

Stem Length = 8 . Core Sequence = GGCTAGCTACAACGA 

PLN (Homo sapiens phospholamban (PLN) mRNA.; 1635 bp) 
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Table 30: Human Phospholamban (PLN) Antisense and Target Sequence 



CAGAGUCAGAAAACUCCCCAGCUAA 
AGAGUCAGAAAACUCCCCAGCUAAA 
GAGUCAGAAAACUCCCCAGCUAAAG 
AGUCAGAAAACUCCCCAGCUAAACA 
GUCAGAAAACUCCCCAGCUAAACAC 
UCAGAAAACUCCCCAGCUAAACACC 
CAGAAAACUCCCCAGCUAAACACCC 
AGAAAACUCCCCAGCUAAACACCCG 
GAAAACUCCCCAGCUAAACACCCGU 
AAAACUCCCCAGCUAAACACCCGUA 
AAACUCCCCAGCUAAACACCCGUAA 
AACUCCCCAGCUAAACACCCGUAAG 
ACUCCCCAGCUAAACACCCGUAAGA 
CUCCCCAGCUAAACACCCGUAAGAC 
UCCCCAGCUAAACACCCGUAAGACU 
CCCCAGCUAAACACCCGUAAGACUU 
CCCAGCUAAACACCCGUAAGACUUC 
CCAGCUAAACACCCGUAAGACUUCA 
CAGCUAAACACCCGUAAGACUUCAU 
AGCUAAACACCCGUAAGACUUCAUA 
GCUAAACACCCGUAAGACUUCAUAC 
CUAAACACCCGUAAGACUUCAUACA 
UAAACACCCGUAAGACUUCAUACAA 
AAACACCCGUAAGACOUCAUACAAC 
AACACCCGUAAGACUUCAUACAACA 
ACACCCGUAAGACUUCAUACAACAC 
CACCCGUAAGACUUCAUACAACACA 
ACCCGUAAGACUUCAUACAACACAA 
CCCGUAAGACUUCAUACAACACAAU 
UGUGAUGAUCACAGCUGCCAAGGCU 
GUGAUGAUCACAGCUGCCAAGGCUA 
UGAUGAUCACAGCUGCCAAGGCUAC 
GAUGAUCACAGCUGCCAAGGCUACC 
AUGAUCACAGCUGCCAAGGCUACCU 
UGAUCACAGCUGCCAAGGCUACCUA 
GAUCACAGCUGCCAAGGCUACCUAA 
AUCACAGCUGCCAAGGCUACCUAAA 
UCACAGCUGCCAAGGCUACCUAAAA 
CACAGCUGCCAAGGCUACCUAAAAG 
ACAGCUGCCAAGGCUACCUAAAAGA 
CAGCUGCCAAGGCUACCUAAAAGAA 
AGCUGCCAAGGCUACCUAAAAGAAG 
GCUGCCAAGGCUACCUAAAAGAAGA 
CUGCCAAGGCUACCUAAAAGAAGAC 



TTAGCTGGGGAGTTTTCTGACTCTG 
TTTAGCTGGGGAGTTTTCTGACTCT 
GTTTAGCTGGGGAGTTTTCTGACTC 
TGTTTAGCTGGGGAGTTTTCTGACT 
GTGTTTAGCTGGGGAGTTTTCTGAC 
GGTGTTTAGCTGGGGAGTTTTCTGA 
GGGTGTTTAGCTGGGGAGTTTTCTG 
CGGGTGTTTAGCTGGGGAGTTTTCT 
ACGGGTGTTTAGCTGGGGAGTTTTC 
TACGGGTGTTTAGCTGGGGAGTTTT 
TTACGGGTGTTTAGCTGGGGAGTTT 
CTTACGGGTGTTTAGCTGGGGAGTT 
TCTTACGGGTGTTTAGCTGGGGAGT 
GTCTTACGGGTGTTTAGCTGGGGAG 
AGTCTTACGGGTGTTTAGCTGGGGA 
AAGTCTTACGGGTGTTTAGCTGGGG 
GAAGTCTTACGGGTGTTTAGCTGGG 
TGAAGTCTTACGGGTGTTTAGCTGG 
ATGAAGTCTTACGGGTGTTTAGCTG 
TATGAAGTCTTACGGGTGTTTAGCT 
GTATGAAGTCTTACGGGTGTTTAGC 
TGTATGAAGTCTTACGGGTGTTTAG 
TTGTATGAAGTCTTACGGGTGTTTA 
GTTGTATGAAGTCTTACGGGTGTTT 
TGTTGTATGAAGTCTTACGGGTGTT 
GTGTTGTATGAAGTCTTACGGGTGT 
TGTGTTGTATGAAGTCTTACGGGTG 
TTGTGTTGTATGAAGTCTTACGGGT 
ATTGTGTTGTATGAAGTCTTACGGG 
AGCCTTGGCAGCTGTGATCATCACA 
TAGCCTTGGCAGCTGTGATCATCAC 
GTAGCCTTGGCAGCTGTGATCATCA 
GGTAGCCTTGGCAGCTGTGATCATC 
AGGTAGCCTTGGCAGCTGTGATCAT 
TAGGTAGCCTTGGCAGCTGTGATCA 
TTAGGTAGCCTTGGCAGCTGTGATC 
TTTAGGTAGCCTTGGCAGCTGTGAT 
TTTTAGGTAGCCTTGGCAGCTGTGA 
CTTTTAGGTAGCCTTGGCAGCTGTG 
TCTTTTAGGTAGCCTTGGCAGCTGT 
TTCTTTTAGGTAGCCTTGGCAGCTG 
CTTCTTTTAGGTAGCCTTGGCAGCT 
TCTTCTTTTAGGTAGCCTTGGCAGC 
GTCTTCTTTTAGGTAGCCTTGGCAG 
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UGCC AAGGCUAC CUAAAAGAAGACA 



GCCAAGGCUACCUAAAAGAAGACAG 



TGTCTTCTTTTAGGTAGCCTTGGCA 



CTGTCTTCTTTTAGGTAGCCTTGGC 



CCAAGGCUACCUAAAAGAAGACAGU 



ACTGTCTTCTTTTAGGTAGCCTTGG 



CAAGGCUACCUAAAAGAAGACAGUU 



AACTGTCTTCTTTTAGGTAGCCTTG 



AGACAGUUAUCUCAUAUUUGGCUGC 



GACAGUUAUCUCAUAUUUGGCUGCC 



GCAGCCAAATATGAGATAACTGTCT 



GGCAGCCAAATATGAGATAACTGTC 



ACAGUUAUCUCAUAUUUGGCUGCCA 



TGGCAGCCAAATATGAGATAACTGT 



CAGUUAUCUCAUAUUUGGCUGCCAG 



CTGGCAGCCAAATATGAGATAACTG 



AGUUAUCUCAUAUUUGGCUGCCAGC 



GCTGGCAGCCAAATATGAGATAACT 



GUUAUCUCAUAUUUGGCUGCCAGCU 



AGCTGGCAGCCAAATATGAGATAAC 



UUAUCUCAUAUUUGGCUGCCAGCUU 



AAGCTGGCAGCCAAATATGAGATAA 



UAUCUCAUAUUUGGCUGCCAGCUUU 



AAAGCTGGCAGCCAAATATGAGATA 



AUCUCAUAUUUGGCUGCCAGCUUUU 



AAAAGCTGGCAGCCAAATATGAGAT 



UCUCAUAUUUGGCUGCCAGCUUUUU 



AAAAAGCTGGCAGC CAAATATGAGA 



CUCAUAUUUGGCUGCCAGCUUUUUA 



TAAAAAGCTGGCAGCCAAATATGAG 



UCAUAUUUGGCUGCCAGCUUUUUAU 



ATAAAAAGCTGGCAGCCAAATATGA 



CAUAUUUGGCUGCCAGCUUUUUAUC 



GATAAAAAGCTGGCAGCCAAATATG 



AUAUUUGGCUGCCAGCUUUUUAUCU 



AGATAAAAAGCTGGCAGCCAAATAT 



UAUUUGGCUGCCAGCUUUUUAUCUU 



AAGATAAAAAGCTGGCAGCCAAATA 



AUUUGGCUGCCAGCUUUUUAUCUUU 



AAAGATAAAAAGCTGGCAGCCAAAT 



UUUGGCUGCCAGCUUUUUAUCUUUC 



GAAAGATAAAAAGCTGGCAGCCAAA 



UUGGCUGCCAGCUUUUUAUCUUUCU 



AGAAAGATAAAAAGCTGGCAGCCAA 



UGGCUGCCAGCUUUUUAUCUUUCUC 



GAGAAAGATAAAAAGCTGGCAGCCA 



GGCUGCCAGCUUUUUAUCUUUCUCU 



AGAGAAAGATAAAAAGCTGGCAGCC 



GCUGCCAGCUUUUUAUCUUUCUCUC 



GAGAGAAAGATAAAAAGCTGGCAGC 



CUGCCAGCUUUUUAUCUUUCUCUCG 



CGAGAGAAAGATAAAAAGCTGGCAG 



UGCCAGCUUUUUAUCUUUCUCUCGA 



TCGAGAGAAAGATAAAAAGCTGGCA 



GCCAGCUUUUUAUCUUUCUCUCGAC 



GTCGAGAGAAAGATAAAAAGCTGGC 



CCAGCUUUUUAUCUUUCUCUCGACC 



GGTCGAGAGAAAGATAAAAAGCTGG 



CAGCUUUUUAUCUUUCUCUCGACCA 



TGGTCGAGAGAAAGATAAAAAGCTG 



AGCUUUUUAUCUUUCUCUCGACCAC 



GTGGTCGAGAGAAAGATAAAAAGCT 



■ GCUUUUUAUCUUUCUCUCGACCACU 



AGTGGTCGAGAGAAAGATAAAAAGC 



CUUUUUAUCUUUCUCUCGACCACUU 



AAGTGG TCGAGAGAAAGATAAAAAG 



AUCUUUCUCUCGACCACUUAAAACU 



AGTTTTAAGTGGTCGAGAGAAAGAT 



UCUUUCUCUCGACCACUUAAAACUU 



AAGTTTTAAGTGGTCGAGAGAAAGA 



CUUUCUCUCGACCACUUAAAACUUC 



GAAGTTTTAAGTGGTCGAGAGAAAG 



UUUCUCUCGACCACUUAAAACUUCA 



TGAAGTTTTAAGTGGTCGAGAGAAA 



UUCUCUCGACCACUUAAAACUUCAG 



CTGAAGTTTTAAGTGGTCGAGAGAA 



UCUCUCGACCACUUAAAACUUCAGA 



TCTGAAGTTTTAAGTGGTCGAGAGA 



CUCUCGACCACUUAAAACUUCAGAC 



GTCTGAAGTTTTAAGTGGTCGAGAG 



UCUCGACCACUUAAAACUUCAGACU 



AGTCTGAAGTTTTAAGTGGTCGAGA 



CUCGACCACUUAAAACUUCAGACUU 



AAGTCTGAAGTTTTAAGTGGTCGAG 



UCGACCACUUAAAACUUCAGACUUC 



GAAGTCTGAAGTTTTAAGTGGTCGA 



CGACCACUUAAAACUUCAGACUUCC 



GGAAGTCTGAAGTTTTAAGTGGTCG 



GACCACUUAAAACUUCAGACUUCCU 



AGGAAGTCTGAAGTTTTAAGTGGTC 



ACCACUUAAAACUUCAGACUUCCUG 



CAGGAAGTCTGAAGTTTTAAGTGGT 



CCACUUAAAACUUCAGACUUCCUGU 



ACAGGAAGTCTGAAGT' 
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ACUUAAAACUUCAGACUUCCUGUCC 



GGACAGGAAGTCTGAAGTTTTAAGT 



CUUAAAACUUCAGACUUCCUGUCCU 
UUAAAACUUCAGACUUCCUGUCCUG 



AGGACAGGAAGTCTGAAGTTTTAAG 



UAAAACUUCAGACUUCCUGUCCUGC 



GCAGGACAGGAAGTCTGAAGTTTTA 



AAAACUUCAGACUUCCUGUCCUGCU 



AGCAGGACAGGAAGTCTGAAGTTTT 



AAACUUCAGACUUCCUGUCCUGCUG 



CAGCAGGACAGGAAGTCTGAAGTTT 



AACUUCAGACUUCCUGUCCUGCUGG 



CCAGCAGGACAGGAAGTCTGAAGTT 



ACUUCAGACUUCCUGUCCUGCUGGU 



ACCAGCAGGACAGGAAGTCTGAAGT 



CUUCAGACUUCCUGUCCUGCUGGUA 



TACCAGCAGGACAGGAAGTCTGAAG 



UUCAGACUUCCUGUCCUGCUGGUAU 



ATACCAGCAGGACAGGAAGTCTGAA 



UCAGACUUCCUGUCCUGCUGGUAUC 



GATACCAGCAGGACAGGAAGTCTGA 



CAGACUUCCUGUCCUGCUGGUAUCA 



TGATACCAGCAGGACAGGAAGTCTG 



AGACUUCCUGUCCUGCUGGUAUCAU 



ATGATACCAGCAGGACAGGAAGTCT 



GACUUCCUGUCCUGCUGGUAUCAUG 



CATGATACCAGCAGGACAGGAAGTC 



ACUUCCUGUCCUGCUGGUAUCAUGG 



CCATGATACCAGCAGGACAGGAAGT 



CUUCCUGUCCUGCUGGUAUCAUGGA 



TCCATGATACCAGCAGGACAGGAAG 



UUCCUGUCCUGCUGGUAUCAUGGAG 



CTCCATGATACCAGCAGGACAGGAA 



UCCUGUCCUGCUGGUAUCAUGGAGA 



TCTCCATGATACCAGCAGGACAGGA 



CCUGUCCUGCUGGUAUCAUGGAGAA 



TTCTCCATGATACCAGCAGGACAGG 



CUGUCCUGCUGGUAUCAUGGAGAAA 



TTTCTCCATGATACCAGCAGGACAG 



UGUCCUGCUGGUAUCAUGGAGAAAG 



CTTTCTCCATGATACCAGCAGGACA 



GUCCUGCUGGUAUCAUGGAGAAAGU 



ACTTTCTCCATGATACCAGCAGGAC 



UCCUGCUGGUAUCAUGGAGAAAGUC 



GACTTTCTCCATGATACCAGCAGGA 



CCUGCUGGUAUCAUGGAGAAAGUCC 



GGACTTTCTCCATGATACCAGCAGG 



UCAUGGAGAAAGUCCAAUACCUCAC 



GTGAGGTATTGGACTTTCTCCATGA 



CAUGGAGAAAGUCCAAUACCUCACU 



AGTGAGGTATTGGACTTTCTCCATG 



AUGGAGAAAGUCCAAUACCUCACUC 



GAGTGAGGTATTGGACTTTCTCCAT 



UGGAGAAAGUCCAAUACCUCACUCG 



CGAGTGAGGTATTGGACTTTCTCCA 



GGAGAAAGUCCAAUACCUCACUCGC 



GCGAGTGAGGTATTGGACTTTCTCC 



GAGAAAGUCCAAUACCUCACUCGCU 



AGCGAGTGAGGTATTGGACTTTCTC 



AGAAAGUCCAAUACCUCACUCGCUC 



GAGCGAGTGAGGTATTGGACTTTCT 



GAAAGUCCAAUACCUCACUCGCUCA 



TGAGCGAGTGAGGTATTGGACTTTC 



AAAGUCCAAUACCUCACUCGCUCAG 



CTGAGCGAGTGAGGTATTGGACTTT 



AAGUCCAAUACCUCACUCGCUCAGC 



GCTGAGCGAGTGAGGTATTGGACTT 



AGUCCAAUACCUCACUCGCUCAGCU 



AGCTGAGCGAGTGAGGTATTGGACT 



GUCCAAUACCUCACUCGCUCAGCUA 



TAGCTGAGCGAGTGAGGTATTGGAC 



UCCAAUACCUCACUCGCUCAGCUAU 



ATAGCTGAGCGAGTGAGGTATTGGA 



CCAAUACCUCACUCGCUCAGCUAUA 



TATAGCTGAGCGAGTGAGGTATTGG 



CAAUACCUCACUCGCUCAGCUAUAA 



TTATAGCTGAGCGAGTGAGGTATTG 



AAUACCUCACUCGCUCAGCUAUAAG 



CTTATAGCTGAGCGAGTGAGGTATT 



AUACCUCACUCGCUCAGCUAUAAGA 



TCTTATAGCTGAGCGAGTGAGGTAT 



UACCUCACUCGCUCAGCUAUAAGAA 



TT CTTATAGCTGAGCGAGTGAGGTA 



ACCUCACUCGCUCAGCUAUAAGAAG 



CTTCTTATAGCTGAGCGAGTGAGGT 



CCUCACUCGCUCAGCUAUAAGAAGA 



TCTTCTTATAGCTGAGCGAGTGAGG 



CUCACUCGCUCAGCUAUAAGAAGAG 



CTCTTCTTATAGCTGAGCGAGTGAG 



UCACUCGCUCAGCUAUAAGAAGAGC 



GCTCTTCTTATAGCTGAGCGAGTGA 



CACUCGCUCAGCUAUAAGAAGAGCC 



GGCTCTTCTTATAGCTGAGCGAGTG 
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ACUCGCUCAGCUAUAAGAAGAGCCU 



AGGCTCTT CTTATAGCTGAGCGAGT 



CUCGCUCAGCUAUAAGAAGAGCCUC 



GAGGCTCTTCTTATAGCTGAGCGAG 



UCGCUCAGCUAUAAGAAGAGCCUCA 



TGAGGCTCTTCTTATAGCTGAGCGA 



CGCUCAGCUAUAAGAAGAGCCUCAA 



TTGAGGCTCTTCTTATAGCTGAGCG 



GCUCAGCUAUAAGAAGAGCCUCAAC 



GTTGAGGCTCTTCTTATAGCTGAGC 



CUCAGCUAUAAGAAGAGCCUCAACC 



GGTTGAGGCTCTTCTTATAGCTGAG 



UCAGCUAUAAGAAGAGCCUCAACCA 



TGGTTGAGGCTCTTCTTATAGCTGA 



CAGCUAUAAGAAGAGCCUCAACCAU 



ATGGTTGAGGCTCTTCTTATAGCTG 



AGCUAUAAGAAGAGCCUCAACCAUU 



AATGGTTGAGGCTCTTCTTATAGCT 



GCUAUAAGAAGAGCCUCAACCAUUG 



CAATGGTTGAGGCTCTTCTTATAGC 



CUAUAAGAAGAGCCUCAACCAUUGA 



TCAATGGTTGAGGCTCTTCTTATAG 



UAUAAGAAGAGCCUCAACCAUUGAA 



TTCAATGGTTGAGGCTCTTCTTATA 



AUAAGAAGAGCCUCAACCAUUGAAA 



TTTCAATGGTTGAGGCTCTTCTTAT 



UAAGAAGAGCCUCAACCAUUGAAAU 



ATTTCAATGGTTGAGGCTCTTCTTA 



AAGAAGAGCCUCAACCAUUGAAAUG 



CATTTCAATGGTTGAGGCTCTTCTT 



AGAAGAGC CUCAACCAUUGAAAUGC 



GCATTTCAATGGTTGAGGCTCTTCT 



GAAGAGCCUCAACCAUUGAAAUGCC 



GGCATTTCAATGGTTGAGGCTCTTC 



AAGAGCCUCAACCAUUGAAAUGCCU 



AGGCATTTCAATGGTTGAGGCTCTT 



AGAGCCUCAACCAUUGAAAUGC CUC 



GAGGCATTT CAATGGTTGAGGCTCT 



GAGCCUCAACCAUUGAAAUGCCUCA 



TGAGGCATTTCAATGGTTGAGGCTC 



AGCCUCAACCAUUGAAAUGCCUCAA 



TTGAGGCATTTCAATGGTTGAGGCT 



GCCUCAACCAUUGAAAUGCCUCAAC 



GTTGAGGCATTTCAATGGTTGAGGC 



CCUCAACCAUUGAAAUGCCUCAACA 



TGTTGAGGCATTTCAATGGTTGAGG 



CUCAACCAUUGAAAUGC CUCAACAA 



TTGTTGAGGCATTTCAATGGTTGAG 



UCAACCAUUGAAAUGCCUCAACAAG 



CTTGTTGAGGCATTTCAATGGTTGA 



CAACCAUUGAAAUGCCUCAACAAGC 



GCTTGTTGAGGCATTTCAATGGTTG 



AACCAUUGAAAUGCCUCAACAAGCA 



TGCTTGTTGAGGCATTTCAATGGTT 



ACCAUUGAAAUGCCUCAACAAGCAC 



GTGCTTGTTGAGGCATTTCAATGGT 



CCAUUGAAAUGCCUCAACAAGCACG 



CGTGCTTGTTGAGGCATTTCAATGG 



CAUUGAAAUGCCUCAACAAGCACGU 



ACGTGCTTGTTGAGGCATTTCAATG 



AUUGAAAUGCCUCAACAAGCACGUC 



UUGAAAUGCCUCAACAAGCACGUCA 



TGACGTGCTTGTTGAGGCATTTCAA 



UGAAAUGCCUCAACAAGCACGUCAA 



TTGACGTGCTTGTTGAGGCATTTCA 



- GAAAUGCCUCAACAAGCACGUCAAA 



TTTGACGTGCTTGTTGAGGCATTTC 



AAAUGCCUCAACAAGCACGUCAAAA 



TTTTGACGTGCTI 



AAUGCCUCAACAAGCACGUCAAAAG 



CTTTTGACGTGCTTGTTGAGGCATT 



AUGCCUCAACAAGCACGUCAAAAGC 



GCTTTTGACGTGCTTGTTGAGGCAT 



UGCCUCAACAAGCACGUCAAAAGCU 



AGCTTTTGACGTGCTTGTTGAGGCA 



GCCUCAACAAGCACGUCAAAAGCUA 



TAGCTTTTGACGTGCTTGTTGAGGC 



CCUCAACAAGCACGUCAAAAGCUAC 



GTAGCTTTTGACGTGCTTGTTGAGG 



CUCAACAAGCACGUCAAAAGCUACA 



TGTAGCTTTTGACGTGCTTGTTGAG 



UCAACAAGCACGUCAAAAGCUACAG 



CTGTAGCTTTTGACGTGCTTGTTGA 



CAACAAGCACGUCAAAAGCUACAGA 



TCTGTAGCTTTTGACGTGCTTGTTG 



AACAAGCACGUCAAAAGCUACAGAA 



TTCTGTAGCTTTTGACGTGCTTGTT 



ACAAGCACGUCAAAAGCUACAGAAU 



CAAGCACGUCAAAAGCUACAGAAUC 



ATTCTGTAGCTTTTGACGTGCTTGT 



GATTCTGTAGCTTTTGACGTGCTTG 



AAGCACGUCAAAAGCUACAGAAUCU 



AGATTCTGTAGCTTTTGACGTGCTT 
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250 


AGCACGUCAAAAGCUACAGAAUCUA 


2632 


TAGATTCTGTAGCTTTTGACGTGCT 


3236 


251 


GCACGUCAAAAGCUACAGAAUCUAU 


2633 


ATAGATTCTGTAGCTTTTGACGTGC 


3237 


252 


CACGUCAAAAGCUACAGAAUCUAUU 


2634 


AATAGATTCTGTAGCTTTTGACGTG 


3238 


253 


ACGUCAAAAGCUACAGAAUCUAUUU 


2635 


AAATAGATTCTGTAGCTTTTGACGT 


3239 


254 


CGUCAAAAGCUACAGAAUCUAUUUA 


2636 


TAAATAGATTCTGTAGCTTTTGACG 


3240 


255 


GUCAAAAG CUACAGAAUCUAUUUAU 


2637 


ATAAATAGATTCTGTAGCTTTTGAC 


3241 


256 


UCAAAAGCUACAGAAUCUAUUUAUC 


2638 


GATAAATAGATTCTGTAGCTTTTGA 


3242 


257 


CAAAAGCUACAGAAUCUAUUUAUCA 


2539 


TGATAAATAGATTCTGTAGCTTTTG 


3243 


258 


AAAAGCUACAGAAUCUAUUUAUCAA 


2640 


TTGATAAATAGATTCTGTAGCTTTT 


3244 


259 


AAAGCUACAGAAUCUAUUUAUCAAU 


2641 


ATTGATAAATAGATTCTGTAGCTTT 


3245 


2S0 


AAGCUACAGAAUCUAUUUAUCAAUU 


2642 


AATTGATAAATAGATTCTGTAGCTT 


3246 


261 


AGCUACAGAAUCUAUUUAUCAAUUU 


2643 


AAATTGATAAATAGATTCTGTAGCT 


3247 


262 


GCUACAGAAUCUAUUUAUCAAUUUC 


2644 


GAAATTGATAAATAGATTCTGTAGC 


3248 


263 


CUACAGAAUCUAUUUAUCAAUUUCU 


2645 


AGAAATTGATAAATAGATTCTGTAG 


3249 


264 


UACAGAAUCUAUUUAUCAAUUUCUG 


2646 


CAGAAATTGATAAATAGATTCTGTA 


3250 


265 


ACAGAAUCUAUUUAUCAAUUUCUGU 


2647 


ACAGAAATTGATAAATAGATTCTGT 


3251 


266 


CAGAAUCUAUUUAUCAAUUUCUGUC 


2648 


GACAGAAATTGATAAATAGATTCTG 


3252 


267 


AGAAUCUAUUUAUCAAUUUCUGUCU 


2649 


AGACAGAAATTGATAAATAGATTCT 


3253 


268 


GAAUCUAUUUAUCAAUUUCUGUCUC 


2650 


GAGACAGAAATTGATAAATAGATTC 


3254 


269 


AAUCUAUUUAUCAAUUUCUGUCUCA 


2651 


TGAGACAGAAATTGATAAATAGATT 


3255 


270 


AUCUAUUUAUCAAUUUCUGUCUCAU 


2652 


ATGAGACAGAAATTGATAAATAGAT 


3256 


271 


UCUAUUUAUCAAUUUCUGUCUCAUC 


2653 


GATGAGACAGAAATTGATAAATAGA 


3257 


272 


CUAUUUAUCAAUUUCUGUCUCAUCU 


2654 


AGATGAGACAGAAATTGATAAATAG 


3258 


273 


UAUUUAUCAAUUUCUGUCUCAUCUU 


2655 


AAGATGAGACAGAAATTGATAAATA 


3259 


274 


AUUUAUCAAUUUCUGUCUCAUCUUA 


2656 


TAAGATGAGACAGAAATTGATAAAT 


3260 


275 


UUUAUCAAUUUCUGUCUCAUCUUAA 


2657 


TTAAGATGAGACAGAAATTGATAAA 


3261 


276 


UUAUCAAUUUCUGUCUCAUCUUAAU 


2658 


ATTAAGATGAGACAGAAATTGATAA 


3262 


277 


UAUCAAUUUCUGUCUCAUCUUAAUA 


2659 


TATTAAGATGAGACAGAAATTGATA 


3263 


278 


AUCAAUUUCUGUCUCAUCUUAAUAU 


2660 


ATATTAAGATGAGACAGAAATTGAT 


3264 


279 


UCAAUUUCUGUCUCAUCUUAAUAUG 


2661 


CATATTAAGATGAGACAGAAATTGA 


3265 


280 


CAAUUUCUGUCUCAUCUUAAUAUGU 


2662 


ACATATTAAGATGAGACAGAAATTG 


3266 


281 


AAUUUCUGUCUCAUCUUAAUAUGUC 


2663 


GACATATTAAGATGAGACAGAAATT 


3267 


282 


AUUUCUGUCUCAUCUUAAUAUGUCU 


2664 


AGACATATTAAGATGAGACAGAAAT 


3268 


283 


UUUCUGUCUCAUCUUAAUAUGUCUC 


2665 


GAGACATATTAAGATGAGACAGAAA 


3269 


284 


UUCUGUCUCAUCUUAAUAUGUCUCU 


2666 


AGAGACATATTAAGATGAGACAGAA 


3270 


285 


UCUGUCUCAUCUUAAUAUGUCUCUU 


2667 


AAGAGACATATTAAGATGAGACAGA 


3271 


286 


CUGUCUCAUCUUAAUAUGUCUCUUG 


2668 


CAAGAGACATATTAAGATGAGACAG 


3272 


287 


UGUCUCAUCUUAAUAUGUCUCUUGC 


2669 


GCAAGAGACATATTAAGATGAGACA 


3273 


288 


GUCUCAUCUUAAUAUGUCUCUUGCU 


2670 


AGCAAGAGACATATTAAGATGAGAC 


3274 


289 


UCUCAUCUUAAUAUGUCUCUUGCUG 


2671 


CAGCAAGAGACATATTAAGATGAGA 


3275 


290 


CUCAUCUUAAUAUGUCUCUUGCUGA 


2672 


TCAGCAAGAGACATATTAAGATGAG 




291 


UCAUCUUAAUAUGUCUCUUGCUGAO 


2673 


ATCAGCAAGAGACATATTAAGATGA 


3277 


292 


CAUCUUAAUAUGUCUCUUGCUGAUC 


2674 


GATCAGCAAGAGACATATTAAGATG 


3278 


293 


AUCUUAAUAUGUCUCUUGCUGAUCU 


2675 


AGATCAGCAAGAGACATATTAAGAT 


3279 


2 94 


UCUUAAUAUGUCUCUUGCUGAUCUG 


2676 


CAGATCAGCAAGAGACATATTAAGA 


3280 


295 


CUUAAUAUGUCUCUUGCUGAUCUGU 


2677 


ACAGATCAGCAAGAGACATATTAAG 


3281 


296 


UUAAUAUGUCUCUUGCUGAUCUGUA 


2678 


TACAGATCAGCAAGAGACATATTAA 


3282 
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UAAUAUGUCUCUUGCUGAUCUGUAU 



ATACAGATCAGCAAGAGACATATTA 



AAUAUGUCUCUUGCUGAUCUGUAUC 



GATACAGATCAGCAAGAGACATATT 



AUAUGUCUCUUGCUGAUCUGUAUCA 



TGATACAGATCAGCAAGAGACATAT 



UAUGUCUCUUGCUGAUCUGUAUCAU 



ATGATACAGATCAGCAAGAGACATA 



AUGUCUCUUGCUGAUCUGUAUCAUC 



GATGATACAGATCAGCAAGAGACAT 



UGUCUCUUGCUGAUCUGUAUCAUCG 



CGATGATACAGATCAGCAAGAGACA 



GUCUCUUGCUGAUCUGUAUCAUCGU 



ACGATGATACAGATCAGCAAGAGAC 



UCUCUUGCUGAUCUGUAUCAUCGUG 



CACGATGATACAGATCAGCAAGAGA 



CUCUUGCUGAUCUGUAUCAUCGUGA 



TCACGATGATACAGATCAGCAAGAG 



UCUUGCUGAUCUGUAUCAUCGUGAU 



ATCACGATGATACAGATCAGCAAGA 



CUUGCUGAUCUGUAUCAUCGUGAUG 



CATCACGATGATACAGATCAGCAAG 



UUGCUGAUCUGUAUCAUCGUGAUGC 



GCATCACGATGATACAGATCAGCAA 



UGCUGAUCUGUAUCAUCGUGAUGCU 



AGCATCACGATGATACAGATCAGCA 



GCUGAUCUGUAUCAUCGUGAUGCUU 



AAGCATCACGATGATACAGATCAGC 



CUGAUCUGUAUCAUCGUGAUGCUUC 



GAAGCATCACGATGATACAGATCAG 



UGAUCUGUAUCAUCGUGAUGCUUCU 



AGAAGCATCACGATGATACAGATCA 



GAUCUGUAUCAUCGUGAUGCUUCUC 



GAGAAGCATCACGATGATACAGATC 



AUCUGUAUCAUCGUGAUGCUUCUCU 



AGAGAAGCATCACGATGATACAGAT 



UCUGUAUCAUCGUGAUGCUUCUCUG 



CAGAGAAGCATCACGATGATACAGA 



CUGUAUCAUCGUGAUGCUUCUCUGA 



TCAGAGAAGCATCACGATGATACAG 



UGUAUCAUCGUGAUGCUUCUCUGAA 



, TTCAGAGAAGCATCACGATGATACA 



GUAUCAUCGUGAUGCUUCUCUGAAG 



CTTCAGAGAAGCATCACGATGATAC 



UAUCAUCGUGAUGCUUCUCUGAAGU 



ACTTCAGAGAAGCATCACGATGATA 



AUCAUCGUGAUGCUUCUCUGAAGUU 



AACTTCAGAGAAGCATCACGATGAT 



UCAUCGUGAUGCUUCUCUGAAGUUC 



GAACTTCAGAGAAGCATCACGATGA 



CAUCGUGAUGCUUCUCUGAAGUUCU 



AGAACTTCAGAGAAGCATCACGATG 



AUCGUGAUGCUUCUCUGAAGUUCUG 



CAGAACTTCAGAGAAGCATCACGAT 



UCGUGAUGCUUCUCUGAAGUUCUGC 



GCAGAACTTCAGAGAAGCATCACGA 



CGUGAUGCUUCUCUGAAGUUCUGCU 



AGCAGAACTTCAGAGAAGCATCACG 



GUGAUGCUUCUCUGAAGUUCUGCUA 



TAGCAGAACTTCAGAGAAGCATCAC 



UGAUGCUUCUCUGAAGUUCUGCUAC 



GTAGCAGAACTTCAGAGAAGCATCA 



GAUGCUUCUCUGAAGUUCUGCUACA 



TGTAGCAGAACTTCAGAGAAGCATC 



AUGCUUCUCUGAAGUUCUGCUACAA 



TTGTAGCAGAACTTCAGAGAAGCAT 



UGCUUCUCUGAAGUUCUGCUACAAC 



GTTGTAGCAGAACTTCAGAGAAGCA 



GCUUCUCUGAAGUUCUGCUACAACC 



GGTTGTAGCAGAACTTCAGAGAAGC 



CUUCUCUGAAGUUCUGCUACAACCU 



AGGTTGTAGCAGAACTTCAGAGAAG 



UUCUCUGAAGUUCUGCUACAACCUC 



GAGGTTGTAGCAGAACTTCAGAGAA 



UCUCUGAAGUUCUGCUACAACCUCU 



AGAGGTTGTAGCAGAACTTCAGAGA 



CUCUGAAGUUCUGCUACAACCUCUA 



TAGAGGTTGTAGCAGAACTTCAGAG 



UCUGAAGUUCUGCUACAACCUCUAG 



CTAGAGGTTGTAGCAGAACTTCAGA 



CUGAAGUUCUGCUACAACCUCUAGA 



TCTAGAGGTTGTAGCAGAACTTCAG 



UGAAGUUCUGCUACAACCUCUAGAU 



ATCTAGAGGTTGTAGCAGAACTTCA 



GAAGUUCUGCUACAACCUCUAGAUC 



GATCTAGAGGTTGTAGCAGAACTTC 



AAGUUCUGCUACAACCUCUAGAUCU 



AGATCTAGAGGTTGTAGCAGAACTT 



AGUUCUGCUACAACCUCUAGAUCUG 



CAGATCTAGAGGTTGTAGCAGAACT 



GUUCUGCUACAACCUCUAGAUCUGC 



GCAGATCTAGAGGTTGTAGCAGAAC 



UUCUGCUACAACCUCUAGAUCUGCA 



TGCAGATCTAGAGGTTGTAGCAGAA 
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UCUG CUAC AAC CUCUAGAUCUGCAG 
CUGCUACAACCUCUAGAUCUGCAGC 
UGCUACAACCUCUAGAUCUGCAGCU 
GCUACAACCUCUAGAUCUGCAGCUU 
CUACAACCUCUAGAUCUGCAGCUUG 
UACAACCUCUAGAUCUGCAGCUUGC 
ACAACCUCUAGAUCUGCAGCUUGCC 
CAAC CUCUAGAUCUGCAGCUUGCCA 
AACCUCUAGAUCUGCAGCUUGCCAC 
ACCUCUAGAUCUGCAGCUUGCCACA 
CCUCUAGAUCUGCAGCUUGCCACAU 
CUCUAGAUCUGCAGCUUGCCACAUC 
UCUAGAUCUGCAGCUUGCCACAUCA 
CUAGAUCUGCAGCUUGCCACAUCAG 
UAGAUCUGCAGCUUGCCACAUCAGC 
GCUUGCCACAUCAGCUUAAAAUCUG 
CUUGCCACAUCAGCUUAAAAUCUGU 
UUGCCACAUCAGCUUAAAAUCUGUC 
UGCCACAUCAGCUUAAAAUCUGUCA 
GCCACAUCAGCUUAAAAUCUGUCAU 
CCACAUCAGCUUAAAAUCUGUCAUC 
CACAUCAGCUUAAAAUCUGUCAUCC 
ACAUCAGCUUAAAAUCUGUCAUCCC 
CAUCAGCUUAAAAUCUGUCAUCCCA 
AUCAGCUUAAAAUCUGUCAUCCCAU 
UCAGCUUAAAAUCUGUCAUCCCAUG . 
CAGCUUAAAAUCUGUCAUCCCAUGC 
AGCUUAAAAUCUGUCAUCCCAUGCA 
GCUUAAAAUCUGUCAUCCCAUGCAG 
CUUAAAAUCUGUCAUCCCAUGCAGA 
UUAAAAUCUGUCAUCCCAUGCAGAC 
UAAAAUCUGUCAUCCCAUGCAGACA 
UGUCAUCC CAUGC AGACAGGAAAAC 
GUCAUCCCAUGCAGACAGGAAAACA 
UCAUCCCAUGCAGACAGGAAAACAA 
CAUCCCAUGCAGACAGGAAAACAAU 
AUCCCAUGCAGACAGGAAAACAAUA 
UCCCAUGCAGACAGGAAAACAAUAU 
CCCAUGCAGACAGGAAAACAAUAUU 
CCAUGCAGACAGGAAAACAAUAUUG 
CAUGCAGACAGGAAAACAAUAUUGU 
AUGCAGACAGGAAAACAAUAUUGUA 
UGCAGACAGGAAAACAAUAUUGUAU 
AACAGACCACUUCCUGAGUAGAAGA 
ACAGACCACUUCCUGAGUAGAAGAG 
CAGACCACUUCCUGAGUAGAAGAGU 
GACCACUUCCUGAGUAGAAGAGUUU 



CTGCAGATCTAGAGGTTGTAGCAGA 
GCTGCAGATCTAGAGGTTGTAGCAG 
AGCTGCAGATCTAGAGGTTGTAGCA 
AAGCTGCAGATCTAGAGGTTGTAGC 
CAAGCTGCAGATCTAGAGGTTGTAG 
GCAAGCTGCAGATCTAGAGGTTGTA 
GGCAAGCTGCAGATCTAGAGGTTGT 
TGGCAAGCTGCAGATCTAGAGGTTG 
GTGGCAAGCTGCAGATCTAGAGGTT 
TGTGGCAAGCTGCAGATCTAGAGGT 
ATGTGGCAAGCTGCAGATCTAGAGG 
GATGTGGCAAGCTGCAGATCTAGAG 
TGATGTGGCAAGCTGCAGATCTAGA 
CTGATGTGGCAAGCTGCAGATCTAG 
GCTGATGTGGCAAGCTGCAGATCTA 
~ CAGATTTTAAGCTGATGTGGCAAGC 
ACAGATTTTAAGCTGATGTGGCAAG 
GACAGATTTTAAGCTGATGTGGCAA 
TGACAGATTTTAAGCTGATGTGGCA 
ATGACAGATTTTAAGCTGATGTGGC 
GATGACAGATTTTAAGCTGATGTGG 
GGATGACAGATTTTAAGCTGATGTG 
GGGATGACAGATTTTAAGCTGATGT 
TGGGATGACAGATTTTAAGCTGATG 
ATGGGATGACAGATTTTAAGCTGAT 
CATGGGATGACAGATTTTAAGCTGA 
GCATGGGATGACAGATTTTAAGCTG 
TGCATGGGATGACAGATTTTAAGCT 
CTGCATGGGATGACAGATTTTAAGC 
TCTGCATGGGATGACAGATTTTAAG 
GTCTGCATGGGATGACAGATTTTAA 
TGTCTGCATGGGATGACAGATTTTA 
GTTTTCCTGTCTGCATGGGATGACA 
TGTTTTCCTGTCTGCATGGGATGAC 
TTGTTTTCCTGTCTGCATGGGATGA 
ATTGTTTTCCTGTCTGCATGGGATG 
TATTGTTTTCCTGTCTGCATGGGAT 
ATATTGTTTTCCTGTCTGCATGGGA 
AATATTGTTTTCCTGTCTGCATGGG 
CAATATTGTTTTCCTGTCTGCATGG 
ACAATATTGTTTTCCTGTCTGCATG 
TACAATATTGTTTTCCTGTCTGCAT 
ATACAATATTGTTTTCCTGTCTGCA 
TCTTCTACTCAGGAAGTGGTCTGTT 
CTCTTCTACTCAGGAAGTGGTCTGT 
ACTCTTCTACTCAGGAAGTGGTCTG 
AAACTCTTCTACTCAGGAAGTGGTC 
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431 


ACCACUUCCUGAGUAGAAGAGUUUC 


2773 


GAAACTCTTCTACTCAGGAAGTGGT 


3377 


432 


CCACUUCCUGAGUAGAAGAGUUUCU 


2774 


AGAAACTCTTCTACTCAGGAAGTGG 


3378 


445 


AGAAGAGUUUCUUUGUGAAAAGGUC 


2775 


GACCTTTTCACAAAGAAACTCTTCT 


3379 


446 


GAAGAGUUUCUUUGUGAAAAGGUCA 


2776 . 


TGACCTTTTCACAAAGAAACTCTTC 


3380 


447 


AAGAGUUUCUUUGUGAAAAGGUCAA 


2777 


TTGACCTTTTCACAAAGAAACTCTT 


3381 


448 


AGAGUUUCUUUGUGAAAAGGUCAAG 


2778 


CTTGACCTTTTCACAAAGAAACTCT 


3382 


449 


GAGUUUCUUUGUGAAAAGGUCAAGA 


2779 


T CTTGACCTTTTCACAAAGAAACTC 


3383 


450 


AGUUUCUUUGUGAAAAGGUCAAGAU 


2780 


ATCTTGACCTTTTCACAAAGAAACT 


3384 


451 


GUUUCUUUGUGAAAAGGUCAAGAUU 


2781 


AATCTTGACCTTTTCACAAAGAAAC 


3385 


452 


UUUCUUUGUGAAAAGGUCAAGAUUA 


2782 


TAATCTTGACCTTTTCACAAAGAAA 


3386 


453 


UUCUUUGUGAAAAGGUCAAGAUUAA 


2783 


TTAATCTTGACCTTTTCACAAAGAA 


3387 


504 


AUUCAUCUGUUGGAUCUUGUAAACA 


2784 


TGTTTACAAGATCCAACAGATGAAT 


3388 


505 


UUCAUCUGUUGGAUCUUGUAAACAU 


2785 


ATGTTTACAAGATCCAACAGATGAA 


3389 


506 


UCAUCUGUUGGAUCUUGUAAACAUG 


2786 


CATGTTTACAAGATCCAACAGATGA 


3390 


507 


CAUCUGUUGGAUCUUGUAAACAUGA 


2787 


TCATGTTTACAAGATCCAACAGATG 


3391 


508 


AUCUGUUGGAUCUUGUAAACAUGAA 


2788 


TTCATGTTTACAAGATCCAACAGAT 


3392 


509 


UCUGUUGGAUCUUGUAAACAUGAAA 


2789 


TTTCATGTTTACAAGATCCAACAGA 


3393 


5X0 


CUGUUGGAUCUUGUAAACAUGAAAA 


2790 


TTTTCATGTTTACAAGATCCAACAG 


3394 


511 


UGUUGGAUCUUGUAAACAUGAAAAG 


2791 


CTTTTCATGTTTACAAGATCCAACA 


3395 


512 


GUUGGAUCUUGUAAACAUGAAAAGG 


2792 


CCTTTTCATGTTTACAAGATCCAAC 


3396 


513 


UUGGAUCUUGUAAACAUGAAAAGGG 


2793 


CCCTTTTCATGTTTACAAGATCCAA 


3397 


514 


UGGAUCUUGUAAACAUGAAAAGGGC 


2794 


GC CCTTTTCATGTTTACAAGATCCA 


3398 


515 


GGAUCUUGUAAACAUGAAAAGGGCU 


2795 


AGCCCTTTTCATGTTTACAAGATCC 


3399 


51S 


GAUCUUGUAAACAUGAAAAGGGCUU 


2796 


AAGCCCTTTTCATGTTTACAAGATC 


3400 


517 


AUCUUGUAAACAUGAAAAGGGCUUU 


2797 


AAAGCC CTTTTCATGTTTACAAGAT 


3401 


518 


UCUUGUAAACAUGAAAAGGGCUUUA 


2798 


TAAAGCCCTTTTCATGTTTACAAGA 


3 402 


519 


CUUGUAAACAUGAAAAGGGCUUUAU 


2799 


ATAAAGCCCTTTTCATGTTTACAAG 


3403 


520 


UUGUAAACAUGAAAAGGGCUUUAUU 


2800 


AATAAAGCCCTTTTCATGTTTACAA 


3404 


521 


UGUAAACAUGAAAAGGGCUUUAUUU 


2801 


AAATAAAGCCCTTTTCATGTTTACA 


3405 


522 


GUAAACAUGAAAAGGGCUUUAUUUU 


2802 


AAAATAAAGCCCTTTTCATGTTTAC 


3406 


531 


AAAAGGGCUUUAUUUUCAAAAAUUA 


2803 


TAATTTTTGAAAATAAAGCCCTTTT 


3407 


532 


AAAGGGCUUUAUUUUCAAAAAUUAA 


2804 


TTAATTTTTGAAAATAAAGCCCTTT 


3408 


533 


AAGGGCUUUAUUUUCAAAAAUUAAC 


2805 


GTTAATTTTTGAAAATAAAGCCCT.T 


3409 


534 


AGGGCUUUAUUUUCAAAAAUUAACU 


2806 


AGTTAATTTTTGAAAATAAAGCCCT 


3410 


535 


GGGCUUUAUUUUCAAAAAUUAACUU 


2807 


AAGTTAATTTTTGAAAATAAAGCCC 


3411 


570 


GUAUAAAAUGCAACUGUUGAUUUC C 


2808 


GGAAATCAACAGTTGCATTTTATAC 


3412 


571 


UAUAAAAUGCAACUGUUGAUUUCCU 


2809 


AGGAAATCAACAGTTGCATTTTATA 


3413 


572 


AUAAAAUGCAACUGUUGAUUUCCUC 


2810 


GAGGAAATCAACAGTTGCATTTTAT 


3414 


573 


UAAAAUGCAACUGUUGAUUUCCUCA 


2811 


TGAGGAAATCAACAGTTGCATTTTA 


3415 


574 


AAAAUGCAACUGUUGAUUUCCUCAA 


2812 


TTGAGGAAATCAACAGTTGCATTTT 


3416 


586 


UUGAUUUCCUCAACAUGGCUCACAA 


2813 


TTGTGAGCCATGTTGAGGAAATCAA 




5B7 


UGAUUUCCUCAACAUGGCUCACAAA 


2814 


TTTGTGAGCCATGTTGAGGAAATCA 




588 


GAUUUCCUCAACAUGGCUCACAAAU 


2815 


ATTTGTGAGCCATGTTGAGGAAATC 


3419 


589 


AUUUCCUCAACAUGGCUCACAAAUU 


2816 


AATTTGTGAGCCATGTTGAGGAAAT 


3420 


590 


UUUCCUCAACAUGGCUCACAAAUUU 


2817 


AAATTTGTGAGCCATGTTGAGGAAA 


3421 


591 


UUCCUCAACAUGGCUCACAAAUUUC 


2818 


GAAATTTGTGAGCCATGTTGAGGAA 


3422 


592 


UCCUCAACAUGGCUCACAAAUUUCU 


2819 


AGAAATTTGTGAGCCATGTTGAGGA 


3423 
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593 


CCUCAACAUGGCUCACAAAUUUCUA 


2820 


TAGAAATTTGTGAGCCATGTTGAGG 


3424 


594 


CUCAACAUGGCUCACAAAUUUCUAU 


2821 


ATAGAAATTTGTGAGCCATGTTGAG 


3425 


595 


UCAACAUGGCUCACAAAUUUCUAUC 


2822 


GATAGAAATTTGTGAGCCATGTTGA 


3426 


596 


CAACAUGGCUCACAAAUUUCUAUCC 


2823 


GGATAGAAATTTGTGAGCCATGTTG 


3427 


597 


AACAUGGCUCACAAAUUUCUAUCCC 


2824 


GGGATAGAAATT TGTGAGC CATGTT 


3428 


598 


ACAUGGCUCACAAAUUUCUAUC CCA 


2825 


TGGGATAGAAATTTGTGAGCCATGT 


3429 


599 


CAUGGCUCACAAAUUUCUAUCCCAA 


2826 


TTGGGATAGAAATTTGTGAGCCATG 


3430 




AUGGCUCACAAAUUUCUAUCCCAAA 


2827 


TTTGGGATAGAAATTTGTGAGCCAT 


3431 


601 


UGGCUCACAAAUUUCUAUCCCAAAU 


2828 


ATTTGGGATAGAAATTTGTGAGCCA 


3432 


602 


GGCUCACAAAUUUCUAUCCCAAAUC 


2829 


GATTTGGGATAGAAATTTGTGAGCC 


3433 




GCUCACAAAUUUCUAUCCCAAAUCU 


2830 


AGATTTGGGATAGAAATTTGTGAGC 


3434 


— — 


CUCACAAAUUUCUAUCCCAAAUCUU 


2831 


AAGATTTGGGATAGAAATTTGTGAG 


3435 


605 


UCACAAAUUUCUAUCCCAAAUCUUU 


2832 


AAAGATTTGGGATAGAAATTTGTGA 


3436 


606 


CACAAAUUUCUAUCCCAAAUCUUUU 


2833 


AAAAGATTTGGGATAGAAATTTGTG 


3437 




ACAAAUUUCUAUCCCAAAUCUUUUC 


2834 


GAAAAGATTTGGGATAGAAATTTGT 


3438 


608 


CAAAUUUCUAUCCCAAAUCUUUUCU 


2835 


AGAAAAGATTTGGGATAGAAATTTG 


3439 




AAAUUUCUAUCCCAAAUCUUUUCUG 


2836 


CAGAAAAGATTTGGGATAGAAATTT 


3440 





AAUUUCUAUCCCAAAUCUUUUCUGA 


2837 


TCAGAAAAGATTTGGGATAGAAATT 


3441 


— iii — 


AUUUCUAUCCCAAAUCUUUUCUGAA 


2838 


TTCAGAAAAGATTTGGGATAGAAAT 


3442 


612 


UUUCUAUCCCAAAUCUUUUCUGAAG 


2839 


CTTCAGAAAAGATTTGGGATAGAAA 


3443 


613 


UUCUAUCCCAAAUCUUUUCUGAAGA 


2840 


TCTTCAGAAAAGATTTGGGATAGAA 


3444 


644 


GUUUAGUUUUAAAACUGCACUGCCA 


2841 


TGGCAGTGCAGTTTTAAAACTAAAC 


3445 




UU1 J AGUUUUAAAACUG CACUGC C AA 


2842 


TTGGCAGTGCAGTTTTAAAACTAAA 


3446 


— iii — 


UUAGUUUUAAAACUGCACUGCCAAC 


2843 


GTTGGCAGTGCAGTTTTAAAACTAA 


3447 


647 


UAGUUUUAAAACUGCACUGCCAACA 


2844 


TGTTGGCAGTGCAGTTTTAAAACTA 


3448 


648 


AGUUUUAAAACUGCACUGCCAACAA 


2845 


TTGTTGGCAGTGCAGTTTTAAAACT 


3449 


649 


GUUUUAAAACUGCACUGCCAACAAG 


2846 


CTTG7TGGCAGTGCAGTTTTAAAAC 


3450 


650 


UUUUAAAACUGCACUGCCAACAAGU 


2847 


ACTTGTTGGCAGTGCAGTTTTAAAA 


3451 


651 


UUUAAAACUGCACUGCCAACAAGUU 


2848 


AACTTGTTGGCAGTGCAGTTTTAAA 


3452 


652 


UUAAAACUGCACUGCCAACAAGUUC 


2849 


GAACTTGTTGGCAGTGCAGTTTTAA 


3453 


653 


UAAAACUGCACUGCCAACAAGUUCA 


2850 


TGAACTTGTTGGCAGTGCAGTTTTA 


3454 


654 


AAAACUGCACUGCCAACAAGUUCAC 


2851 


GTGAACTTGTTGGCAGTGCAGTTTT 


3455 


655 


AAACUGCACUGCCAACAAGUUCACU 


2852 


AGTGAACTTGTTGGCAGTGCAGTTT 


3456 


656 


AACUGCACUGCCAACAAGUUCACUU 


2853 


AAGTGAACTTGTTGGCAGTGCAGTT 


3457 


657 


ACUGCACUGC CAAC AAGUUC ACUUC 


2854 


GAAGTGAACTTGTTGGCAGTGCAGT 


3458 




CUGCACUGCCAACAAGUUCACUUCA 


2855 


TGAAGTGAACTTGTTGGCAGTGCAG 


3459 


659 


UGCACUGCCAACAAGUUCACUUCAU 


2856 


ATGAAGTGAACTTGTTGGCAGTGCA 


3460 


660 


GCACUGCCAACAAGUUCACUUCAUA 


2857 


TATGAAGTGAACTTGTTGGCAGTGC 


3461 


661 


CACUGCCAACAAGUUCACUUCAUAU 


2858 


ATATGAAGTGAACTTGTTGGCAGTG 


3462 


662 


ACUGCCAACAAGUUCACUUCAUAUA 


2859 


TATATGAAGTGAACTTGTTGGCAGT 


3463 


663 


CUGCCAACAAGUUCACUUCAUAUAU 


2860 


ATATATGAAGTGAACTTGTTC-GCAG 


3464 


755 


UAAGUAUUUUUCAGGUCUUCACCAA 


2861 


TTGGTGAAGACCTGAAAAATACTTA 


3465 


756 


AAGUAUUUUUCAGGUCUUCACCAAG 


2862 


CTTGGTGAAGACCTGAAAAATACTT 


3466 


757 


AGUAUUUUUCAGGUCUUCACCAAGU 


2863 


ACTTGGTGAAGACCTGAAAAATACT 


3467 


760 


AUUUUUCAGGUCUUCACCAAGUAUC 


2864 


GATACTTGGTGAAGACCTGAAAAAT 


3468 




UUUUUCAGGUCUUCACCAAGUAUCA 


2865 


TGATACTTGGTGAAGACCTGAAAAA 


3469 


762 


UUUUCAGGUCUUCACCAAGUAUCAA 


2866 


TTGATACTTGGTGAAGACCTGAAAA 


3470 
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763 


UUUCAGGUCUUCACCAAGUAUCAAA 


2867 


TTTGATACTTGGTGAAGACCTGAAA 


3471 


764 


UUCAGGUCUUCACCAAGUAUCAAAG 


2868 


CTTTGATACTTGGTGAAGACCTGAA 


3472 


755 


UCAGGUCUUCACCAAGUAUCAAAGU 


2869 


ACTTTGATACTTGGTGAAGACCTGA 


3473 


756 


CAGGUCUUCAC CAAGUAUCAAAGUA 


2870 


TACTTTGATACTTGGTGAAGACCTG 


3474 


813 


AUUCAAAAUAGUCCACUGACUCCUC 


2871 


GAGGAGTCAGTGGACTATTTTGAAT 


3475 


814 


UUCAAAAUAGUCCACUGACUCCUCA 


2872 


TGAGGAGTCAGTGGACTATTTTGAA 


3476 


815 


UCAAAAUAGUCCACUGACUCCUCAC 


2873 


GTGAGGAGTCAGTGGACTATTTTGA 


3477 


816 


CAAAAUAGUCCACUGACUCCUCACA 


2874 


TGTGAGGAGTCAGTGGACTATTTTG 


3478 


817 


AAAAUAGUCCACUGACUCCUCACAU 


2875 


ATGTGAGGAGTCAGTGGACTATTTT 


3479 


818 


AAAUAGUCCACUGACUCCUCACAUC 


2875 


GATGTGAGGAGTCAGTGGACTATTT 


3480 


819 


AAUAGUCCACUGACUCCUCACAUCU 


2877 


AGATGTGAGGAGTCAGTGGACTATT 


3481 


820 


AUAGUCCACUGACUCCUCACAUCUG 


2878 


CAGATGTGAGGAGTCAGTGGACTAT 


3482 


821 


UAGUCCACUGACUCCUCACAUCUGU 


2879 


ACAGATGTGAGGAGTCAGTGGACTA 


3483 


822 


AGUCCACUGACUCCUCACAUCUGUU 


2880 


AACAGATGTGAGGAGTCAGTGGACT 


3484 


823 


GUCCACUGACUCCUCACAUCUGUUA 


2881 


TAACAGATGTGAGGAGTCAGTGGAC 


3485 


824 


UCCACUGACUCCUCACAUCUGUUAU 


2882 


ATAACAGATGTGAGGAGTCAGTGGA 


3486 


825 


CCACUGACUCCUCACAUCUGUUAUC 


2883 


G AT AACAGATGTGAGGAGT C AGTGG 


3487 


911 


UUUUUCUAUGCCACAUUAACAUCUU 


2884 


AAGATGTTAATGTGGCATAGAAAAA 


3488 


912 


UUUUCUAUGCCACAUUAACAUCUUU 


2885 


AAAGATGTTAATGTGGCATAGAAAA 


3489 


913 


UUUCUAUGCCACAUUAACAUCUUUU 


2886 


AAAAGATGTTAATGTGGCATAGAAA 


3490 


919 


UGCCACAUUAACAUCUUUUAAAGUU 


2887 


AACTTTAAAAGATGTTAATGTGGCA 


3491 


920 


GCCACAUUAACAUCUUUUAAAGUUG 


2888 


CAACTTTAAAAGATGTTAATGTGGC 


3492 


948 


AGAAUCAAGUAUGGAAAAGUAAG3C 


2889 


GCCTTACTTTTCCATACTTGATTCT 


3493 




GAAUCAAGUAUGGAAAAGUAAGGCC 


2890 


GGCCTTACTTTTCCATACTTGATTC 


3494 


950 


AAUCAAGUAUGGAAAAGUAAGGCCA 


2891 


TGGCCTTACTTTTCCATACTTGATT 


3495 


959 


UGGAAAAGUAAGGCCAUACUCUUAC 


2892 


GTAAGAGTATGGCCTTACTTTTCCA 


3496 


960 


GGAAAAGUAAGGC C AUACU CUUACA 


2893 


TGTAAGAGTATGGCCTTACTTTTCC 


3497 


1067 


CAUAUGAUCAACAGAUGAGAACUGG 


2894 


CCAGTTCTCATCTGTTGATCATATG 


3498 


1069 


UAUGAUCAACAGAUGAGAACUGGUG 


2895 


CACCAGTTCTCATCTGTTGATCATA 


3499 


1070 


AUGAUCAACAGAUGAGAACUGGUGG 


2 896 


CCACCAGTTCTCATCTGTTGATCAT 


3500 


1071 


UGAUCAACAGAUGAGAACUGGUGGU 


2897 


ACCACCAGTTCTCATCTGTTGATCA 


3501 


1072 


GAUCAACAGAUGAGAACUGGUGGUU 


2898 


AACCACCAGTTCTCATCTGTTGATC 


3502 


1073 


AUCAACAGAUGAGAACUGGUGGUUA 


2899 


TAACCACCAGTTCTCATCTGTTGAT 


3503 


1074 


UCAACAGAUGAGAACUGGUGGUUAA 


2900 


TTAACCACCAGTTCTCATCTGTTGA 


3504 


1075 


CAACAGAUGAGAACUGGUGGUUAAU 


2901 


ATTAACCACCAGTTCTCATCTGTTG 


3505 


1078 


CAGAUGAGAACUGGUGGUUAAUAUG 


2902 


CATATTAACCACCAGTTCTCATCTG 


3506 


1080 


GAUGAGAACUGGUGGUUAAUAUGUG 


2903 


CACATATTAACCACCAGTTCTCATC 


3507 


1081 


AUGAGAACUGGUGGUUAAUAUGUGA . 


2904 


TCACATATTAACCACCAGTTCTCAT 


3508 


1082 


UGAGAACUGGUGGUUAAUAUGUGAC 


2905 


GTCACATATTAACCACCAGTTCTCA 


3509 


1083 


GAGAACUGGUGGUUAAUAUGUGACA 


2906 


TGTCACATATTAACCACCAGTTCTC 


3510 


1086 


AACUGGUGGUUAAUAUGUGACAGUG 


2907 


CACTGTCACATATTAACCACCAGTT 




1087 


ACUGGUGGUUAAUAUGUGACAGUGA 


2908 


TCACTGTCACATATTAACCACCAGT 


3512 


1088 


CUGGUGGUUAAUAUGUGACAGUGAG 


2909 


CTCACTGTCACATATTAACCAC CAG 


3513 


1089 


UGGUGGUUAAUAUGUGACAGUGAGA 


2910 


TCTCACTGTCACATATTAACCACCA 


3514 


1141 


CAGAAUCUAAUCUUCAUUUAAGGCA 


2911 


TGCCTTAAATGAAGATTAGATTCTG 


3515 


1150 


AUCUUCAUUUAAGGCACUGUAGUGA 


2912 


TCACTACAGTGCCTTAAATGAAGAT 


3516 


1151 


UCUUCAUUUAAGGCACUGUAGUGAA 


2913 


TTCACTACAGTGCCTTAAATGAAGA 


3517 
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1153 


UUCAUUUAAGGCACUGUAGUGAAUU 


2914 


AATTCACTACAGTGCCTTAAATGAA 


3513 


1161 


AGGCACUGUAGUGAAUUAUCUGAGC 


2915 


GCTCAGATAATTCACTACAGTGCCT 


3519 


1162 


GGCACUGUAGUGAAUUAUCUGAGCU 


2916 


AGCTCAGATAATTCACTACAGTGCC 


3520 


1211 


UAUCUUUGGAAUCAUGAAACCUUAA 


2917 


TTAAGGTTTCATGATTCCAAAGATA 


3521 


1212 


AUCUUUGGAAUCAUGAAACCUUAAG 


2918 


CTTAAGGTTTCATGATTCCAAAGAT 


3522 


1213 


UCUUUGG AAUC AUGAAAC CUUAAGA 


2919 


TCTTAAGGTTTCATGATTCCAAAGA 


3523 


1214 


CUUUGGAAUCAUGAAACCUUAAGAC 


2920 


GTCTTAAGGTTTCATGATTCCAAAG 


3524 


1215 


UUUGGAAUCAUGAAAC CUUAAGACU 


2921 


AGTCTTAAGGTTTCATGATTCCAAA 


3525 


1216 


UUGGAAUCAUGAAACCUUAAGACUU 


2922 


AAGTCTTAAGGTTTCATGATTCCAA 


3526 


1217 


UGGAAUCAUGAAACCUUAAGACUUC 


2923 


GAAGTCTTAAGGTTTCATGATTCCA 


3527 


1218 


GGAAUC AUGAAAC CUUAAGACUUCA 


2924 


TGAAGTCTTAAGGTTTCATGATTCC 


3528 


1223 


CAUGAAACCUUAAGACUUCAGAAUG 


2925 


CATTCTGAAGTCTTAAGGTTTCATG 


3529 


1230 


CCUUAAGACUUCAGAAUGAUUUUGC 


2926 


GCAAAATCATTCTGAAGTCTTAAGG 


3530 


1231 


CUUAAGACUUCAGAAUGAUUUUGCA 


2927 


TGCAAAATCATTCTGAAGTCTTAAG 


3531 


1232 


UUAAGACUUCAGAAUGAUUUUGCAG 


2928 


CTGCAAAATCATTCTGAAGTCTTAA 


3532 


1233 


U AAGACUU CAGAAUGAUUUUGCAGG 


2929 


CCTGCAAAATCATTCTGAAGTCTTA 


3533 


1234 


AAGACUUCAGAAUGAUUUUGCAGGU. 


2930 


ACCTGCAAAATCATTCTGAAGTCTT 


3534 




AGACUUCAGAAUGAUUUUGCAGGUU 


2931 


AACCTGCAAAATCATTCTGAAGTCT 


3535 


1236 


GACUUCAGAAUGAUUUUGCAGGUUG 


2932 


CAACCTGCAAAATCATTCTGAAGTC 


3536 


1237 


ACUUCAGAAUGAUUUUGCAGGUUGU 


2933 


ACAACCTGCAAAATCATTCTGAAGT 


3537 


1238 


CUUCAGAAUGAUUUUGCAGGUUGUC 


2934 


GACAACCTGCAAAATCATTCTGAAG 


3538 


1239 


UUCAGAAUGAUUUUGCAGGUUGUCU 


2935 


AGACAACCTGCAAAATCATTCTGAA 


3539 


1240 


UCAGAAUGAUUUUGCAGGUUGUCUU 


2936 


AAGACAACCTGCAAAATCATTCTGA 


3540 


1241 


CAGAAUGAUUUUGCAGGUUGUCUUC 


2937 


GAAGACAACCTGCAAAATCATTCTG 


3541 




AGAAUGAUUUUGCAGGUUGUCUUCC 


2938 


GGAAGACAACCTGCAAAATCATTCT 


3542 


1243 


GAAUGAUUUUGCAGGUUGUCUUCCA 


2939 


TGGAAGACAACCTGCAAAAT CATTC 


3543 


1244 


AAUGAUUUUGCAGGUUGUCUUCCAU 


2940 


ATGGAAGACAACCTGCAAAATCATT 


3544 


1245 


AUGAUUUUGCAGGUUGUCUUCCAUU 


2941 


AATGGAAGACAACCTGCAAAATCAT 


3545 


1246 
1247 


UGAUUUUGCAGGUUGUCUUCCAUUC 


2942 


GAATGGAAGACAACCTGCAAAATCA 


3546 


GAUUUUGCAGGUUGUCUUCCAUUCC 


2943 


GGAATGGAAGACAACCTGCAAAATC 


3547 


1248 


AUUUUGCAGGUUGUCUUCCAUUCCA 


2944 


TGGAATGGAAGACAACCTGCAAAAT 


3548 


1249 


UUUUGCAGGUUGUCUUCCAUUCCAG 


2945 


CTGGAATGGAAGACAACCTGCAAAA 


3549 


1250 


UUUGCAGGUUGUCUUCCAUUCCAGC 


2946 


GCTGGAATGGAAGACAACCTGCAAA 


3550 


1251 


UUGCAGGUUGUCUUCCAUUCCAGCC 


2947 


GGCTGGAATGGAAGACAACCTGCAA 


3551 


1252 


UGCAGGUUGUCUUCCAUUCCAGCCU 


2948 


AGGCTGGAATGGAAGACAACCTGCA 


3552 


1253 


GCAGGUUGUCUUC CAUUCCAGCCUA 


2949 


TAGGCTGGAATGGAAGACAACCTGC 


■ 3553 


1254 


CAGGUUGUCUUCCAUUCCAGCCUAA 


2950 


TTAGGCTGGAATGGAAGACAACCTG 


3554 


1255 


AGGUUGUCUUCCAUUCCAGCCUAAC 


2951 


GTTAGGCTGGAATGGAAGACAACCT 


3555 ■ 


1256 


GGUUGUCUUC CAUUCCAGCCUAACA 


2952 


TGTTAGGCTGGAATGGAAGACAACC 


3556 


1257 


GUUGUCUUCCAUUCCAGCCUAACAU 


2953 


ATGTTAGGCTGGAATGGAAGACAAC 


3557 


1258 


UUGUCUUCCAUUCCAGCCUAACAUC 


2954 


GATGTTAGGCTGGAATGGAAGACAA 




1259 


UGUCUUCCAUUCCAGCCUAACAUCC 


2955 


GGATGTTAGGCTGGAATGGAAGACA 


3559 


1260 


GUCUUCCAUUCCAGCCUAACAUCCA 


2956 


TGGATGTTAGGCTGGAATGGAAGAC 


3560 


1261 


UCUUCCAUUCCAGCCUAACAUCCAA 


2957 


TTGGATGTTAGGCTGGAATGGAAGA 


3561 


1262 


CUUCCAUUCCAGCCUAACAUCCAAU 


2958 


ATTGGATGTTAGGCTGGAATGGAAG 


3562 


1263 


UUCCAUUCCAGCCUAACAUCCAAUG 


2959 


CATTGGATGTTAGGCTGGAATGGAA 


3563 


1264 


UCCAUUCCAGCCUAACAUCCAAUGC 


2960 


GCATTGGATGTTAGGCTGGAATGGA 


3564 
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CCAUUCCAGCCUAACAUCCAAUGCA 



TGCATTGGATGTTAGGCTGGAATGG 



CAUUCCAGCCUAACAUCCAAUGCAG 



CTGCATTGGATGTTAGGCTGGAATG 



AUUCCAGCCUAACAUC CAAUGCAGG 



CCTGCATTGGATGTTAGGCTGGAAT 



CCUAACAUCCAAUGCAGGCAAGGAA 



TTCCTTGCCTGCATTGGATGTTAGG 



CUAACAUCCAAUGCAGGCAAGGAAA 



TTTCCTTGCCTGCATTGGATGTTAG 



UAACAUCCAAUGCAGGCAAGGAAAA 



TTTTCCTTGCCTGCATTGGATGTTA 



AACAUCCAAUGCAGGCAAGGAAAAU 



ATTTTCCTTGCCTGCATTGGATGTT 



ACAUCCAAUGCAGGCAAGGAAAAUA 



TATTTTCCTTGCCTGCATTGGATGT 



CAUCCAAUGCAGGCAAGGAAAAUAA 



TTATTTTCCTTGCCTGCATTGGATG 



AUCCAAUGCAGGCAAGGAAAAUAAA 



TTTATTTTCCTTGCCTGCATTGGAT • 



UCCAAUGCAGGCAAGGAAAAUAAAA 



TTTTATTTTCCTTGCCTGCATTGGA 



CCAAUGCAGGCAAGGAAAAUAAAAG 



CTTTTATTTTCCTTGCCTGCATTGG 



CAAUGCAGGCAAGGAAAAUAAAAGA 



TCTTTTATTTTCCTTGCCTGCATTG 



AAUGCAGGCAAGGAAAAUAAAAGAU 



ATCTTTTATTTTCCTTGCCTGCATT 



AUGCAGGCAAGGAAAAUAAAAGAUU 



AATCTTTTATTTTCCTTGCCTGCAT 



UGCAGGCAAGGAAAAUAAAAGAUUU 



AAATCTTTTATTTTCCTTGCCTGCA 



GCAGGCAAGGAAAAUAAAAGAUUUC 



GAAATCTTTTATTTTCCTTGCCTGC 



UAAAAGAUUUCCAGUGACAGAAAAA 



TTTTTCTGTCACTGGAAATCTTTTA 



AAAAGAUUUCCAGUGACAGAAAAAU 



ATTTTTCTGTCACTGGAAATCTTTT 



UAUAUUUUGAAAUGAACUUGUUGGC 



GCCAACAAGTTCATTTCAAAATATA 



AUAUUUUGAAAUGAACUUGUUGGCC 



GGCCAACAAGTTCATTTCAAAATAT 



UAUUUUGAAAUGAACUUGUUGGCCC 



GGGCCAACAAGTTCATTTCAAAATA 



AUUUUGAAAUGAACUUGUUGGCCCA 



TGGGCCAACAAGTTCATTTCAAAAT 



LTUUUGAAAUGAACUUGUUGGCCCAU 



ATGGGCCAACAAGTTCATTTCAAAA 



UUUGAAAUGAACUUGUUGGCCCAUC 



GATGGGCCAACAAGTTCATTTCAAA 



UUGAAAUGAACUUGUUGGCCCAUCU 



AGATGGGCCAACAAGTTCATTTCAA 



UGAAAUGAACUUGUUGGCCCAUCUA 



TAGATGGGCCAACAAGTTCATTTCA 



GAAAUGAACUUGUUGGCCCAUCUAU 



ATAGATGGGCCAACAAGTTCATTTC 



AAAUGAACUUGUUGGCCCAUCUAUU 



AATAGATGGGCCAACAAGTTCATTT 



AAUGAACUUGUUGGCCCAUCUAUUA 



TAATAGATGGGCCAACAAGTTCATT 



AUGAACUUGUUGGCCCAUCUAUUAC 



GTAATAGATGGGCCAACAAGTTCAT 



UGAACUUGUUGGCCCAUCUAUUACA 



TGTAATAGATGGGCCAACAAGTTCA 



GAACUUGUUGGCCCAUCUAUUACAU 



ATGTAATAGATGGGCCAACAAGTTC 



AACUUGUUGGCCCAUCUAUUACAUC 



GATGTAATAGATGGGCCAACAAGTT 



ACUUGUUGGCCCAUCUAUUACAUCU 



AGATGTAATAGATGGGCCAACAAGT 



CUUGUUGGCCCAUCUAUUACAUCUA 



TAGATGTAATAGATGGGCCAACAAG 



UUGUUGGCCCAUCUAUUACAUCUAC 



GTAGATGTAATAGATGGGCCAACAA 



UGUUGGCCCAUCUAUUACAUCUACA 



TGTAGATGTAATAGATGGGCCAACA 



GUUGGCCCAUCUAUUACAUCUACAG 



CTGTAGATGTAATAGATGGGCCAAC 



UUGGCCCAUCUAUUACAUCUACAGC 



GCTGTAGATGTAATAGATGGGCCAA 



UGGCCCAUCUAUUACAUCUACAGCU 



AGCTGTAGATGTAATAGATGGGCCA 



GGCCCAUCUAUUACAUCUACAGCUG 



CAGCTGTAGATGTAATAGATGGGCC 



GCCCAUCUAUUACAUCUACAGCUGA 



TCAGCTGTAGATGTAATAGATGGGC 



CUAUUACAUCUACAGCUGACCCUUG 



CAAGGGTCAGCTGTAGATGTAATAG 



UAUUACAUCUACAGCUGACCCUUGA 



TCAAGGGTCAGCTGTAGATGTAATA 



AUUACAUCUACAGCUGACCCUUGAA 



TTCAAGGGTCAGCTGTAGATGTAAT 



UUACAUCUACAGCUGACCCUUGAAC 



GTTCAAGGGTCAGCTGTAGATGTAA 
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1426 


UACAUCUACAGCUGACCCUUGAACA 


3008 


TGTTCAAGGGTCAGCTGTAGATGTA 


3612 


1427 


ACAUCUACAGCUGACCCUUGAACAU 


3009 


ATGTTCAAGGGTCAGCTGTAGATGT 


3613 


1428 


CAUCUACAGCUGACCCUUGAACAUG 


3010 


CATGTTCAAGGGTCAGCTGTAGATG 


3614 


1429 


AUCUACAGCUGACCCUUGAACAUGG 


3011 


CCATGTTCAAGGGTCAGCTGTAGAT 


3615 


1442 


CCUUGAACAUGGGGGUUAGGGGAGC 


3012 


GCTCCCCTAACCCCCATGTTCAAGG 


3616 


1443 


CUUGAACAUGGGGGUUAGGGGAGCU 


3013 


AGCTCCCCTAACCCCCATGTTCAAG 


3617 


1444 


UUGAACAUGGGGGUUAGGGGAGCUG 


3014 


CAGCTGCCCTAACCCCCATGTTCAA 


3618 


1445 


UGAACAUGGGGGUUAGGGGAGCUGA 


3015 


TC AGCTCC CCTAAC C CC CATGT TCA 


3619 


1446 


GAACAUGGGGGUUAGGGGAGCUGAC 


3016 


GTCAGCTCCCCTAAC CCCCATGTTC 


3620 


1447 


AACAUGGGGGUUAGGGGAGCUGACA 


3017 


TGTCAGCTCCCCTAACCCCCATGTT 


3621 


1448 


ACAUGGGGGUUAGGGGAGCUGACAA 


3018 


TTGTCAGCTCCCCTAACCCCCATGT 


3622 


1449 


CAUGGGGGUUAGGGGAGCUGACAAU 


3019 


ATTGTCAGCTCCCCTAACCCCCATG 


3623 


1450 


AUGGGGGUUAGGGGAGCUGACAAUU 


3020 


AATTGTCAGCTCCCCTAACCCCCAT 


3624 


1451 


UGGGGGUUAGGGGAGCUGACAAUUC 


3021 


GAATTGTCAGCTCCCCTAACCCCCA 


3625 


1452 


GGGGGUUAGGGGAGCUGACAAUUCG 


3022 


CGAATTGTCAGCTCCCCTAACCCCC 


3626 


1453 


GGGGUUAGGGGAGCUGACAAUUCGU 


3023 


ACGAATTGTCAGCTCCCCTAACCCC 


3627 


1454 


GGGUUAGGGGAGCUGACAAUUCGUG 


3024 


CACGAATTGTCAGCTCCCCTAACCC 


3628 


1455 


GGUUAGGGGAGCUGACAAUUCGUGG 


3025 


CCACGAATTGTCAGCTCCCCTAACC 


3629 


1456 


GUUAGGGGAGCUGACAAUUCGUGGG 


3026 


CCCACGAATTGTCAGCTCCCCTAAC 


3630 


1457 


UUAGGGGAGCUGACAAUUCGUGGGU 


3027 


ACCCACGAATTGTCAGCTCCCCTAA 


3631 


1458 


UAGGGGAGCUGACAAUUCGUGGGUC 


3028 


GACCCACGAATTGTCAGCTCCCCTA 


3632 


. 1459 


AGGGGAGCUGACAAUUCGUGGGUCC 


3029 


GGACCCACGAATTGTCAGCTCCCCT 


3633 


1460 


GGGGAGCUGACAAUUCGUGGGUCCG 


3030 


CGGACCCACGAATTGTCAGCTCCCC 


3634 


1462 


GGAGCUGACAAUUCGUGGGUCCGCA 


3031 


TGCGGACCCACGAATTGTCAGCTCC 


3635 


14 63 


GAGCUGACAAUUCGUGGGUCCGCAA 


3032 


TTGCGGACCCACGAATTGTCAGCTC 


3636 


1464 


AGCUGACAAUUCGUGGGUCCGCAAA 


3033 


TTTGCGGACCCACGAATTGTCAGCT 


3637 


1465 


GCUGACAAUUCGUGGGUCCGCAAAA 


3034 


TTTTGCGGACCCACGAATTGTCAGC 


3638 


1466 


CUGACAAUUCGUGGGUCCGCAAAAU 


3035 


ATTTTGCGGACCCACGAATTGTCAG 


3639 


1467 


UGACAAUUCGUGGGUCCGCAAAAUC 


3036 


GATTTTGCGGACC CACGAATTGTCA 


3640 


1468 


GACAAUUCGUGGGUCCGCAAAAUCU 


3037 


AGATTTTGCGGACCCACGAATTGTC 


3641 


1469 


ACAAUUCGUGGGUCCGCAAAAUCUU 


3038 


AAGATTTTGCGGACCCACGAATTGT 


3642 


1470 


CAAUUCGUGGGUCCGCAAAAUCUUA 


3039 


TAAGATTTTGCGGACCCACGAATTG 


3643 


1471 


AAUUCGUGGGUCCGCAAAAUCUUAA 


3040 


TTAAGATTTTGCGGACCCACGAATT 


3644 


1472 


AUUCGUGGGUCCGCAAAAUCUUAAC 


3041 


GTTAAGATTTTGCGGACCCACGAAT 


3645 


1473 


UUCGUGGGUCCGCAAAAUCUUAACU 


3042 


AGTTAAGATTTTGCGGACCCACGAA 


3646 


1474 


UCGUGGGUCCGCAAAAUCUUAACUA 


3043 


TAGTTAAGATTTTGCGGACCCACGA 


3647 


1475 


CGUGGGUCCGCAAAAUCUUAACUAC 


3044 


GTAGTTAAGATTTTGCGGACCCACG 




1476 


GUGGGUCCGCAAAAUCUUAACUACC 


3045 


GGTAGTTAAGATTTTGCGGACCCAC 


3649 


1477 


UGGGUCCGCAAAAUCUUAACUACCU 


3046 


AGGTAGTTAAGATTTTGCGGACCCA 


3650 


1478 


GGGUCCGCAAAAUCUUAACUACCUA 


3047 


TAGGTAGTTAAGATTTTGCGGACCC 


3651 


1479 


GGUCCGCAAAAUCUUAACUACCUAA 


3048 


TTAGGTAGTTAAGATTTTGCGGACC 


3652 


1480 


GUCCGCAAAAUCUUAACUACCUAAU 


3049 


ATTAGGTAGTTAAGATTTTGCGGAC 


3653 


1481 


UCCGCAAAAUCUUAACUACCUAAUA 


3050 


TATTAGGTAGTTAAGATTTTGCGGA 


3654 



Input Sequence = PLN 
Oligo Length = 25 

PLN (Homo sapiens phospholamban (PLN) mRNA.; 1635 bp) 
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Table 32 

Table 32: RNA Cleavage by NCH-XYLO Ribozyme 
(Reaction: 50 mM TRIS-Cl pH 7.5, 10 mM Mg 2+ , 37°C, 500 nM FINAL [Rz], Trace 
Substrate) 



DATA SUMMARY 



NCH-Xylo Ribozyme (RPI 
No.) 


TARGET TRIPLET 5'- 

NCX-3' 


k ob S (min.' 1 ) 
guua + A15.1 = xylo 


14827 


5'-GCA-3' 


1.649 


14828 


5'-ACA-3' 


0.293 


14829 


5'-UCA-3' 


0.272 


14830 


5'-CCA-3' 


0.214 
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Table 34 



Table 34: Anti-HER2 NCH Ribozyme and Target Sequences 



nt. 


NCH Substrate 


Seq. ID 


NCH Ribozyme Sequence 


Seq. ID 


Position 


Sequence 


Nos. 




Nos. 


14 


— — — 

JIGGUAAC 




GGGCCAG CUGAUGAG X CGAA IUUACCU 




15 


GGUAACC C UGGCCCC 




GGGGCCA CUGAUGAG X CGAA IGUUACC 




16 


GUAACCC U GGCCCCU 




AGGGGCC CUGAUGAG X CGAA IGGUUAC 




20 


CCCUGGC C CCUUUGG^ 




CCAAAGG CUGAUGAG X CGAA ICCAGGG 




21 


CCUGGCC C CUUUG U 




ACCAAAG CUGAUGAG X CGAA IGCCAGG 




22 


CUGGCCC C UUUGGUC 




GACCAAA CUGAUGAG X CGAA IGGCCAG 




23 


UGGCCCC U UUGGUCG 




CGACCAA CUGAUGAG X CGAA IGGGCCA 




35 


UCGGGGC C CCGGGCA 




UGCCCGG CUGAUGAG X CGAA ICCCCGA 




36 


CGGGGCC C CGGGCAG 




CUGCCCG CUGAUGAG X CGAA IGCCCCG 





37 


GGGGCCC C GGGCAGC 




GCUGCCC CUGAUGAG X CGAA IGGCCCC 




42 


CCCGGGC A GCCGCGC 




GCGCGGC CUGAUGAG X CGAA ICCCGGG 




45 


GGGCAGC C GCGCGCC 




GGCGCGC CUGAUGAG X CGAA ICUGCCC 




52 


CGCGCGC C CCUUCCC 




GGGAAGG CUGAUGAG X CGAA ICGCGCG 




53 


GCGCGCC C CUUCCCA 




UGGGAAG CUGAUGAG X CGAA IGCGCGC 




54 


CGCGCCC C UUCCCAC 




GUGGGAA CUGAUGAG X CGAA IGGCGCG 




55 


GCGCCCC U UCCCACG 




CGUGGGA CUGAUGAG X CGAA IGGGCGC 




58 


CCCCUUC C CACGGGG 




CCCCGUG CUGAUGAG X CGAA IAAGGGG 


! 


59 


CCCUUCC C ACGGGGC 




GCCCCGU CUGAUGAG X CGAA IGAAGGG 




60 


CCUUCCC A CGGGGCC 




GGCCCCG CUGAUGAG X CGAA IGGAAGG 




67 


ACGGGGC C CUUUACU 




AGUAAAG CUGAUGAG X CGAA ICCCCGU 




68 


CGGGGCC C UUUACUG 




CAGUAAA CUGAUGAG X CGAA IGCCCCG 




69 


GGGGCCC U UUACUGC 




GCAGUAA CUGAUGAG X CGAA IGGCCCC 




74 


CCUUUAC U GCGCCGC 




GCGGCGC CUGAUGAG X CGAA IUAAAGG 




79 


ACUGCGC C GCGCGCC 




GGCGCGC CUGAUGAG X CGAA ICGCAGU 




86 


CGCGCGC C CGGCCCC 




GGGGCCG CUGAUGAG X CGAA ICGCGCG 




87 


GCGCGCC C GGCCCCC 




GGGGGCC CUGAUGAG X. CGAA IGCGCGC 




91 


GCCCGGC C CCCACCC 




GGGUGGG CUGAUGAG X CGAA ICCGGGC 




92 


CCCGGCC C CCACCCC 




GGGGUGG CUGAUGAG X CGAA IGCCGGG 




93 


CCGGCCC C CftCCCCU 




AGGGGUG CUGAUGAG X CGAA IGGCCGG 




94 


CGGCCCC C A 




GAGGGGU CUGAUGAG X CGAA IGGGCCG 




95 


GGCCCCC A CCCCUCG 




CGAGGGG CUGAUGAG X CGAA IGGGGCC 










UGCGAGG CUGAUGAG X CGAA IUGGGGG 




98 


CCCCACC C CUCGCAG 




CUGCGAG CUGAUGAG X CGAA IGUGGGG 




99 


CCCACCC C UCGCAGC 




GCUGCGA CUGAUGAG X CGAA IGGUGGG 




100 


CCACCCC U CGCAGCA 




UGCUGCG CUGAUGAG X CGAA IGGGUGG 




104 


CCCUCGC A GCACCCC 




GGGGUGC CUGAUGAG X CGAA ICGAGGG 




107 


UCGCAGC A CCCCGCG 




CGCGGGG CUGAUGAG X CGAA ICUGCGA 




109 


GCAGCAC C CCGCGCC 




GGCGCGG CUGAUGAG X CGAA IUGCUGC 




110 


CAGCACC C CGCGCCC 




GGGCGCG CUGAUGAG X CGAA IGUGCUG 




111 


AGCACCC C GCGCCCC 




GGGGCGC CUGAUGAG X CGAA IGGUGCU 




116 


CCCGCGC C CCGCGCC 




GGCGCGG CUGAUGAG X CGAA ICGCGGG 




117 


CCGCGCC C CGCGCCC 




GGGCGCG CUGAUGAG X CGAA IGCGCGG 
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CGCGCCC C GCGCCCU 



CCCGCGC C CUCCCAG 



CCGCGCC C UCCCAGC 



AGGGCGC CUGAUGAG X CGAA IGGCGCG 



CUGGGAG CUGAUGAG X CGAA ICGCGGG 



GCUGGGA CUGAUGAG X CGAA IGCGCGG 



CGCGCCC U CCCAGCC 



CGCCCUC C CAGCCGG 



GGCUGGG CUGAUGAG X CGAA IGGCGCG 



CCGGCUG CUGAUGAG X CGAA IAGGGCG 



GCCCUCC C AGCCGGG 



CCCGGCU CUGAUGAG X CGAA IGAGGGC 



CCCUCCC A GCCGGGU 



ACCCGGC CUGAUGAG X CGAA IGGAGGG 



UCCCAGC C GGGUCCA 



CCGGGUC C AGCCGGA 
CGGGUCC A GCCGGAG 



UGGACCC CUGAUGAG X CGAA ICUGGGA 



UCCGGCU CUGAUGAG X CGAA IACCCGG 



CUCCGGC CUGAUGAG X CGAA IGACCCG 



GUCCAGC C GGAGCCA 



UGGCUCC CUGAUGAG X CGAA ICUGGAC 



CCGGAGC C AUGGGGC 



CGGAGCC A UGGGGCC 



AUGGGGC C GGAGCCG 



CCGGAGC C GCAGUGA 



GCCCCAU CUGAUGAG X CGAA ICUCCGG 



GGCCCCA CUGAUGAG X CGAA IGCUCCG 



CGGCUCC CUGAUGAG X CGAA ICCCCAU 



UCACUGC CUGAUGAG X CGAA ICUCCGG 



GAGCCGC A GUGAGCA 



AGUGAGC A CCAUGGA 



UGAGCAC C AUGGAGC 



GAGCACC A UGGAGCU 



UGCUCAC CUGAUGAG X CGAA ICGGCUC 



UCCAUGG CUGAUGAG X CGAA ICUCACU 



GCUCCAU CUGAUGAG X CGAA IUGCUCA 



AGCUCCA CUGAUGAG X CGAA IGUGCUC 



AUGGAGC U GGCGGCC 



UGGCGGC C UUGUGCC 



GGCGGCC U UGUGCCG 



GGCCGCC CUGAUGAG X CGAA ICUCCAU 



GGCACAA CUGAUGAG X CGAA ICCGCCA 



CGGCACA CUGAUGAG X CGAA IGCCGCC 



CUUC-UGC C GCUGGGG 



CCCCAGC CUGAUGAG X CGAA ICACAAG 



GUGCCGC U GGGGGCU 



AGCCCCC CUGAUGAG X CGAA ICGGCAC 



UGGGGGC U CCUCCUC 



GGGGCUC C UCCUCGC 



GGGCUCC U CCUCGCC 



GAGGAGG CUGAUGAG X CGAA ICCCCCA 



GCGAGGA CUGAUGAG X CGAA IAGCCCC 



GGCGAGG CUGAUGAG X CGAA IGAGCCC 



GCUCCUC C UCGCCCU 



AGGGCGA CUGAUGAG X CGAA IAGGAGC 



CUCCUCC I 
UCCUCGC C 



: c ucuugcc 
: u cuugccc 



GAGGGCG CUGAUGAG X CGAA IGAGGAG 



GCAAGAG CUGAUGAG X CGAA ICGAGGA 



GGCAAGA CUGAUGAG X CGAA IGCGAGG 



GGGCAAG CUGAUGAG X CGAA IGGCGAG 



CGCCCUC U UGCCCCC 



: C CCCCGGA 
: C CCCGGAG 



GGGGGCA CUGAUGAG X CGAA IAGGGCG 



UCCGGGG CUGAUGAG X CGAA ICAAGAG 



CUCCGGG CUGAUGAG X CGAA IGCAAGA 



CUUGCCC C CCGGAGC 



UUGCCCC C CGGAGCC 



GCUCCGG CUGAUGAG X CGAA IGGCAAG 



GGCUCCG CUGAUGAG X CGAA IGGGCAA 



UGCCCCC C GGAGCCG 



CGGCUCC CUGAUGAG X CGAA IGGGGCA 



CCGGAGC C GCGAGCA 



CGCGAGC A CCCAAGU 



CGAGCAC C CAAGUGU 



UGCUCGC CUGAUGAG X CGAA ICUCCGG 



ACUUGGG CUGAUGAG X CGAA ICUCGCG 



ACACUUG CUGAUGAG X CGAA IUGCUCG 



GAGCACC C AAGUGUG 



AGCACCC A AGUGUGC 



AGUGUGC A CCGGCAC 



CACACUU CUGAUGAG X CGAA IGUGCUC 



GCACACU CUGAUGAG X CGAA IGGUGCU 



GUGCCGG CUGAUGAG X CGAA ICACACU 



UGUGCAC C GGCACAG 



CUGUGCC CUGAUGAG X CGAA IUGCACA 



CACCGGC A CAGACAU 



CCGGCAC A GACAUGA 



AUGUCUG CUGAUGAG X CGAA ICCGGUG 



UCAUGUC CUGAUGAG X CGAA IUGCCGG 
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CACAGAC A UGAAGCU 



AGCUUCA CUGAUGAG X CGAA IUCUGUG 



AUGAAGC U GCGGCUC 



GAGCCGC CUGAUGAG X CGAA ICUUCAU 



CUGCGGC U CCCUGCC 



GGCAGGG CUGAUGAG X CGAA ICCGCAG 



GCGGCUC C CUGCCAG 



CUGGCAG CUGAUGAG X CGAA IAGCCGC 



CGGCUCC C UGCCAGU 



ACUGGCA CUGAUGAG X CGAA IGAGCCG 



GGCUCCC U GCCAGUC 



GACUGGC CUGAUGAG X CGAA IGGAGCC 



UCCCUGC C AGUCCCG 



CGGGACU CUGAUGAG X CGAA ICAGGGA 



CCCUGCC A GUCCCGA 



UCGGGAC CUGAUGAG X CGAA IGCAGGG 



GCCAGUC C CGAGACC 



GGUCUCG CUGAUGAG X CGAA IACUGGC 



CCAGUCC C GAGACCC 



GGGUCUC CUGAUGAG X CGAA IGACUGG 



CCGAGAC C CACCUGG 



CCAGGUG CUGAUGAG X CGAA IUCUCGG 



CGAGACC C ACCUGGA 



UCCAGGU CUGAUGAG X CGAA IGUCUCG 



GAGACCC A CCUGGAC 



GACCCAC C UGGACAU 



GUCCAGG CUGAUGAG X CGAA IGGUCUC 
AUGUCCA CUGAUGAG X CGAA IUGGGUC 



ACCCACC U GGACAUG 



CCUGGAC A UGCUCCG 



CAUGUCC CUGAUGAG X CGAA IGUGGGU 



CGGAGCA CUGAUGAG X CGAA IUCCAGG 



GACAUGC U CCGCCAC 



GUGGCGG CUGAUGAG X CGAA ICAUGUC 



CAUGCUC C GCCACCU 



AGGUGGC CUGAUGAG X CGAA IAGCAUG 



GCUCCGC C ACCUCUA 



UAGAGGU CUGAUGAG X CGAA I CGGAGC 



CUCCGCC A CCUCUAC 



GUAGAGG CUGAUGAG X CGAA IGCGGAG 



CCGCCAC C UCUACCA 



UGGUAGA CUGAUGAG X CGAA IUGGCGG 



CGCCACC U CUACCAG 



CCACCUC U ACCAGGG 



CUGGUAG CUGAUGAG X CGAA IGUGGCG 



CCCUGGU CUGAUGAG X CGAA IAGGUGG 



CCUCUAC C AGGGCUG 



CAGCCCU CUGAUGAG X CGAA IUAGAGG 



CUCUACC A GGGCUGC 



GCAGCCC CUGAUGAG X CGAA IGUAGAG 



CCAGGGC U GCCAGGU 



ACCUGGC CUGAUGAG X CGAA ICCCUGG 



GGGCUGC C AGGUGGU 



ACCACCU CUGAUGAG X CGAA ICAGCCC 



GGCUGCC A GGUGGUG 



CACCACC CUGAUGAG X CGAA IGCAGCC 



GUGGUGC A GGGAAAC 



GUUUCCC CUGAUGAG X CGAA ICACCAC 



GGGAAAC C UGGAACU 



AGUUCCA CUGAUGAG X CGAA IUUUCCC 



GGAAACC U GGAACUC 



GAGUUCC CUGAUGAG X CGAA IGUUUCC 



CUGGAAC U CACCUAC 



GUAGGUG CUGAUGAG X CGAA IUUCCAG 



GGAACUC A CCUACCU 



AGGUAGG CUGAUGAG X CGAA IAGUUCC 



AACUCAC C UACCUGC 



GCAGGUA CUGAUGAG X CGAA IUGAGUU 



ACUCACC U ACCUGCC 



CACCUAC C UGCCCAC 



ACCUACC U GCCCACC 



GGCAGGU CUGAUGAG X CGAA IGUGAGU 



GUGGGCA CUGAUGAG X CGAA IUAGGUG 



GGUGGGC CUGAUGAG X CGAA IGUAGGU 



UACCUGC C CACCAAU 



AUUGGUG CUGAUGAG X CGAA ICAGGUA 



ACCUGCC C ACCAAUG 



CAUUGGU CUGAUGAG X CGAA IGCAGGU 



CCUGCCC A CCAAUGC 



UGCCCAC C AAUGCCA 



GCAUUGG CUGAUGAG X CGAA IGGCAGG 



UGGCAUU CUGAUGAG X CGAA IUGGGCA 



GCCCACC A AUGCCAG 



CCAAUGC C AGCCUGU 



CUGGCAU CUGAUGAG X CGAA IGUGGGC 



ACAGGCU CUGAUGAG X CGAA ICAUUGG 



CAAUGCC A GCCUGUC 



GACAGGC CUGAUGAG X CGAA IGCAUUG 



UGCCAGC C UGUCCUU 



AAGGACA CUGAUGAG X CGAA ICUGGCA 



GCCAGCC U GUCCUUC 



GAAGGAC CUGAUGAG X CGAA IGCUGGC 



GCCUGUC C UUCCUGC 



GCAGGAA CUGAUGAG X CGAA IACAGGC 
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CCUGUCC U UCCUGCA 



UGCAGGA CUGAUGAG X CGAA IGACAGG 



UCCUGCA CUGAUGAG X CGAA IAAGGAC 



UCCUUCC U GCAGGAU 



AUCCUGC CUGAUGAG X CGAA IGAAGGA 



UUCCUGC A GGAUAUC 



GAUAUCC CUGAUGAG X CGAA ICAGGAA 



GGAUAUC C AGGAGGU 



ACCUCCU CUGAUGAG X CGAA IAUAUCC 



GAUAUCC A GGAGGUG 



CACCUCC CUGAUGAG X CGAA IGAUAUC 



GAGGUGC A GGGCUAC 



GUAGCCC CUGAUGAG X CGAA ICACCUC 



GCAGGGC U ACGUGCU 



AGCACGU CUGAUGAG X CGAA ICCCUGC 



UACGUGC U CAUCGCU 



AGCGAUG CUGAUGAG X CGAA ICACGUA 



CGUGCUC A UCGCUCA 



UGAGCGA CUGAUGAG X CGAA IAGCACG 



UCAUCGC U CACAACC 



GGUUGUG CUGAUGAG X CGAA ICGAUGA 



AUCGCUC A CAACCAA 



UUGGUUG CUGAUGAG X CGAA IAGCGAU 



CGCUCAC A ACCAAGU 



ACUUGGU CUGAUGAG X CGAA IUGAGCG 



UCACAAC C AAGUGAG 



CUCACUU CUGAUGAG X CGAA IUUGUGA 



CACAACC A AGUGAGG 



CCUCACU CUGAUGAG X CGAA IGUUGUG 



GUGAGGC A GGUCCCA 



UGGGACC CUGAUGAG X CGAA ICCUCAC 



GCAGGUC C CACUGCA 



UGCAGUG CUGAUGAG X CGAA IACCUGC 



CAGGUCC C ACUGCAG 



CUGCAGU CUGAUGAG X CGAA IGACCUG 



AGGUCCC A CUGCAGA 



UCUGCAG CUGAUGAG X CGAA IGGACCU 



GUCCCAC U GCAGAGG 



CCUCUGC CUGAUGAG X CGAA IUGGGAC 



CCACUGC A GAGGCUG 



CAGCCUC CUGAUGAG X CGAA ICAGUGG 



CAGAGGC U GCGGAUU 



AAUCCGC CUGAUGAG X CGAA ICCUCUG 



GCGAGGC A CCCAGCU 



AGCUGGG CUGAUGAG X CGAA ICCUCGC 



GAGGCAC C CAGCUCU 



AGAGCUG CUGAUGAG X CGAA IUGCCUC 



AGGCACC C AGCUCUU 



AAGAGCU CUGAUGAG X CGAA IGUGCCU 



GGCACCC A GCUCUUU 



AAAGAGC CUGAUGAG X CGAA IGGUGCC 



ACCCAGC U CUUUGAG 



CUCAAAG CUGAUGAG X CGAA ICUGGGU 



CCAGCUC U UUGAGGA 



UCCUCAA CUGAUGAG X CGAA IAGCUGG 



UGAGGAC A ACUAUGC 



GCAUAGU CUGAUGAG X CGAA IUCCUCA 



GGACAAC U AUGCCCU 



AGGGCAU CUGAUGAG X CGAA IUUGUCC 



ACUAUGC C CUGGCCG 



CGGCCAG CUGAUGAG X CGAA ICAUAGU 



CUAUGCC C UGGCCGU 



ACGGCCA CUGAUGAG X CGAA IGCAUAG 



UAUGCCC U GGCCGUG 



CACGGCC CUGAUGAG X CGAA IGGCAUA 



CCCUGGC C GUGCUAG 



CUAGCAC CUGAUGAG X CGAA ICCAGGG 



GCCGUGC U AGACAAU 



AUUGUCU CUGAUGAG X CGAA ICACGGC 



GCUAGAC A AUGGAGA 



UCUCCAU CUGAUGAG X CGAA IUCUAGC 



UGGAGAC C CGCUGAA 



UUCAGCG CUGAUGAG X CGAA IUCUCCA 



GGAGACC C GCUGAAC 



GUUCAGC CUGAUGAG X CGAA IGUCUCC 



GACCCGC U GAACAAU 



AUUGUUC CUGAUGAG X CGAA ICGGGUC 



GCUGAAC A AUACCAC 



GUGGUAU CUGAUGAG X CGAA IUUCAGC 



ACAAUAC C ACCCCUG 



CAGGGGU CUGAUGAG X CGAA IUAUUGU 



CAAUACC A CCCCUGU 



ACAGGGG CUGAUGAG X CGAA IGUAUUG 



AUACCAC C CCUGUCA 



UGACAGG CUGAUGAG X CGAA IUGGUAU 



UACCACC C CUGUCAC 



GUGACAG CUGAUGAG X CGAA IGUGGUA 



ACCACCC C UGUCACA 



UGUGACA CUGAUGAG X CGAA IGGUGGU 



CCACCCC U GUCACAG 



CUGUGAC CUGAUGAG X CGAA IGGGUGG 



CCCUGUC A CAGGGGC 



GCCCCUG CUGAUGAG X CGAA IACAGGG 
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CUGUCAC A GGGGCCU 



CAGGGGC C UCCCCAG 



AGGGGCC U CCCCAGG 



GGGCCUC C CCAGGAG 



GGCCUCC C CAGGAGG 



GCCUCCC C AGGAGGC 



CCUCCCC A GGAGGCC 



AGGAGGC C UGCGGGA 



GGAGGCC U GCGGGAG - 



CGGGAGC U GCAGCUU 



GAGCUGC A GCUUCGA 



CUGCAGC U UCGAAGC 



UCGAAGC C UCACAGA 



CGAAGCC U CACAGAG 



AAGCCUC A CAGAGAU 



GCCUCAC A GAGAUCU 



AGGCCCC CUGAUGAG X CGAA IUGACAG ~ 
CUGGGGA CUGAUGAG X CGAA ICCCCUG 



CCUGGGG CUGAUGAG X CGAA IGCCCCU 



" CUCCUGG CUGAUGAG X CGAA IAGGCCC 



AGAGAUC U UGAAAGG 



AGGGGUC U UGAUCCA 



CUUGAUC C AGCGGAA 



UUGAUCC A GCGGAAC 



GCGGAAC C CCCAGCU 



CGGAACC C CCAGCUC 



GGAACCC C CAGCUCU 



GAACCCC C AGCUCUG 



AACCCCC A GCUCUGC 



CCCCAGC U CUGCUAC 



CCAGCUC U GCUACCA 



CCUCCUG CUGAUGAG X CGAA IGAGGCC 



GCCUCCU CUGAUGAG X CGAA IGGAGGC 



GGCCUCC CUGAUGAG X CGAA IGGGAGG 



UCCCGCA CUGAUGAG X CGAA ICCUCCU 



CUCCCGC CUGA UGAG X CGAA IGCCUCC 
AAGCUGC CUGAUGAG X CGAA ICUCCCG 



' UCGAAGC CUGAUGAG X CGAA ICAGCUC 



GCUUCGA CUGAUGAG X CGAA ICUGCAG~ 
UCUGUGA CUGAUGAG X CGAA ICUUCGA 



CUCUGUG CUGAUGAG X CGAA IGCUUCG 



AUCUCUG CUGAUGAG X CGAA IAGGCUU 



AGAUCUC CU GAUGAG X CGAA IUGAGGC 
CCUUUCA CUGAUGAG X CGAA IAUCUCU 



UGGAUCA CUGAUGAG X CGAA IACCCCU 



UUCCGCU CUGAUGAG X CGAA IAUCAAG 



GCUCUGC U ACCAGGA 



CUGCUAC C AGGACAC 



UGCUACC A GGACACG 



CCAGGAC A CGAUUUU 



GAAGGAC A UCUUCCA 



GGACAUC U UCCACAA 



CAUCUUC C ACAAGAA 



AUCUUCC A CAAGAAC 



CUUCCAC A AGAACAA 



CAAGAAC A ACCAGCU 



GAACAAC C AGCUGGC 



AACAACC A GCUGGCU 



AACCAGC U GGCUCUC 



AGCUGGC U CUCACAC 



GUUCCGC CUGAUGAG X CGAA IGAUCAA 
AGCUGGG CU GAUGAG X CGAA IUUCCGC 
GAGCUGG CUGAUGAG X CGAA IGUUCCG 



AGAGCUG CUGAUGAG X CGAA IGGUUCC 



CAGAGCU CUGAUGAG X CGAA IGGGUUC 



GCAGAGC CUGAUGAG X CGAA IGGGGUU 



GUAGCAG CUGA UGAG X CGAA ICUGGGG 
UGGUAGC CUGAUGAG X CGAA IAGCUGG 



' UCCUGGU CUGAUGAG X CGAA ICAGAGC 



GUGUCCU CUGAUGAG X CGAA IUAGCAG 



CGUGUCC CUGAUGAG X CGAA IGUAGCA 



1 AAAAUCG CUGAUGAG X CGAA IUCCUGG 



UGGAAGA CUGAUGAG X CGAA IUCCUUC 



" UUGUGGA CUGAUGAG X CGAA IAUGUCC 



CUGGCUC U CACACUG 



GGCUCUC A CACUGAU 



CUCUCAC A CUGAUAG 



"CUCACAC U GAUAGAC 



GAUAGAC A CCAACCG 



UAGACAC C AACCGCU 



UUCUUGU CUGAUGAG X CGAA IAAGAUG 
GUUCUUG CUGAUGAG X CGAA IGAAGAU 



UUGUUCU CUGAUGAG X CGAA IUGGAAG 



AGCUGGU CUGAUGAG X CGAA IUUCUUG 



GCCAGCU CUGAUGAG X CGAA IUUGUUC " 



AGCCAGC CUGAUGAG X CGAA IGUUGUU 



GAGAGCC CUGAUGAG X CGAA ICUGGUU 



GUGUGAG CUGAUGAG X CGAA ICCAGCU 



CAGUGUG CUGAUGAG X CGAA IAGCCAG 



AUCAGUG CUGAUGAG X CGAA IAGAGCC 



CUAUCAG CUGAUGAG X CGAA IUGAGAG 



GUCUAUC CUGAUGAG X CGAA IUGUGAG " 
CGGUUGG CUGAUGAG X CGAA IUCUAUC 
AGCGGUU CUGAUGAG X CGAA IUGUCUA 
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AGAC ACC A ACCGCUC ^ 
CACCA AC C GCUCUCG 
CAACCGC U CUCGGGC 
ACCGCUC U CGGGCCU 
CUCGGGC C UGCCACC 
UCGGGCC U GCCACCC 
GGCCUGC C ACCCCUG 
GCCUGCC A CCCCUGU 
CUGCCAC C CCUGUUC 
UGCCACC C CUGUUCU 
GCCACCC C UGUUCUC 
CCACCCC U GUUCUCC 
CCUGUUC U CCGAUGU 
UGUUCUC C GAUGUGU 
UAAGGGC U CCCGCUG 
AGGGCUC C CGCUGCU 
GGGCUCC C GCUGCUG 
CUCCCGC U GCUGGGG 
CCGCUGC U GGGGAGA 
AGAGUUC U GAGGAUU 
GAUUGUC A GAGCCUG 
UCAGAGC C UGACGCG 
CAGAGCC U GACGCGC 
GACGCGC A CUGUCUG 
CGCGCAC U GUCUGUG 
CACUGUC U GUGCCGG 
UCUGUGC C GGUGGCU 
CGGUGGC U GUGCCCG 
GCUGUGC C CGCUGCA 
CUGUGCC C GCUGCAA 
UGCCCGC U GCAAC 
CCGCUGC A AGGGC 
AAGGGGC C ACUGCCC 
AGGGGCC A CUGCCCA 
GGGCCAC U GCCCACU 
CCACUGC C CACUGAC 
CACUGCC C ACUGACU 
ACUGCCC A CUGACUG 
UGCCCAC U GACUGCU 
CACUGAC U GCUGCCA 
UGACUGC U GCCAUGA 
CUGCUGC C AUGAGCA 
UGCUGCC A UGAGCAG 
CAUGAGC A GUGUi 
AGUGUGC U GCCG 1 
GUGCUGC C GGCUGCA 
UGCCGGC U GCACGGG | 



GAGCGGU CUGAUGAG X CGAA IGUGUCU 
CGAGAGC CUGAUGAG X CGAA IUUGGUG 
GCCCGAG CUGAUGAG X CGAA ICGGUUG 
AGGCCCG CUGAUGAG X CGAA IAGCGGU 
GGUGGCA CUGAUGAG X CGAA ICCCGAG 
GGGUGGC CUGAUGAG X CGAA IGCCCGA 
CAGGGGU CUGAUGAG X CGAA ICAGGCC 
ACAGGGG CUGAUGAG X CGAA IGCAGGC 
GAACAGG CUGAUGAG X CGAA IUGGCAG 
AGAACAG CUGAUGAG X CGAA IGUGGCA 
GAGAACA CUGAUGAG X CGAA IGGUGGC 
GGAGAAC CUGAUGAG X CGAA IGGGUGG 
ACAUCGG CUGAUGAG X CGAA IAACAGG 
ACACAUC CUGAUGAG X CGAA IAGAACA 
CAGCGGG CUGAUGAG X CGAA ICCCUUA 
AGCAGCG CUGAUGAG X CGAA IAGCCCU 
CAGCAGC CUGAUGAG X CGAA IGAGCCC 
CCCCAGC CUGAUGAG X CGAA ICGGGAG 
UCUCCCC CUGAUGAG X CGAA ICAGCGG 
AAUCCUC CUGAUGAG X CGAA IAACUCU 
CAGGCUC CUGAUGAG X CGAA IACAAUC 
CGCGUCA CUGAUGAG X CGAA ICUCUGA 
GCGCGUC CUGAUGAG X CGAA IGCUCUG 
CAGACAG CUGAUGAG X CG AA ICGCGUC 
CACAGAC CUGAUGAG X CGAA IUGCGCG 
CCGGCAC CUGAUGAG X CGAA IACAGUG 
AGCCACC CUGAUGAG X CGAA ICACAGA 
CGGGCAC CUGAUGAG X CGAA ICCACCG 
UGCAGCG CUGAUGAG X CGAA ICACAGC 
UUGCAGC CUGAUGAG X CGAA IGCACAG 
CCCUUGC CUGAUGAG X CGAA ICGGGCA 
GGCCCCU CUGAUGAG X CGAA ICAGCGG 
GGGCAGU 

ugggcag" 

agugggc cugaugag x cgaa iuggccc 
gucagug cugaugag x cgaa icagugg 
agucagu cugaugag x cgaa igcagug 
cagucag cugaugag x cgaa iggcagu 
agcaguc cugaugag x cgaa iugggca 

UGGCAGC 
UCAUGGC 



CUGAUGAG X CGAA ICCCCUU 
CUGAUGAG X CGAA IGCCCCU 



CUGAUGAG X CGAA IUCAGUG 
CUGAUGAG X CGAA ICAGUCA 
UGCUCAU CUGAUGAG X CGAA ICAGCAG 
CGAA IGCAGCA 
CGAA ICUCAUG 



CUGCUCA CUGAUGAG X 
AGCACAC CUGAUGAG X ^ 
AGCCGGC CUGAUGAG X CGAA ICACACU 
UGCAGCC CUGAUGAG X CGAA ICAGCAC 
CCCGUGC CUGAUGAG X CGAA ICCGGCA 
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CGGCUGC A CGGGCCC 



GGGCCCG CUGAUGAG X CGAA ICAGCCG 



CACGGGC C CCAAGCA 



ACG3GCC C CAAGCAC 



CGGGCCC C AAGCACU 



GGGCCCC A AGCACUC 



UGCUUGG CUGAUGAG X CGAA ICCCGUG 



GUGCUUG CUGAUGAG X CGAA IGCCCGU 



AGUGCUU CUGAUGAG X CGAA IGGCCCG 



GAGUGCU CUGAUGAG X CGAA IGGGCCC 



CCCAAGC A CUCUGAC 



CAAGCAC U CUGACUG 



GUCAGAG CUGAUGAG X CGAA ICUUGGG 



CAGUCAG CUGAUGAG X CGAA IUGCUUG 



AGCACUC U GACUGCC 



CUCUGAC U GCCUGGC 



GGCAGUC CUGAUGAG X CGAA IAGUGCU 



GCCAGGC CUGAUGAG X CGAA IUCAGAG 



UGACUGC C UGGCCUG 



GACUGCC U GGCCUGC 



GCCUGGC C UGCCUCC 



CAGGCCA CUGAUGAG X CGAA ICAGUCA 



GCAGGCC CUGAUGAG X CGAA IGCAGUC 



GGAGGCA CUGAUGAG X CGAA ICCAGGC 



CCUGGCC U GCCUCCA 



GGCCUGC C UCCACUU 



GCCUGCC U CCACUUC 



CUGCCUC C ACUUCAA 



UGCCUCC A CUUCAAC 



CCUCCAC U UCAACCA 



CCACUUC A ACCACAG 



CUUCAAC C ACAGUGG 



UGGAGGC CUGAUGAG X CGAA IGCCAGG 



AAGUGGA CUGAUGAG X CGAA ICAGGCC 



GAAGUGG CUGAUGAG X CGAA IGCAGGC 



UUGAAGU CUGAUGAG X CGAA IAGGCAG 



GUUGAAG CUGAUGAG X CGAA IGAGGCA 



UGGUUGA CUGAUGAG X CGAA IUGGAGG 



CUGUGGU CUGAUGAG X CGAA IAAGUGG 



CCACUGU CUGAUGAG X CGAA IUUGAAG 



UUCAACC A CAGUGGC 



GCCACUG CUGAUGAG X CGAA IGUUGAA 



CAACCAC A GUGGCAU 



CAGUGGC A UCUGUGA 



UGGCAUC U GUGAGCU 



UGUGAGC U GCACUGC 



GAGCUGC A CUGCCCA 



GCUGCAC U GCCCAGC 



AUGCCAC CUGAUGAG X CGAA IUGGUUG 



UCACAGA CUGAUGAG X CGAA ICCACUG 



AGCUCAC CUGAUGAG X CGAA IAUGCCA 



GCAGUGC CUGAUGAG X CGAA ICUCACA 



UGGGCAG CUGAUGAG X CGAA ICAGCUC 



GCUGGGC CUGAUGAG X CGAA IUGCAGC 



GCACUGC C CAGCCCU 



CACUGCC C AGCCCUG 



ACUGCCC A GCCCUGG 



AGGGCUG CUGAUGAG X CGAA ICAGUGC 



CAGGGCU CUGAUGAG X CGAA IGCAGUG 



CCAGGGC CUGAUGAG X CGAA IGGCAGU 



GCCCAGC C CUGGUCA 



CCCAGCC C t 
CCAGCCC U C 



CCUGGUC A CCUACAA 



UGGUCAC C UACAACA 



UGACCAG CUGAUGAG X CGAA ICUGGGC 



GUGACCA CUGAUGAG X CGAA IGCUGGG 



GGUGACC CUGAUGAG X CGAA IGGCUGG 



UUGUAGG CUGAUGAG X CGAA IACCAGG 



UGUUGUA CUGAUGAG X CGAA IUGACCA 



GGUCACC U ACAACAC 



CACCUAC A ACACAGA 



CUACAAC A CAGACAC 



ACAACAC A GACACGU 



CACAGAC A CGUUUGA 



UUGAGUC C AUGCCCA 



UGAGUCC A UGCCCAA 



GUGUUGU CUGAUGAG X CGAA IGUGACC 



UCUGUGU CUGAUGAG X CGAA IUAGGUG 



GUGUCUG CUGAUGAG X CGAA IUUGUAG 



ACGUGUC CUGAUGAG X CGAA IUGUUGU 



UCAAACG CUGAUGAG X CGAA IUCUGUG 



UGGGCAU CUGAUGAG X CGAA IACUCAA 



UUGGGCA CUGAUGAG X CGAA IGACUCA 



UCCAUGC C CAAUCCC 



CCAUGCC C AAUCCCG 



GGGAUUG CUGAUGAG X CGAA ICAUGGA 



CGGGAUU CUGAUGAG X CGAA IGCAUGG 



CAUGCCC A AUCCCGA 



CCCAAUC C CGAGGGC 



CCAAUCC C GAGGGCC 



UCGGGAU CUGAUGAG X CGAA IGGCAUG 



GCCCUCG CUGAUGAG X CGAA IAUUGGG 



GGCCCUC CUGAUGAG X CGAA IGAUUGG 
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CGAGGGC C GGUAUAC 



GGUAUAC A UUCGGCG 



GUAUACC CUGAUGAG X CGAA ICCCUCG 



CGCCGAA CUGAUGAG X CGAA IUAUACC 



UCGGCGC C AGCUGUG 



CGGCGCC A GCUGUGU 



CGCCAGC U GUGUGAC 



GUGUGAC U GCCUGUC 



UGACUGC C UGUCCCU 



GACUGCC U GUCCCUA 



GCCUGUC C CUACAAC 



CCUGUCC C UACAACU 



CUGUCCC U ACAACUA 



UCCCUAC A ACUACCU 



CUACAAC U ACCUUUC 



CAACUAC C UUUCUAC 



AACUACC U UUCUACG 



ACCUUUC U ACGGACG 



GGGAUCC U GCACCCU 



AUCCUGC A CCCUCGU 



CCUGCAC C CUCGUCU 



CUGCACC C UCGUCUG 



UGCACCC U CGUCUGC 



CCUCGUC U GCCCCCU 



CGUCUGC C CCCUGCA 



GUCUGCC C CCUGCAC 



CUGCCCC C UGCACAA 



CACAGCU CUGAUGAG X CGAA ICGCCGA 



ACACAGC CUGAUGAG X CGAA IGCGCCG 



GUCACAC CUGAUGAG X CGAA ICUGGCG 



GACAGGC CUGAUGAG X CGAA IUCACAC 



AGGGACA CUGAUGAG X CGAA ICAGUCA 



UAGGGAC CUGAUGAG X CGAA IGCAGUC 



GUUGUAG CUGAUGAG X CGAA IACAGGC 



AGUUGUA CUGAUGAG X CGAA IGACAGG 



UAGUUGU CUGAUGAG X CGAA IGGACAG 



AGGUAGU CUGAUGAG X CGAA IUAGGGA 



GAAAGGU CUGAUGAG X CGAA IUUGUAG 



GUAGAAA CUGAUGAG X CGAA IUAGUUG 



CGUAGAA CUGAUGAG X CGAA IGUAGUU 



CGUCCGU CUGAUGAG X CGAA IAAAGGU 



GGGUGCA CUGAUGAG X CGAA IAUCCCA 



AGGGUGC CUGAUGAG X CGAA IGAUCCC 



ACGAGGG CUGAUGAG X CGAA ICAGGAU 



AGACGAG CUGAUGAG X CGAA IUGCAGG 



CAGACGA CUGAUGAG X CGAA IGUGCAG 



GCAGACG CUGAUGAG X CGAA IGGUGCA 



AGGGGGC CUGAUGAG X CGAA IACGAGG 
[ CGAA ICAGACG 



UGCAGGG CUGAUGAG X ( 



GUGCAGG CUGAUGAG X CGAA IGCAGAC 



UGUGCAG CUGAUGAG X CGAA IGGCAGA 



: CGAA IGGGCAG 
GUUGUGC CUGAUGAG X CGAA IGGGGCA 



UUGUGCA CUGAUGAG X ( 
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UGCCCCC U GCACAAC 



CCCCUGC A CAACCAA 



CCUGCAC A ACCAAGA 



GCACAAC C AAGAGGU 



CACAACC A AGAGGUG 



AGGUGAC P 
UGACAGC P 
AUGGAAC f 



GGAACAC A GCGGUGU 



GAAGUGC A GCAAGCC 



: C CUGUGCC 
: C UGUGCCC 



CAAGCCC U GUGCCCG 



CCUGUGC C CGAGUGU 



UUGGUUG CUGAUGAG X CGAA ICAGGGG 



UCUUGGU CUGAUGAG X CGAA IUGCAGG 



ACCUCUU CUGAUGAG X CGAA IUUGUGC 



CACCUCU CUGAUGAG X CGAA IGUUGUG 



CCUCUGC CUGAUGAG X CGAA IUCACCU 



CAUCCUC CUGAUGAG X CGAA ICUGUCA 



ACCGCUG CUGAUGAG X CGAA IUUCCAU 



ACACCGC CUGAUGAG X CGAA IUGUUCC 



GGCUUGC CUGAUGAG X CGAA ICACUUC 



CAGGGCU CUGAUGAG X CGAA ICUGCAC 



GGCACAG CUGAUGAG X CGAA ICUUGCU 



GGGCACA CUGAUGAG X CGAA IGCUUGC 



CGGGCAC CUGAUGAG X CGAA IGGCUUG 



ACACUCG CUGAUGAG X CGAA ICACAGG 



CUGUGCC C GAGUGUG 



AGUGUGC U AUGGUCU 



UAUGGUC I 
UCUGGGC I 



AUGGAGC A CUUGCGA 



CACACUC CUGAUGAG X CGAA IGCACAG 



AGACCAU CUGAUGAG X CGAA ICACACU 



CAUGCCC CUGAUGAG X CGAA IACCAUA 



UGCUCCA CUGAUGAG X CGAA ICCCAGA 



UCGCAAG CUGAUGAG X CGAA ICUCCAU 
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GGAGCAC U UGCGAGA 
UGAGGGC A GUUACCA 
CAGUUAC C AGUGCCA 
AGUUACC A GUGCCAA 
CCAGUGC C AAUAUCC 
CAGUGCC A AUAUCCA 
CAAUAUC C AGGAGUU 
AAUAUCC A GGAGUUU 
AGUUUGC U GGCUGCA 
UGCUGGC U GCAAGAA 
UGGCUGC A AGAAGAU 
GAAGAUC U UUGGGAG 
UGGGAGC C UGGCAUU 
GGGAGCC U GGCAUUU 
GCCUGGC A UUUCUGC 
GCAUUUC U GCCGGAG 
UUUCUGC C GGAGAGC 
GGAGAGC U UUGAUGG 
UGGGGAC C CAGCCUC 
GGGGACC C AGCC1 
GGGACCC A GCCUl 
ACCCAGC C UCCAACA 



UCUCGCA CUGAUGAG X CGAA IUGCUCC 
UGGUAAC CUGAUGAG X CGAA ICCCUCA 
UGGCACU CUGAUGAG X CGAA IUAACUG 
UUGGCAC CUGAUGAG X CGAA IGUAACU 
GGAUAUU CUGAUGAG X CGAA ICACUGG 
UGGAUAU CUGAUGAG X CGAA IGCACUG 
AACUCCU CUGAUGAG X CGAA IAUAUUG 
AAACUCC CUGAUGAG X CGAA IGAUAUU 
UGCAGCC CUGAUGAG X CGAA ICAAACU 
UUCUUGC CUGAUGAG X CGAA ICCAGCA 
AUCUUCU CUGAUGAG X CGAA ICAGCCA 
CUCCCAA CUGAUGAG X CGAA IAUCUUC 
AAUGCCA CUGAUGAG X CGAA ICUCCCA 
AAAUGCC CUGAUGAG X CGAA IGCUCCC 
GCAGAAA CUGAUGAG X CGAA ICCAGGC 
CUCCGGC CUGAUGAG X CGAA IAAAUGC 
GCUCUCC CUGAUGAG X CGAA ICAGAAA 
CCAUCAA CUGAUGAG X CGAA ICUCUCC 
GAGGCUG CUGAUGAG X CGAA IUCCCCA 
•GGAGGCU CUGAUGAG X CGAA IGUCCCC 
UGGAGGC CUGAUGAG X CGAA IGGUCCC 
CUGAUGAG X CGAA ICUGGGU 



CCCAGCC U CCAACAC 



GUGUUGG CUGAUGAG X CGAA IGCUGGG 
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CAGCCUC C AACACUG 



AGCCUCC A ACACUGC 



CUCCAAC A CUGCCCC 



CCAACAC U GCCCCGC 



ACACUGC C CCGCUCC 
CACUGCC C CGCUCCA 



ACUGCCC C 
GCCCCGC I 



CAGUGUU CUGAUGAG X CGAA IAGGCUG 



GCAGUGU CUGAUGAG X CGAA IGAGGCU 



GGGGCAG CUGAUGAG X CGAA IUUGGAG 



GCGGGGC CUGAUGAG X CGAA IUGUUGG 



GGAGCGG CUGAUGAG X CGAA ICAGUGU 



UGGAGCG CUGAUGAG X CGAA IGCAGUG 



CUGGAGC CUGAUGAG X CGAA IGGCAGU 



UGGCUGG CUGAUGAG X CGAA ICGGGGC 



CCCGCUC C AGCCAGA 



UCUGGCU CUGAUGAG X CGAA IAGCGGG 



ccgcucc ; 

CUCCAGC ( 



; A GAGCAGC 
: A GCUCCAA 



CUCUGGC CUGAUGAG X CGAA IGAGCGG 



CUGCUCU CUGAUGAG X CGAA ICUGGAG 



GCUGCUC CUGAUGAG X CGAA IGCUGGA 



UUGGAGC CUGAUGAG X CGAA ICUCUGG 



GAGCAGC U CCAAGUG 



CACUUGG CUGAUGAG X CGAA ICUGCUC 



W3CUC C AAGUGUU 
3CUCC A AGUGUUU 
UUGAGAC U CUGGAAG 
GAGACUC U GGAAC 
AGAGAUC A CAGGl 
AGAUCAC A GGUUACC 
AGGUUAC C UAUACAU 
GGUUACC U AUACAUC 
CCUAUAC A UCUCAGC 
AUACAUC U CAGCAUG 



CUGAUGAG X CGAA IAGCUGC 

AAACACU CUGAUGAG X CGAA IGAGCUG 

CUUCCAG CUGAUGAG X CGAA IUCUCAA 

CUCUUCC CUGAUGAG X CGAA IAGUCUC 

UAACCUG CUGAUGAG X CGAA IAUCUCU 

GGUAACC CUGAUGAG X CGAA IUGAUCU 

AUGUAUA CUGAUGAG X CGAA IUAACCU 

GAUGUAU CUGAUGAG X CGAA IGUAACC 

GCUGAGA CUGAUGAG X CGAA IUAUAGG 

CAUGCUG CUGAUGAG X CGAA IAUGUAU 
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Table 34 



1413 


ACAUCUC A GCAUGGC 




GCCAUGC CUGAUGAG X CGAA IAGAUGU 




1416 


UCUCAGC A UGGCCGG 




CCGGCCA CUGAUGAG X CGAA ICUGAGA 




1421 


GCAUGGC C GGACAGC 




GCUGUCC CUGAUGAG X CGAA ICCAUGC 




1426 


GCCGGAC A GCCUGCC 




GGCAGGC CUGAUGAG X CGAA IUCCGGC 




1429 


GGACAGC C UGCCUGA 




UCAGGCA CUGAUGAG X CGAA ICUGUCC 




1430 


GACAGCC U GCCUGAC 




GUCAGGC CUGAUGAG X CGAA IGCUGUC 




1433 


AGCCUGC C UGACCUC 




GAGGUCA CUGAUGAG X CGAA ICAGGCU 






GCCUGCC U GACCUCA 




UGAGGUC CUGAUGAG X CGAA IGCAGGC 




Il3l 


GCCUGAC C UCAGCGU 




ACGCUGA CUGAUGAG X CGAA IUCAGGC 




1439 


CCUGACC U CAGCGUC 




GACGCUG CUGAUGAG X CGAA IGUCAGG 






UGACCUC A GCGUCUU 




AAGACGC CUGAUGAG X CGAA IAGGUCA 




1447 


CAGCGUC U UCCAGAA 




UUCUGGA CUGAUGAG X CGAA IACGCUG 






CGUCUUC C AGAACCU 




AGGUUCU CUGAUGAG X CGAA IAAGACG 




1451 


GUCUUCC A GAACCUG 




CAGGUUC CUGAUGAG X CGAA IGAAGAC 




1456 


CCAGAAC C UGCAAGU 




ACUUGCA CUGAUGAG X CGAA IUUCUGG 




1457 


CAGAACC U GCAAGUA 




UACUUGC CUGAUGAG X CGAA IGUUCUG 




1460 


AACCUGC A AGUAAUC 




GAUUACU CUGAUGAG X CGAA ICAGGUU 




1468 


AGUAAUC C GGGGACG 




CGUCCCC CUGAUGAG X CGAA IAUUACU 




1481 


CGAAUUC U GCACAAU 




AUUGUGC CUGAUGAG X CGAA IAAUUCG 




1484 


AUUCUGC A CAAUGGC 




GCCAUUG CUGAUGAG X CGAA ICAGAAU 




1486 


UCUGCAC A AUGGCGC 




GCGCCAU CUGAUGAG X CGAA IUGCAGA 




1494 


AUGGCGC C UACUCGC 




GCGAGUA CUGAUGAG X CGAA ICGCCAU 




1495 


UGGCGCC U ACUCGCU 




AGCGAGU CUGAUGAG X CGAA IGCGCCA 




1498 


CGCCUAC U CGCUGAC 




GUCAGCG CUGAUGAG X CGAA IUAGGCG 




1502 


UACUCGC U GACCCUG 




CAGGGUC CUGAUGAG X CGAA ICGAGUA 




1506 ' 


CGCUGAC C CUGCAAG 




CUUGCAG CUGAUGAG X CGAA IUCAGCG 




1507 


GCUGACC C UGCAAGG 




CCUUGCA CUGAUGAG X CGAA IGUCAGC 




1508 


CUGACCC U GCAAGGG 




CCCUUGC CUGAUGAG X CGAA IGGUCAG 




1511 


ACCCUGC A AGGGCUG 




CAGCCCU CUGAUGAG X CGAA ICAGGGU 




1517 


CAAGGGC U GGGCAUC 




GAUGCCC CUGAUGAG X CGAA ICCCUUG 




1522 


GCUGGGC A UCAGCUG 




CAGCUGA CUGAUGAG X CGAA ICCCAGC 




1525 


GGGCAUC A GCUGGCU 




AGCCAGC CUGAUGAG X CGAA IAUGCCC 






CAUCAGC U GGCUGGG 




CCCAGCC CUGAUGAG X CGAA ICUGAUG 




— — 


AGCUGGC U GGGGCUG 




CAGCCCC CUGAUGAG X CGAA ICCAGCU 




1538 


CUGGGGC U GCGCUCA 




UGAGCGC CUGAUGAG X CGAA ICCCCAG 




1543 


GCUGCGC U CACUGAG 




CUCAGUG CUGAUGAG X CGAA ICGCAGC 




1545 


UGCGCUC A CUGAGGG 




CCCUCAG CUGAUGAG X CGAA IAGCGCA 




1547 


CGCUCAC U GAGGGAA 




UUCCCUC CUGAUGAG X CGAA IUGAGCG 




1556 


AGGGAAC U GGGCAGU 




ACUGCCC CUGAUGAG X CGAA IUUCCCU 




1561 


ACUGGGC A GUGGACU 




AGUCCAC CUGAUGAG X CGAA ICCCAGU 




1568 


AGUGGAC U GGCCCUC 




GAGGGCC CUGAUGAG X CGAA IUCCACU 




1572 


GACUGGC C CUCAUCC 




GGAUGAG CUGAUGAG X CGAA ICCAGUC 




1573 


ACUGGCC C UCAUCCA 




UGGAUGA CUGAUGAG X CGAA IGCCAGU 




1574 


CUGGCCC U CAUCCAC 




GUGGAUG CUGAUGAG X CGAA IGGCCAG 




1576 


GGCCCUC A UCCACCA 




UGGUGGA CUGAUGAG X CGAA IAGGGCC 




1579 


CCUCAUC C ACCAUAA 




UUAUGGU CUGAUGAG X CGAA IAUGAGG 




1580 


CUCAUCC A CCAUAAC 




GUUAUGG CUGAUGAG X CGAA IGAUGAG 
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1582 
1583 



CAUCCAC C AUAACAC 
AUCCACC A UAACACC 
CCAUAAC A CCCACCU 
AUAACAC C CACCUCU 
UAACACC C ACCUCUG 
AACACCC A CCUCUGC 
CACCCAC C UCUGCUU 
ACCCACC U CUGCUUC 
CCACCUC U GCUUCGU 



1669 
1673 



CCUCUGC U UCGUGCA 
UUCGUGC A CACGGUG 
CGUGCAC A CGGUGCC 
ACGGUGC C CUGGGAC 



GGUGCCC U GGGACCA 
CUGGGAC C AGCUCUU 
UGGGACC A GCUCUUU 
GACCAGC U CUUUCGG 
CCAGCUC U UUCGGAA 
UCGGAAC C CGCACCA 
CGGAACC C GCACCAA 
AACCCGC A CCAAGCU 
GCAC C AAGCUCU 
CCGCACC A AGCUCUG 
ACCAAGC U C 
CAAGCUC U C 
GCUCUGC U C 
UCUGCUC C ACACUGC 
CUGCUCC A CACUGCC 
GCUCCAC A CUGCCAA 
UCCACAC U C 
ACACUGC C I 
CACUGCC A ACCGGCC 
UGCCAAC C GGCCAGA 
AACCGGC C 
ACCGGCC A 



GAGGACG 
UGGCCUG 
GAGGGCC U GGCCUGC 
GCCUGGC C UGCCACC 
CCUGGCC U GCCACCA 
GGCCUGC C ACCAGCU 
GCCUGCC A CCAGCUG 
CUGCCAC C AGCUGUG 
UGCCACC A GCUGUGC 
CACCAGC U GUGCGCC 
UGUGCGC C CGAGGGC 
GUGCGCC C GAGGGCA 



GUGUUAU CUGAUGAG X CGAA IUGGAUG 
GGUGUUA CUGAUGAG X CGAA IGUGGAU 
AGGUGGG CUGAUGAG X CGAA IUUA'JGG 
AGAGGUG CUGAUGAG X CGAA IUGUUAU 
CAGAGGU CUGAUGAG X CGAA IGUGUUA 
CUGAUGAG X CGAA IGGUGUU 



AAGCAGA CUGAUGAG X CGAA IUGGGUG 
GAAGCAG CUGAUGAG X CGAA IGUGGGU 
ACGAAGC CUGAUGAG X CGAA IAGGUGG 



UGCACGA CUGAUGAG X CGAA ICAGAGG 



CCGAAAG CUGAUGAG X CGAA ] 



CACCGUG CUGAUGAG X CGAA I CACGAA 
GGCACCG CUGAUGAG X CGAA IUGCACG 
GUCCCAG CUGAUGAG X CGAA ICACCGU 
GGUCCCA CUGAUGAG X CGAA IGCACCG 
UGGUCCC CUGAUGAG X CGAA IGGCACC 
AAGAGCU CUGAUGA G X CGAA IUCCCAG 
AAAGAGC CUGAUGAG X CGAA IGUCCCA 
k ICUGGUC 
UUCCGAA CUGAUGAG X CGAA IAGCUGG 
UGGUGCG CUGAUGAG X CGAA IUUCCGA 
UUGGUGC CUGAUGAG X CGAA IGUUCCG 
AGCUUGG CUGAUGAG X CGAA ICGGGUU 
AGAGCUU CUGAUGAG X CGAA IUGCGGG 
CAGAGCU CUGAUGAG X CGAA IGUGCGG 
GGAGCAG CUGAUGAG X CGAA ICUUGGU 
GUGGAGC CUGAUGAG X CGAA IAGCUUG 
AGUGUGG CUGAUGAG X CGAA ICAGAGC 
GCAGUGU CUGAUGAG X CGAA IAGCAGA 
GGCAGUG CUGAUGAG X CGAA IGAGCAG 
UUGGCAG CUGAUGAG X CGAA IUGGAGC 
GGUUGGC CUGAUGAG X CGAA IUGUGGA 
GCCGGUU CUGAUGAG X CGAA ICAGUGU 
GGCCGGU CUGAUGAG X CGAA IGCAGUG 
UCUGGCC CUGAUGAG X CGAA IUUGGCA 
GUCCUCU CUGAUGAG X CGAA ICCGGUU 
CGUCCUC CUGAUGAG X CGAA IGCCGGU 
CAGGCCA CUGAUGAG X CGAA ICCCUCG 
GCAGGCC CUGAUGAG X CGAA IGCCCUC 
GGUGGCA CUGAUGAG X CGAA ICCAGGC 
UGGUGGC CUGAUGAG X CGAA IGCCAGG 
AGCUGGU CUGAUGAG X CGAA ICAGGCC 
CAGCUGG CUGAUGAG X CGAA IGCAGGC 



CACAGCU CUGAUGAG X CGAA I 



, IUGGCAG 
GCACAGC CUGAUGAG X CGAA IGUGGCA 
GGCGCAC CUGAUGAG X CGAA ICUGGUG 



; CGAA 

UGCCCUC CUGAUGAG X CGAA 



GCCCUCG CUGAUGAG X C 



ICGCACA 
IGCGCAC 
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CGAGGGC A CUGCUGG 
AGGGCAC U GCUGGGG 
GCACUGC U GGGGUCC 
UGGGGUC C AGGGCCC 
GGGGUCC A GGGCCCA 
CCAGGGC C CACCCAG 
CAGGGCC C ACCCAGU 
AGGGCCC A CCCAGUG 
GGCCCAC C CAGUC 
GCCCACC C AGUGl 
CCCACCC A GUGUGUC 
GUGUGUC A ACUGCAG 
UGUCAAC U GCAGCCA 



CUGCAGC C AGUUCCU 
UGCAGCC A GUUCCUU 
CCAGUUC C UUCGC 
CAGUUCC U UCGG( 
UCGGGGC C AGGAGUG 
CGGGGCC A GGAGUGC 
GGAAUGC C GAGUACU 
CGAGUAC U GCAGGGG 
GUACUGC A GGGGCUC 
CAGGGGC U CCCCAGG 
GGGGCUC C CCAGGGA 
GGGCUCC C CAGGI 
GGCUCCC C AGGGj 
GCUCCCC A GGGAGUA 
UGAAUGC C AGGCACU 
GAAUGCC A GGCACUG 
GCCAGGC A CUGUUUG 
CAGGCAC U GUUUGCC 
UGUUUGC C GUGCCAC 
GCCGUGC C ACCCUGA 
CCGUGCC A CCCUGAG 
GUGCCAC C CUGAGUG 
UGCCACC C UGAGUGU 
GCCACCC U GAGUGUC 
GAGUGUC A GCCCCAG 
UGUCAGC C CCAGAAU 
GUCAGCC C CAGAAUG 
UCAGCCC C AGAAUGG 
CAGCCCC A GAAUGGC 
GAAUGGC U CAGUGAC 
AUGGCUC A GUGACCU 
CAGUGAC C UGUUUUG 
AGUGACC U GUUUUGG 



CCAGCAG CUGAUGAG X CGAA ICCCUCG 
CCCCAGC CUGAUGAG X CGAA IUGCCCU 
GGACCCC CUGAUGAG X CGAA ICAGUGC 
GGGCCCU CUGAUGAG X CGAA IACCCCA 
UGGGCCC CUGAUGAG X CGAA IGACCCC 
CUGGGUG CUGAUGAG X CGAA ICCCUGG 
ACUGGGU CUGAUGAG X CGAA IGCCCUG 
CACUGGG CUGAUGAG X CGAA IGGCCCU 
CACACUG CUGAUGAG X CGAA IUGGGCC 
ACACACU CUGAUGAG X CGAA IGUGGGC 
GACACAC CUGAUGAG X CGAA IGGUGGG 
CUGCAGU CUGAUGAG X CGAA IACACAC 
UGGCUGC CUGAUGAG X CGAA IUUGACA 
AACUGGC CUGAUGAG X CGAA ICAGUUG 
AGGAACU CUGAUGAG X CGAA ICUGCAG 
AAGGAAC CUGAUGAG X CGAA IGCUGCA 
CCCCGAA CUGAUGAG X CGAA IAACUGG 
GCCCCGA CUGAUGAG X CGAA IGAACUG 
CACUCCU CUGAUGAG X CGAA ICCCCGA 
GCACUCC CUGAUGAG X CGAA IGCCCCG 
AGUACUC CUGAUGAG X CGAA ICAUUCC 
CCCCUGC CUGAUGAG X CGAA IUACUCG 
GAGCCCC CUGAUGAG X CGAA ICAGUAC 
CCUGGGG CUGAUGAG X CGAA ICCCCUG 
UCCCUGG CUGAUGAG X CGAA IAGCCCC 
CUCCCUG CUGAUGAG X CGAA IGAGCCC 
ACUCCCU CUGAUGAG X CGAA IGGAGCC 
UACUCCC CUGAUGAG X CGAA IGGGAGC 
AGUGCCU CUGAUGAG X CGAA ICAUUCA 
CAGUGCC CUGAUGAG X CGAA IGCAUUC 
CAAACAG CUGAUGAG X CGAA ICCUGGC 
GGCAAAC CUGAUGAG X CGAA IUGCCUG 
GUGGCAC CUGAUGAG X CGAA ICAAACA 
UCAGGGU CUGAUGAG X CGAA ICACGGC 
CUCAGGG CUGAUGAG X CGAA IGCACGG 
CACUCAG CUGAUGAG X CGAA IUGGCAC 
ACACUCA CUGAUGAG X CGAA IGUGGCA 
GACACUC CUGAUGAG X CGAA IGGUGGC 
CUGGGGC CUGAUGAG X CGAA IACACUC 
AUUCUGG CUGAUGAG X CGAA ICUGACA 
CAUUCUG CUGAUGAG X CGAA IGCUGAC 
CCAUUCU CUGAUGAG X CGAA IGGCUGA 
GCCAUUC CUGAUGAG X CGAA IGGGCUG 
GUCACUG CUGAUGAG X CGAA ICCAUUC 
AGGUCAC CUGAUGAG X CGAA IAGCCAU 
CAAAACA CUGAUGAG X CGAA IUCACUG 
CCAAAAC CUGAUGAG X CGAA IGUCACU 
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UUUGGAC C GGAGGCU 



AGCCUCC CUGAUGAG X CGAA IUCCAAA 



CGGAGGC U GACCAGU 



ACUGGUC CUGAUGAG X CGAA ICCUCCG 



GGCUGAC C AGUGUGU 



ACACACU CUGAUGAG X CGAA IUCAGCC 



GCUGACC A GUGUGUG 



CACACAC CUGAUGAG X CGAA IGUCAGC 



GUGUGGC C UGUGCCC 



GGGCACA CUGAUGAG X CGAA ICCACAC 



UGUGGCC U GUGCCCA 



UGGGCAC CUGAUGAG X CGAA IGCCACA 



CCUGUGC C 
CUGUGCC C 



UAUAGUG CUGAUGAG X CGAA ICACAGG 



UUAUAGU CUGAUGAG X CGAA IGCACAG 



UGUGCCC f 
UGCCCAC I 



CUUAUAG CUGAUGAG X CGAA IGGCACA 



UCCUUAU CUGAUGAG X CGAA IUGGGCA 



UAAGGAC C CUCCCUU 



AAGGGAG CUGAUGAG X CGAA IUCCUUA 



AAGGACC C UCCCUUC 



AGGACCC U CCCUUCU 



GAAGGGA CUGAUGAG X CGAA IGUCCUU 



AGAAGGG CUGAUGAG X CGAA IGGUCCU 



GACCCUC C CUUCUGC 



GCAGAAG CUGAUGAG X CGAA IAGGGUC 



ACCCUCC C UUCUGCG 



CGCAGAA CUGAUGAG X CGAA IGAGGGU 



CCCUCCC U UCUGCGU 



ACGCAGA CUGAUGAG X CGAA IGGAGGG 



UCCCUUC U GCGUGGC 



GCCACGC CUGAUGAG X CGAA IAAGGGA 



GCGUGGC C CGCUGCC 



GGCAGCG CUGAUGAG X CGAA ICCACGC 



CGUGGCC C 
GGCCCGC I 



CCGCUGC C CCAGCGG 



CGCUGCC C CAGCGGU 



GGGCAGC CUGAUGAG X CGAA IGCCACG 



CUGGGGC CUGAUGAG X CGAA ICGGGCC 
CCGCUGG CUGAUGAG X CGAA ICAGCGG 
ACCGCUG CUGAUGAG X CGAA IGCAGCG 



GCUGCCC C AGCGGUG 



CACCGCU CUGAUGAG X CGAA IGGCAGC 



CUGCCCC A GCGGUGU 



ACACCGC CUGAUGAG X CGAA IGGGCAG 



GUGAAAC C UGACCUC 



GAGGUCA CUGAUGAG X CGAA IUUUCAC 



UGAAACC U GACCUCU 



AGAGGUC CUGAUGAG X CGAA IGUUUCA 



ACCUGAC C UCUCCUA 



CCUGACC U CUCCUAC 



UAGGAGA CUGAUGAG X CGAA IUCAGGU 



GUAGGAG CUGAUGAG X CGAA IGUCAGG 



UGACCUC I 
ACCUCUC C 



AUGUAGG CUGAUGAG X CGAA IAGGUCA 



GCAUGUA CUGAUGAG X CGAA IAGAGGU 



CCUCUCC U ACAUGCC 



GGCAUGU CUGAUGAG X CGAA IGAGAGG 



CUCCUAC A UGCCCAU 



AUGGGCA CUGAUGAG X CGAA IUAGGAG 



UACAUGC C CAUCUGG 



ACAUGCC C AUCUGGA 



CAUGCCC A UCUGGAA 



CCAGAUG CUGAUGAG X CGAA ] 
UCCAGAU CUGAUGAG X CGAA j 



UUCCAGA CUGAUGAG X CGAA IGGCAUG 



GCCCAUC U GGAAGUU 



AACUUCC CUGAUGAG X CGAA IAUGGGC 



AAGUUUC C AGAUGAG 



AGUUUCC A GAUGAGG 



AGGGCGC ; 
CGCAUGC C 



CUCAUCU CUGAUGAG X CGAA IAAACUU 



CCUCAUC CUGAUGAG X CGAA IGAAACU 
GCUGGCA CUGAUGAG X CGAA ICGCCCU 



CAAGGCU CUGAUGAG X CGAA ICAUGCG 



GCAUGCC A GCCUUGC 



UGCCAGC C UUGCCCC 



GCAAGGC CUGAUGAG X CGAA J 
GGGGCAA CUGAUGAG X CGAA I 



GCCAGCC U UGCCCCA 



UGGGGCA CUGAUGAG X CGAA IGCUGGC 



GCCUUGC C CCAUCAA 



UUGAUGG CUGAUGAG X CGAA ICAAGGC 



CCUUGCC C 
CUUGCCC C 



GUUGAUG CUGAUGAG X CGAA IGCAAGG 



AGUUGAU CUGAUGAG X CGAA IGGCAAG 



UUGCCCC A UCAACUG 



CAGUUGA CUGAUGAG X CGAA IGGGCAA 
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CCCCAUC A ACUGCAC 



GUGCAGU CUGAUGAG X CGAA IAUGGGG 



CAUCAAC U GCACCCA 



UGGGUGC CUGAUGAG X CGAA IUUGAUG 



CAACUGC A CCCACUC 



GAGUGGG CUGAUGAG X CGAA ICAGUUG 



ACUGCAC C CACUCCU 



AGGAGUG CUGAUGAG X CGAA IUGCAGU 



CUGCACC C ACUCCUG 



CAGGAGU CUGAUGAG X CGAA IGUGCAG 



UGCACCC A CUCCUGU 



ACAGGAG CUGAUGAG X CGAA I GGUGCA 



CACCCAC U CCUGUGU 



ACACAGG CUGAUGAG X CGAA IUGGGUG 



CCCACUC C UGUGUGG 



CCACACA CUGAUGAG X CGAA IAGUGGG 



CCACUCC U GUGUGGA 



UCCACAC CUGAUGAG X CGAA IGAGUGG 



UGUGGAC C UGGAUGA 



UCAUCCA CUGAUGAG X CGAA IUCCACA 



GUGGACC U GGAUGAC 



GUCAUCC CUGAUGAG X CGAA IGUCCAC 



GGAUGAC A AGGGCUG 



CAGCCCU CUGAUGAG X CGAA IUCAUCC 



CAAGGGC U GCCCCGC 



GCGGGGC CUGAUGAG X CGAA ICCCUUG 



GGGCUGC C CCGCCGA 



UCGGCGG CUGAUGAG X CGAA ICAGCCC 



GGCUGCC C CGCCGAG 



CUCGGCG CUGAUGAG X CGAA IGCAGCC 



GCUGCCC C GCCGAGC 



GCUCGGC CUGAUGAG X CGAA IGGCAGC 



GCCCCGC C GAGCAGA 



UCUGCUC CUGAUGAG X CGAA ICGGGGC 



GCCGAGC A GAGAGCC 



GGCUCUC CUGAUGAG X CGAA ICUCGGC 



AGAGAGC C AGCCCUC 



GAGGGCU CUGAUGAG X CGAA ICUCUCU 



GAGAGCC A GCCCUCU 



AGAGGGC CUGAUGAG X CGAA IGCUCUC 



AGCCAGC C CUCUGAC 



GUCAGAG CUGAUGAG X CGAA ICUGGCU 



GCCAGCC C UCUGACG 



CGUCAGA CUGAUGAG X CGAA IGCUGGC 



CCAGCCC 'J CUGACGU 



ACGUCAG CUGAUGAG X CGAA IGGCUGG 



AGCCCUC U GACGUCC 



GGACGUC CUGAUGAG X CGAA IAGGGCU 



UGACGUC C AUCAUCU 



AGAUGAU CUGAUGAG X CGAA IACGUCA 



GACGUCC A UCAUCUC 



GAGAUGA CUGAUGAG X CGAA IGACGUC 



GUCCAUC A UCUCUGC 



GCAGAGA CUGAUGAG X CGAA IAUGGAC 



CAUCAUC U CUGCGGU 



ACCGCAG CUGAUGAG X CGAA I AUG AUG 



UCAUCUC U GCGGUGG 



CCACCGC CUGAUGAG X CGAA IAGAUGA 



GGUUGGC A UUCUGCU 



AGCAGAA CUGAUGAG X CGAA ICCAACC 



GGCAUUC U GCUGGUC 



GACCAGC CUGAUGAG X CGAA IAAUGCC 



AUUCUGC U GGUCGUG 



CACGACC CUGAUGAG X CGAA ICAGAAU 



CGUGGUC U UGGGGGU 



ACCCCCA CUGAUGAG X CGAA IACCACG 



GGUGGUC U UUGGGAU 



AUCCCAA CUGAUGAG X CGAA IACCACC 



UGGGAUC C UCAUCAA 



UUGAUGA CUGAUGAG X CGAA IAUCCCA 



GGGAUCC U CAUCAAG 



CUUGAUG CUGAUGAG X CGAA IGAUCCC 



GAUCCUC A UCAAGCG 



CGCUUGA CUGAUGAG X CGAA IAGGAUC 



CCUCAUC A AGCGACG 



CGUCGCU CUGAUGAG X CGAA IAUGAGG 



CGACGGC A GCAGAAG 



CUUCUGC CUGAUGAG X CGAA ICCGUCG 



CGGCAGC A GAAGAUC 



GAUCUUC CUGAUGAG X CGAA ICUGCCG 



GAAGAUC C GGAAGUA 



UACUUCC CUGAUGAG X CGAA IAUCUUC 



GAAGUAC A CGAUGCG 



CGCAUCG CUGAUGAG X CGAA IUACUUC 



CGGAGAC U GCUGCAG 



CUGCAGC CUGAUGAG X CGAA IUCUCCG 



AGACUGC U GCAGGAA 



UUCCUGC CUGAUGAG X CGAA ICAGUCU 



CUGCUGC A GGAAACG 



CGUUUCC CUGAUGAG X CGAA ICAGCAG 



ACGGAGC U GGUGGAG 



CUCCACC CUGAUGAG X CGAA ICUCCGU 



GUGGAGC C GCUGACA 



UGUCAGC CUGAUGAG X CGAA ICUCCAC 
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GAGCCGC U GACACCU 
CGCUGAC A CCUAGCG 
"CUGACAC C UAGCGGA 
UGACACC U AGCGGAG 
GCGAUGC C CAACCAG 
CGAUGCC C AACCAGG 
GAUGCCC A ACCAGGC 
GCCCAAC C AGGCGCA 
CCCAACC A GGCGCAG 
CAGGCGC A GAUGCGG 
GCGGAUC C UGAAAGA 
CGGAUCC U GAAAGAG 
ACGGAGC U GAGGAAG 
AAGGUGC U UGGAUCU 
UUGGAUC U GGCGCUU 
CUGGCGC U UUUGGCA 
UUUUGGC A CAGUCUA 
UUGGCAC A GUCUACA 
CACAGUC U ACAAGGG 
AGUCUAC A AGGGCAU 
CAAGGGC A UCUGGAU 
GGGCAUC U GGAUCCC 



UGGAUCC C UGAUGGG 
GGAUCCC U GAUGGGG 
AAAAUUC C AGUGGCC 
AAAUUCC A GUGGCCA 
CAGUGGC C AUCAAAG 
AGUGGCC A UCAAAGU 
GGCCAUC A AAGUGUU 
GGAAAAC A CAUCCCC 
AAAACAC A UCCCCCA 
ACACAUC C CCCA 
CACAUCC C CCAA 
ACAUCCC C CAAA 
CAUCCCC C AAAG 
AUCCCCC A AAGCCAA 
CCAAAGC C AACAAAG 
CAAAGCC A ACAAAGA 
AGCCAAC A AAGAAAU 
AGAAAUC U UAGACGA 
ACGAAGC A UACGUGA 
UGAUGGC U GGUGUGG 
UGUGGGC U CCCCAUA 
UGGGCUC C CCAUAUcT 
GGGCUCC C CAUAUGU 
GGCUCCC C AUAUGUC 



AGGUGUC CUGAUGAG X CGAA ICGGCUC 
CGCUAGG CUGAUGAG X CGAA IUCAGCG 
UCCGCUA CUGAUGAG X CGAA IUGUCAG 
CUCCGCU CUGAUGAG X CGAA IGUGUCA 
CUGGUUG CUGAUGAG X CGAA ICAUCGC 
CCUGGUU CUGAUGAG X CGAA IGCAUCG 
GCCUGGU CUGAUGAG X CGAA IGGCAUC 
UGCGCCU CUGAUGAG X CGAA IUUGGGC 
CUGCGCC CUGAUGAG X CGAA IGUUGGG 
CCGCAUC CUGAUGAG X CGAA ICGCCUG 
UCUUUCA CUGAUGAG X CGAA IAUCCGC 
CUCUUUC CUGAUGAG X CGAA IGAUCCG 
CUUCCUC CUGAUGAG X CGAA ICUCCGU 
AGAUCCA CUGAUGAG X CGAA ICACCUU 
AAGCGCC CUGAUGAG X CGAA IAUCCAA 
UGCCAAA CUGAUGAG X CGAA ICGCCAG 
UAGACUG CUGAUGAG X CGAA ICCAAAA 
UGUAGAC CUGAUGAG X CGAA IUGCCAA 
CCCUUGU CUGAUGAG X CGAA IACUGUG 
AUGCCCU CUGAUGAG X CGAA IUAGACU 
AUCCAGA CUGAUGAG X CGAA ICCCUUG 
GGGAUCC CUGAUGAG X CGAA IAUGCCC 
CCAUCAG CUGAUGAG X CGAA IAUCCAG 
CCCAUCA CUGAUGAG X CGAA IGAUCCA 
CCCCAUC CUGAUGAG X CGAA IGGAUCC 
GGCCACU CUGAUGAG X CGAA IAAUUUU 
UGGCCAC CUGAUGAG X CGAA IGAAUUU 
CUUUGAU CUGAUGAG X CGAA ICCACUG 
ACUUUGA CUGAUGAG X CGAA IGCCACU 
AACACUU CUGAUGAG X CGAA IAUGGCC 
GGGGAUG CUGAUGAG X CGAA IUUUUCC 
UGGGGGA CUGAUGAG X CGAA IUGUUUU 
CUUUGGG CUGAUGAG X CGAA IAUGUGU 
GCUUUGG CUGAUGAG X CGAA IGAUGUG 
GGCUUUG CUGAUGAG X CGAA IGGAUGU 
UGGCUUU CUGAUGAG X CGAA IGGGAUG . 
UUGGCUU CUGAUGAG X CGAA IGGGGAU 
CUUUGUU CUGAUGAG X CGAA ICUUUGG 
UCUUUGU CUGAUGAG X CGAA IGCUUUG 
AUUUCUU CUGAUGAG X CGAA IUUGGCU 
UCGUCUA CUGAUGAG X CGAA IAUUUCU 
UCACGUA CUGAUGAG X CGAA ICUUCGU 
CCACACC CUGAUGAG X CGAA ICCAUCA 
UAUGGGG CUGAUGAG X CGAA ICCCACA 
CAUAUGG CUGAUGAG X CGAA IAGCCCA 
ACAUAUG CUGAUGAG X CGAA IGAGCCC 
GACAUAU CUGAUGAG X CGAA IGGAGCC 
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GCUCCCC A UAUGUCU 



AGACAUA CUGAUGAG X CGAA IGGGAGC 



AUAUGUC U CCCGCCU 



AGGCGGG CUGAUGAG X CGAA IACAUAU 



AUGUCUC C CGCCUUC 



GAAGGCG CUGAUGAG X CGAA IAGACAU 



UGUCUCC C GCCUUCU 



AGAAGGC CUGAUGAG X CGAA IGAGACA 



CUCCCGC C UUCUGGG 



CCCAGAA CUGAUGAG X CGAA ICGGGAG 



UCCCGCC U UCUGGGC 



GCCCAGA CUGAUGAG X CGAA IGCGGGA 



CGCCUUC U GGGCAUC 



GAUGCCC CUGAUGAG X CGAA IAAGGCG 



UCUGGGC A UCUGCCU 



AGGCAGA CUGAUGAG X CGAA ICCCAGA 



GGGCAUC U GCCUGAC 



GUCAGGC CUGAUGAG X CGAA IAUGCCC 



CAUCUGC C UGACAUC 



GAUGUCA CUGAUGAG X CGAA ICAGAUG 



AUCUGCC U GACAUCC 



GGAUGUC CUGAUGAG X CGAA IGCAGAU 



GCCUGAC A UCCACGG 



CCGUGGA CUGAUGAG X CGAA IUCAGGC 



UGACAUC C ACGGUGC 



GCACCGU CUGAUGAG X CGAA IAUGUCA 



GACAUCC A CGGUGCA 



UGCACCG CUGAUGAG X CGAA IGAUGUC 



ACGGUGC A GCUGGUG 



CACCAGC CUGAUGAG X CGAA ICACCGU 



GUGCAGC U GGUGACA 



UGUCACC CUGAUGAG X CGAA ICUGCAC 



UAAGCUG CUGAUGAG X CGAA IUCACCA 



GUGACAC A GCUUAUG 



CAUAAGC CUGAUGAG X CGAA IUGUCAC 



ACACAGC U UAUGCCC 



GGGCAUA CUGAUGAG X CGAA ICUGUGU 



CUUAUGC C CUAUGGC 



GCCAUAG CUGAUGAG X CGAA ICAUAAG 



UUAUGCC C UAUGGCU 



AGCCAUA CUGAUGAG X CGAA IGCAUAA 



UAUGCCC U AUGGCUG 



CAGCCAU CUGAUGAG X CGAA IGGCAUA 



CUAUGGC U GCCUCUU 



AAGAGGC CUGAUGAG X CGAA ICCAUAG 



UGGCUGC C UCUUAGA 



UCUAAGA CUGAUGAG X CGAA ICAGCCA 



GGCUGCC U CUUAGAC 



GUCUAAG CUGAUGAG X CGAA IGCAGCC. 



CUGCCUC U UAGACCA 



UGGUCUA CUGAUGAG X CGAA IAGGCAG 



CUUAGAC C AUGUCCG 



CGGACAU CUGAUGAG X CGAA IUCUAAG 



UUAGACC A UGUCCGG 



CCGGACA CUGAUGAG X CGAA IGUCUAA 



CCAUGUC C GGGAAAA 



UUUUCCC CUGAUGAG X CGAA IACAUGG 



GGAAAAC C GCGGACG 



CGUCCGC CUGAUGAG X CGAA IUUUUCC 



CGGACGC C UGGGCUC 



GAGCCCA CUGAUGAG X CGAA ICGUCCG 



GGACGCC U GGGCUCC 



GGAGCCC CUGAUGAG X CGAA IGCGUCC 



CCUGGGC U CCCAGGA 



UCCUGGG CUGAUGAG X CGAA ICCCAGG 



UGGGCUC C CAGGACC 



GGUCCUG CUGAUGAG X CGAA IAGCCCA 



GGGCUCC C AGGACCU 



AGGUCCU CUGAUGAG X CGAA IGAGCCC 



GGCUCCC A GGACCUG 



CAGGUCC CUGAUGAG X CGAA IGGAGCC 



CCAGGAC C UGCUGAA 



UUCAGCA CUGAUGAG X CGAA IUCCUGG 



CAGGACC U GCUGAAC 



GUUCAGC CUGAUGAG X CGAA IGUCCUG 



GACCUGC U GAACUGG 



CCAGUUC CUGAUGAG X CGAA ICAGGUC 



GCUGAAC U GGUGUAU 



AUACACC CUGAUGAG X CGAA IUUCAGC 



UGUAUGC A GAUUGCC 



GGCAAUC CUGAUGAG X CGAA ICAUACA 



AGAUUGC C AAGGGGA 



UCCCCUU CUGAUGAG X CGAA ICAAUCU 



GAUUGCC A AGGGGAU 



AUCCCCU CUGAUGAG X CGAA IGCAAUC 



GAUGAGC U ACCUGGA 



UCCAGGU CUGAUGAG X CGAA ICUCAUC 



GAGCUAC C UGGAGGA 



UCCUCCA CUGAUGAG X CGAA IUAGCUC 



AGCUACC U GGAGGAU 



AUCCUCC CUGAUGAG X CGAA IGUAGCU 



GUGCGGC U CGUACAC 



GUGUACG CUGAUGAG X CGAA ICCGCAC 
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Table 34 



2702 


CUCGUAC A CAGGGAC 




GUCCCUG CUGAUGAG X CGAA IUACGAG 




2704 


CGUACAC A GGGACUU 




AAGUCCC CUGAUGAG X CGAA IUGUACG 




- 27iQ 


CAGGGAC U UGGCCGC 




GCGGCCA CUGAUGAG X CGAA IUCCCUG 




" 2715 


ACUUGGC C GCUCGGA 




UCCGAGC CUGAUGAG X CGAA ICCAAGU 




- 2?18 


UGGCCGC U CGGAACG 




CGUUCCG CUGAUGAG X CGAA ICGGCCA 




2729 


AACGUGC U GGUCAAG 




CUUGACC CUGAUGAG X CGAA ICACGUU 






GCUGGUC A AGAGUCC 




GGACUCU CUGAUGAG X CGAA IACCAGC 




2~77I 


AAGAGUC C CAACCAU 




AUGGUUG CUGAUGAG X CGAA IACUCUU 




2 2?1 


AGAGUCC C AACCAUG 




CAUGGUU CUGAUGAG X CGAA IGACUCU 






GAGUCCC A ACCAUGU 




ACAUGGU CUGAUGAG X CGAA IGGACUC 






UCCCAAC C AUGUCAA 




UUGACAU CUGAUGAG X CGAA IUUGGGA 




— ItTt^ — 


CCCAACC A UGUCAAA 




UUUGACA CUGAUGAG X CGAA IGUUGGG 




2751 


CCAUGUC A AAAUUAC 




GUAAUUU CUGAUGAG X CGAA IACAUGG 






AAAUUAC A GACUUCG 




CGAAGUC CUGAUGAG X CGAA IUAAUUU 




- 

2754 


UACAGAC U UCGGGCU 




AGCCCGA CUGAUGAG X CGAA IUCUGUA 




2771 


UUCGGGC U GGCUCGG 




CCGAGCC CUGAUGAG X CGAA ICCCGAA 




2775 


GGCUGGC U CGGCUGC 




GCAGCCG CUGAUGAG X CGAA ICCAGCC 




2780 


GCUCGGC U GCUGGAC 




GUCCAGC CUGAUGAG X CGAA ICCGAGC 




2783 


CGGCUGC U GGACAUU 




AAUGUCC CUGAUGAG X CGAA ICAGCCG 




2788 


GCUGGAC A UUGACGA 




UCGUCAA CUGAUGAG X CGAA IUCCAGC 




2799 


ACGAGAC A GAGUACC 




GGUACUC CUGAUGAG X CGAA IUCUCGU 




2806 


AGAGUAC C AUGCAGA 




UCUGCAU CUGAUGAG X CGAA IUACUCU 




2807 


GAGUACC A UGCAGAU 




AUCUGCA CUGAUGAG X CGAA IGUACUC 




28^ 


ACCAUGC A GAUGGGG 




CCCCAUC CUGAUGAG X CGAA ICAUGGU 




2821 


UGGGGGC A AGGUGCC 




GGCACCU CUGAUGAG X CGAA ICCCCCA 




2828 


AAGGUGC C CAUCAAG 




CUUGAUG CUGAUGAG X CGAA ICACCUU 




. 2829 


AGGUGCC C AUCAAGU 




ACUUGAU CUGAUGAG X CGAA IGCACCU 




. 2830 


GGUGCCC A UCAAGUG 




CACUUGA CUGAUGAG X CGAA IGGCACC 




... 2833 


GCCCAUC A AGUGGAU 




AUCCACU CUGAUGAG X CGAA IAUGGGC 




2846 


AUGGCGC U GGAGUCC 




GGACUCC CUGAUGAG X CGAA ICGCCAU 




2853 


UGGAGUC C AUUCUCC 




GGAGAAU CUGAUGAG X CGAA IACUCCA 




2854 


GGAGUCC A UUCUCCG 




CGGAGAA CUGAUGAG X CGAA IGACUCC 







UCCAUUC U CCGCCGG 




CCGGCGG CUGAUGAG X CGAA IAAUGGA 






CAUUCUC C GCCGGCG 




CGCCGGC CUGAUGAG X CGAA IAGAAUG 






UCUCCGC C GGCGGUU 




AACCGCC CUGAUGAG X CGAA ICGGAGA 




Wf2 


GCGGUUC A CCCACCA 




UGGUGGG CUGAUGAG X CGAA IAACCGC 




2874 


GGUUCAC C CACCAGA 




UCUGGUG CUGAUGAG X CGAA IUGAACC 






GUUCACC C ACCAGAG 




CUCUGGU CUGAUGAG X CGAA IGUGAAC 






UUCACCC A CCAGAGU 




ACUCUGG CUGAUGAG X CGAA IGGUGAA 




2878 


CACCCAC C AGAGUGA 




UCACUCU CUGAUGAG X CGAA IUGGGUG 




2879 


ACCCACC A GAGUGAU 




AUCACUC CUGAUGAG X CGAA IGUGGGU 




2907 


GUGUGAC U GUGUGGG 




CCCACAC CUGAUGAG X CGAA IUCACAC 




2918 


UGGGAGC U GAUGACU 




AGUCAUC CUGAUGAG X CGAA ICUCCCA 




2925 


UGAUGAC U UUUGGGG 




CCCCAAA CUGAUGAG X CGAA IUCAUCA 




2934 


UUGGGGC C AAACCUU 




AAGGUUU CUGAUGAG X CGAA ICCCCAA 




2935 


UGGGGCC A AACCUUA 




UAAGGUU CUGAUGAG X CGAA IGCCCCA 




2939 


GCCAAAC C UUACGAU 




AUCGUAA CUGAUGAG X CGAA IUUUGGC 
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CCAAACC U UACGAUG 



CAUCGUA CUGAUGAG X CGAA IGUUUGG 



UGGGAUC C CAGCCCG 



CGGGCUG CUGAUGAG X CGAA IAUCCCA 



GGGAUCC C AGCCCGG 



CCGGGCU CUGAUGAG X CGAA IGAUCCC 



GGAUCCC A GCCCGGG 



CCCGGGC CUGAUGAG X CGAA IGGAUCC 



UCCCAGC C CGGGAGA 



UCUCCCG CUGAUGAG X CGAA ICUGGGA 



CCCAGCC C GGGAGAU 



AUCUCCC CUGAUGAG X CGAA IGCUGGG 



GGAGAUC C CUGACCU 



AGGUCAG CUGAUGAG X CGAA IAUCUCC 



GAGAUCC C UGACCUG 



CAGGUCA CUGAUGAG X CGAA IGAUCUC 



AGAUCCC I 
CCCUGAC C 



GCAGGUC CUGAUGAG X CGAA IGGAUCU 



UCCAGCA CUGAUGAG X CGAA IUCAGGG 



CCUGACC U GCUGGAA 



UUCCAGC CUGAUGAG X CGAA IGUCAGG 



GACCUGC U GGAAAAG 



CUUUUCC CUGAUGAG X CGAA ICAGGUC 



GAGCGGC U GCCCCAG 



CUGGGGC CUGAUGAG X C 



CGGCUGC ( 
GGCUGCC C 



GGGCUGG CUGAUGAG X CGAA ICAGCCG 



GGGGCUG CUGAUGAG X CGAA IGCAGCC 



GCUGCCC C AGCCCCC 



GGGGGCU CUGAUGAG X CGAA IGGCAGC 



CUGCCCC 1 
CCCCAGC ( 



GGGGGGC CUGAUGAG X CGAA IGGGCAG 



GAUGGGG CUGAUGAG X CGAA ICUGGGG 



CCCAGCC C CCCAUCU 



AGAUGGG CUGAUGAG X CGAA IGCUGGG 



CCAGCCC C CCAUCUG 



CAGAUGG CUGAUGAG X CGAA IGGCUGG 



CAGCCCC C 
AGCCCCC C 



GCCCCCC A UCUGCAC 



CCCCAUC U GCACCAU 



GCAGAUG CUGAUGAG X CGAA IGGGCUG 



UGCAGAU CUGAUGAG X CGAA IGGGGCU 



GUGCAGA CUGAUGAG X CGAA IGGGGGC 



AUGGUGC CUGAUGAG X CGAA IAUGGGG 



CAUCUGC A CCAUUGA 



UCAAUGG CUGAUGAG X CGAA ICAGAUG 



UCUGCAC ( 
CUGCACC 1 
UGAUGUC I 



CAUCAAU CUGAUGAG X CGAA IUGCAGA 



ACAUCAA CUGAUGAG X CGAA IGUGCAG 



AUCAUGU CUGAUGAG X CGAA IACAUCA 



UGUCUAC A UGAUCAU 



CAUGAUC A UGGUCAA 



AUGAUCA CUGAUGAG X CGAA IUAGACA 



UUGACCA CUGAUGAG X CGAA IAUCAUG 



CAUGGUC A AAUGUUG 



GAUUGAC U CUGAAUG 



CAACAUU CUGAUGAG X CGAA IACCAUG 



CAUUCAG CUGAUGAG X CGAA IUCAAUC 



UUGACUC I 
UGUCGGC C 



GACAUUC CUGAUGAG X CGAA IAGUCAA 



GAAUCUU CUGAUGAG X CGAA ICCGACA 



GUCGGCC A AGAUUCC 



GGAAUCU CUGAUGAG X CGAA IGCCGAC 



AAGAUUC C 
UGGUGUC I 



AACUCCC CUGAUGAG X CGAA IAAUCUU 



AGAAUUC CUGAUGAG X CGAA IACACCA 



UGAAUUC U CCCGCAU 



AUGCGGG CUGAUGAG X CGAA IAAUUCA 



AAUUCUC C 
AUUCUCC ( 



CUCCCGC A UGGCCAG 



CCAUGCG CUGAUGAG X CGAA IAGAAUU 



GCCAUGC CUGAUGAG X CGAA IGAGAAU 



CUGGCCA CUGAUGAG X CGAA ICGGGAG 



GCAUGGC C AGGGACC 



GGUCCCU CUGAUGAG X CGAA ICCAUGC 



CAUGGCC A GGGACCC 



CAGGGAC C CCCAGCG 



AGGGACC C CCAGCGC 



GGGUCCC CUGAUGAG X CGAA 
CGCUGGG CUGAUGAG X CGAA 



IUCCCUG 

GCGCUGG CUGAUGAG X CGAA IGUCCCU 



GGGACCC C CAGCGCU 



AGCGCUG CUGAUGAG X CGAA IGGUCCC 



GGACCCC C AGCGCUU 



AAGCGCU CUGAUGAG X CGAA IGGGUCC 
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GACCCCC A GCGCUUU 



CCAGCGC U UUGUGGU 



AAAGCGC CUGAUGAG X CGAA IGGGGUC 
ACCACAA CUGAUGAG X CGAA ICGCUGG 



UGUGGUC A UCCAGAA 



GGUCAUC C AGAAUGA 



GUCAUCC A GAAUGAG 



i CUGAUGAG > 
I CUGAUGAG } 



i IAUGACC 
CUCAUUC CUGAUGAG X CGAA IGAUGAC 



UGAGGAC U UGGGCCC 



GGGCCCA CUGAUGAG X CGAA IUCCUCA 



CUUGGGC C CAGCCAG 



CUGGCUG CUGAUGAG X CGAA ICCCAAG 



UUGGGCC C AGCCAGU 



ACUGGCU CUGAUGAG X CGAA IGCCCAA 



UGGGCCC A GCCAGUC 



GACUGGC CUGAUGAG X CGAA IGGCCCA 



GCCCAGC C AGUCCCU 



AGGGACU CUGAUGAG X CGAA ICUGGGC 



CCCAGCC A GUCCCUU 



AAGGGAC CUGAUGAG X CGAA IGCUGGG 



GCCAGUC C CUUGGAC 



GUCCAAG CUGAUGAG X CGAA IACUGGC 



CCAGUCC C UUGGACA 



UGUCCAA CUGAUGAG X CGAA IGACUGG 



CAGUCCC U UGGACAG 



CUGUCCA CUGAUGAG X CGAA IGGACUG 



CUUGGAC A GCACCUU 



AAGGUGC CUGAUGAG X CGAA IUCCAAG 



GGACAGC A CCUUCUA 



ACAGCAC C UUCUACC 



UAGAAGG CUGAUGAG X CGAA ICUGUCC 



GGUAGAA CUGAUGAG X CGAA IUGCUGU 



CAGCACC U UCUACCG 



CGGUAGA CUGAUGAG X CGAA IGUGCUG 



CACCUUC U ACCGCUC 



GAGCGGU CUGAUGAG X CGAA IAAGGUG 



CUUCUAC C GCUCACU 



AGUGAGC CUGAUGAG X CGAA IUAGAAG 



CUACCGC U CACUGCU 



AGCAGUG CUGAUGAG X CGAA ICGGUAG 



CCAGCAG CUGAUGAG X CGAA IAGCGGU 



CGCUCAC U GCUGGAG 



CUCCAGC CUGAUGAG X CGAA IUGAGCG 



UCACUGC U GGAGGAC 



GUCCUCC CUGAUGAG X CGAA ICAGUGA 



CGAUGAC A UGGGGGA 



UCCCCCA CUGAUGAG X CGAA IUCAUCG 



GGGGGAC C UGGUGGA 



UCCACCA CUGAUGAG X CGAA IUCCCCC 



GGGGACC U GGUGGAU 



AUCCACC CUGAUGAG X CGAA IGUCCCC 



UGGAUGC U GAGGAGU 



ACUCCUC CUGAUGAG X CGAA ICAUCCA 



GAGUAUC U GGUACCC 



CUGGUAC C CCAGCAG 



UGGUACC C CAGCAGG 



GGGUACC CUGAUGAG X CGAA IAUACUC 



CUGCUGG CUGAUGAG X CGAA IUACCAG 
CCUGCUG CUGAUGAG X CGAA IGUACCA 



GGUACCC G AGCAGGG 



CCCUGCU CUGAUGAG X CGAA IGGUACC 



GUACCCC A GCAGGGC 



CCCCAGC A GGGCUUC 



GCCCUGC CUGAUGAG X CGAA IGGGUAC 



GAAGCCC CUGAUGAG X CGAA ICUGGGG 



GCAGGGC U UCUUCUG 



CAGAAGA CUGAUGAG X CGAA ICCCUGC 



GGGCUUC U UCUGUCC 



GGACAGA CUGAUGAG X CGAA IAAGCCC 



CUUCUUC U GUCCAGA 



UUCUGUC C AGACCCU 



UCUGGAC CUGAUGAG X CGAA IAAGAAG 



AGGGUCU CUGAUGAG X CGAA IACAGAA 



UCUGUCC A GACCCUG 



CAGGGUC CUGAUGAG X CGAA IGACAGA 



UCCAGAC C CUGCCCC 



GGGGCAG CUGAUGAG X CGAA IUCUGGA 



CCAGACC C UGCCCCG 



CAGACCC U GCCCCGG 



ACCCUGC C CCGGGCG 



CGGGGCA CUGAUGAG X CGAA IGUCUGG 



CCGGGGC CUGAUGAG X CGAA IGGUCUG 



CGCCCGG CUGAUGAG X CGAA ICAGGGU 



CCCUGCC C CGGGCGC 



GCGCCCG CUGAUGAG X CGAA IGCAGGG 



CCUGCCC C GGGCGCU 



CGGGCGC U GGGGGCA 



AGCGCCC CUGAUGAG X CGAA IGGCAGG 



UGCCCCC CUGAUGAG X CGAA ICGCCCG 



UGGGGGC A UGGUCCA 



UGGACCA CUGAUGAG X CGAA ICCCCCA 
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CAUGGUC C ACCACAG 



CUGUGGU CUGAUGAG X CGAA IACCAUG 



CCUGUGG CUGAUGAG X CGAA IGACCAU 



AUGGUCC A CCACAGG 



UGCCUGU CUGAUGAG X CGAA IUGGACC 



GGUCCAC C ACAGGCA 
GUCCACC A CAGGCAC 



CCACCAC A GGCACCG 
CACAGGC A CCGCAGC 
CAGGCAC C GCAGCUC 
GCACCGC A GCUCAUC 
CCGCAGC U CAUCUAC 
GCAGCUC A UCUACCA 
GCUCAUC U ACCAGGA 
CAUCUAC C AGGAGUG 
AUCUACC A GGAGUGG 
UGGGGAC C UGACACU 
GGGGACC U GACACUA 
ACCUGAC A CUAGGGC 
CUGACAC U AGGGCUG 
CUAGGGC U GGAGCCC 
CUGGAGC C CUCUGAA 
UGGAGCC C UCUGAAG 
GGAGCCC U CUGAAGA 
AGCCCUC U GAAGAGG 
5AGGC C CCCAGGU 
iGGCC C CCAGGUC 
GA GGCCC C CAGGUCU ' 
AGGCCCC C AGGUCUC 
GGCCCCC A GGUCUCC 
CCAGGUC U CCACUGG 
AGGUCUC C ACUGGCA 
GGUCUCC A CUGGCAC 
UCUCCAC U GGCACCC 
;UGGC A CCCUCCG 
CUGGCAC C CUCCGAA 
UGGCACC C UCCGAAG 
GGCACCC U CCGAAGG 
!!CCUC C GAAGGGG 
AAGGGGC U GGCUCCG 
GGCUGGC U CCGAUGU 
CUGGCUC C GAUGUAU 
UGGUGAC C UGGGAAU 
GGUGACC U GGGAAUG 
UGGGGGC A GCCAi 
GGGCAGC C AAGGC 
GGCAGCC A AGGGGCU 
AAGGGGC U GCAAAGC"" 
GGGCUGC A AAGCCUC 
GCAAAGC C UCCCCAC 



GUGCCUG CUGAUGAG X CGAA IGUGGAC 



CGGUGCC CUGAUGAG X CGAA IUGGUGG 
GCUGCGG CUGAUGAG X CGAA ICCUGUG 
GAGCUGC CUGAUGAG X CGAA IUGCCUG 
GAUGAGC CUGAUGAG X CGAA ICGGUGC 
GUAGAUG CUGAUGAG -X CGAA ICUGCGG 
UGGUAGA CUGAUGAG X CGAA IAGCUGC 
UCCUGGU CUGAUGAG X CGAA IAUGAGC 
CACUCCU CUGAUGAG X CGAA IUAGAUG 
CCACUCC CUGAUGAG X C GAA IGUAGAU 
AGUGUCA CUGAUGAG X CGAA IUCCCCA 
UAGUGUC CUGAUGAG X CGAA IGUCCCC 
GCCCUAG CUGAUGAG X CGAA IUCAGGU 
CAGCCCU CUGAUGAG X CGAA IUGUCAG 
GGGCUCC CUGAUGAG X CGAA ICCCUAG 
UUCAGAG CUGAUGAG X CGAA ICUCCAG 
CUUCAGA CUGAUGAG X CGAA IGCUCCA 
UCUUCAG CUGAUGAG X CGAA IGGCUCC 
CCUCUUC CUGAUGAG X CGAA IAGGGCU 
ACCUGGG CUGAUGAG X CGAA ICCUCCU 
GACCUGG CUGAUGAG X CGAA IGCCUCC 
AGACCUG CUGAUGAG X CGAA IGGCCUC 
GAGACCU CUGAUGAG X CGAA IGGGCCU 
GGAGACC CUGAUGAG X CGAA IGGGGCC 
CCAGUGG CUGAUGAG X CGAA IACCUGG 



UGCCAGU CUGAUGAG X CGAA 
GUGCCAG CUGAUGAG X CGAA 



IAGACCU 
IGAGACC 



GGGUGCC CUGAUGAG X CGAA IUGGAGA 
CGGAGGG CUGAUGAG X CGAA ICCAGUG 
UUCGGAG CUGAUGAG X CGAA IUGCCAG 
CUUCGGA CUGAUGAG X CGAA IGUGCCA 
CCUUCGG CUGAUGAG X CGAA IGGUGCC 
CCCCUUC CUGAUGAG X CGAA IAGGGUG 
CGGAGCC CUGAUGAG X CGAA ICCCCUU 
ACAUCGG CUGAUGAG X CGAA ICCAGCC 
AUACAUC CUGAUGAG X CGAA IAGCCAG 
AUUCCCA CUGAUGAG X CGAA IUCACCA 
CAUUCCC CUGAUGAG X CGAA IGUCACC 
CCUUGGC CUGAUGAG X CGAA ICCCCCA 
GCCCCUU CUGAUGAG X CGAA ICUGCCC 
AGCCCCU CUGAUGAG X CGAA IGCUGCC 
GCUUUGC CUGAUGAG X CGAA ICCCCUU 
GAGGCUU CUGAUGAG X CGAA ICAGCCC 
GUGGGGA CUGAUGAG X CGAA ICUUUGC 
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CAAAGCC U CCCCACA 
AAGCCUC C CCACACA 



UGUGGGG CUGAUGAG X C 
UGUGUGG CUGAUGAG X CGAA IAGGCUU 



AUGUGUG CUGAUGAG X CGAA IGAGGCU 



AGCCUCC C CACACAU 



GCCUCCC C ACACAUG 
CCUCCCC A CACAUGA 



CAUGUGU CUGAUGAG X CGAA IGGAGGC 



UCAUGUG CUGAUGAG X CGAA IGGGAGG 



GGUCAUG CUGAUGAG X CGAA IUGGGGA 



UCCCCAC A CAUGACC 



GGGGUCA CUGAUGAG X CGAA IUGUGGG 
GGGCUGG CUGAUGAG X CGAA IUCAUGU 
AGGGCUG CUGAUGAG X CGAA IGUCAUG 
GAGGGCU CUGAUGAG X CGAA IGGUCAU 
AGAGGGC CUGAUGAG X CGAA IGGGUCA 
UGUAGAG CUGAUGAG X CGAA ICUGGGG 
CUGUAGA CUGAUGAG X CGAA IGCUGGG 
GCUGUAG CUGAUGAG X CGAA IGGCUGG 
CCGCUGU CUGAUGAG X CGAA IAGGGCU 
GUACCGC CUGAUGAG X CGAA IUAGAGG 
UCCUCAC CUGAUGAG X CGAA IUACCGC 
ACUGUGG CUGAUGAG X CGAA IUCCUCA 
UACUGUG CUGAUGAG X CGAA IGUCCUC 
IGGUCCU 

GGUACUG CUGAUGAG X CGAA IGGGUCC 
GGGGUAC CUGAUGAG X CGAA IUGGGGU 
GGGCAGG CUGAUGAG X CGAA IUACUGU 
AGGGCAG CUGAUGAG X CGAA IGUACUG 
GAGGGCA CUGAUGAG X CGAA IGGUACU 
AGAGGGC CUGAUGAG X CGAA IGGGUAC 
CUCAGAG CUGAUGAG X CGAA ICAGGGG 
UCUCAGA CUGAUGAG X CGAA IGCAGGG 
GUCUCAG CUGAUGAG X CGAA IGGCAGG 
CAGUCUC CUGAUGAG X CGAA IAGGGCA 
AGCCAUC CUGAUGAG X CGAA IUCUCAG 
GCAACGU CUGAUGAG X CGAA ICCAUCA 
UCAGGGG CUGAUGAG X CGAA ICAACGU 
GUCAGGG CUGAUGAG X CGAA IGCAACG 
GGUCAGG CUGAUGAG X CGAA IGGCAAC 
AGGUCAG CUGAUGAG X CGAA IGGGCAA 
IGGGGCA 
IGGGGGC 

GGCUGCA CUGAUGAG X CGAA IUCAGGG 
GGGCUGC C 



3491 
3492 



3498 
3500 



35S6 
3570 



CCCACAC A UGACCCC 
ACAUGAC C CCAGCCC 
CAUGACC C CAGCCCU 
AUGACCC C AGCCCUC 
UGACCCC A GCCCUCU 
CCCCAGC C CUCUACA 
CCCAGCC C UCUACAG 
CCAGCCC U CUACAGC 
AGCCCUC U ACAGCGG 
CCUCUAC A GCGGUAC 
GCGGUAC A GUGAGGA 
UGAGGAC C CCACAGU 
GAGGACC C CACAGUA 
AGGACCC C ACAGUAC 
GGACCCC A CAGUACC 
ACCCCAC A GUACCCC 
ACAGUAC C CCUGCCC 
CAGUACC C CUGCCCU 
AGUACCC C UGCCCUC 
CCCC U GCCCUCU 
CUGC C CUCUGAG 
CCCUGCC C UCUGAGA 
CCUGCCC U CUGfl 
UGCCCUC U GAGACUG 
IAGAC U GAUGGCU 
,UGGC U ACGUUGC 
ACGUUGC C CCCCUGA 
FUGCC C CCCUGAC 
IGCCC C CCUGACC 
ICCCC C CUGACCU 
UGCCCCC C UGACCUG 
GCCCCCC U GACCUGC 
CCCUGAC C UGCAGCC 
CCUGACC U GCAGCCC 
GACCUGC A GCCCCCA 
CUGCAGC C CCCAGCC 
UGCAGCC C CCAGCCU 
GCAGCCC C CAGCCUG 
CAGCCCC C AGCCUGA 
AGCCCCC A GCCUGAA 
CCCCAGC C UGAAUAU 



GUACUGU CUGAUGAG X CGAA 



CAGGUCA CUGAUGAG X CGAA 
GCAGGUC CUGAUGAG X CGAA 



: CUGAUGAG X CGAA IGUCAGG 
UGGGGGC CUGAUGAG X CGAA ICAGGUC 
GGCUGGG CUGAUGAG X CGAA ICUGCAG 



AGGCUGG CUGAUGAG X CGAA 3 



> IGCUGCA 
CAGGCUG CUGAUGAG X CGAA IGGCUGC 
UCAGGCU CUGAUGAG X CGAA IGGGCUG 
UUCAGGC CUGAUGAG X CGAA IGGGGCU 
AUAUUCA CUGAUGAG X CGAA ICUGGGG 
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Table 34 





CCCAGCC U GAAUAUG 




CAUAUUC CUGAUGAG X CGAA IGCUGGG 




— Jiii 


UGUGAAC C AGCCAGA 




UCUGGCU CUGAUGAG X CGAA IUUCACA 




_ 3599 . 


GUGAACC A GCCAGAU 




AUCUGGC CUGAUGAG X CGAA IGUUCAC 




3602 


AACCAGC C AGAUGUU 




AACAUCU CUGAUGAG X CGAA ICUGGUU 




3603 


ACCAGCC A GAUGUUC 




GAACAUC CUGAUGAG X CGAA IGCUGGU 




3614 


GUUCGGC C CCAGCCC 




GGGCUGG CUGAUGAG X CGAA ICCGAAC 




3615 


UUCGGCC C CAGCCCC 




GGGGCUG CUGAUGAG X CGAA IGCCGAA 




3616 


UCGGCCC C AGCCCCC 




GGGGGCU CUGAUGAG X CGAA IGGCCGA 




3617 






AGGGGGC CUGAUGAG X CGAA IGGGCCG 




3620 


CCCCAGC C CCCUUCG 




CGAAGGG CUGAUGAG X CGAA ICUGGGG 




3621 






GCGAAGG CUGAUGAG X CGAA IGCUGGG 




3622 


CCAGCCC C CUUCGCC 




GGCGAAG CUGAUGAG X CGAA IGGCUGG 




3623 






GGGCGAA CUGAUGAG X CGAA IGGGCUG 




3624 


AGCCCCC U UCGCCCC 




GGGGCGA CUGAUGAG X CGAA IGGGGCU 




3629 






CUCUCGG CUGAUGAG X CGAA ICGAAGG 




3630 


ccuucgc c 

CUUCGCC C _ 




CCUCUCG CUGAUGAG X CGAA IGCGAAG 




3631 






CCCUCUC CUGAUGAG X CGAA IGGCGAA 




3640 


jjucgccc c GAGA ^° 

AGAGGGC C CU 




GGCAGAG CUGAUGAG X CGAA ICCCUCU 




3641 


GAGGGCC C UCUGCCU 




AGGCAGA CUGAUGAG X CGAA IGCCCUC 




3642 


AGGGCCC U CUGCCUG 




CAGGCAG CUGAUGAG X CGAA IGGCCCU 




3644 






AGCAGGC CUGAUGAG X CGAA IAGGGCC 




3647 


GGCCCUC U 
CCUCUGC C UGCUGCC 




GGCAGCA CUGAUGAG X CGAA ICAGAGG 




3648 






GGGCAGC CUGAUGAG X CGAA IGCAGAG 




3651 


CUCUGCC U 




GUCGGGC CUGAUGAG X CGAA ICAGGCA 




3654 


^GCUGC C CGACCUG 




CAGGUCG CUGAUGAG X CGAA ICAGCAG 




3655 


UGCUGCC C GAC 




GCAGGUC CUGAUGAG X CGAA IGCAGCA 




3659 


GCCCGAC C UGCUGGU 




ACCAGCA CUGAUGAG X CGAA IUCGGGC 




3660 






CACCAGC CUGAUGAG X CGAA IGUCGGG 




3663 


CCCGACC U 
GACCUGC U _____ 




UGGCACC CUGAUGAG X CGAA ICAGGUC 




3669 


CUGGUGC C A 




CCAGAGU CUGAUGAG X CGAA ICACCAG 




3670 


UGGUGCC A CUCUGGA 




UCCAGAG CUGAUGAG X CGAA IGCACCA 




3672 






UUUCCAG CUGAUGAG X CGAA IUGGCAC 




3674 


GUuCCAC _ ggaaagg 
GCCACUC U 




CCUUUCC CUGAUGAG X CGAA IAGUGGC 




3683 






AGUCUUG CUGAUGAG X CGAA ICCUUUC 




3684 


AAAGGCC C AAGACUC 




GAGUCUU CUGAUGAG X CGAA IGCCUUU 




3o85 


AAGGCCC A AGACUCU 




AGAGUCU CUGAUGAG X CGAA IGGCCUU 




3590 






GGGAGAG CUGAUGAG X CGAA IUCUUGG 




3692 


AAGACUC U CUCCCCA 




UGGGGAG CUGAUGAG X CGAA IAGUCUU 






GACUCUC U CCCCAGG 




CCUGGGG CUGAUGAG X CGAA IAGAGUC 




3696 


CUCUCUC C CCAGGGA 




UCCCUGG CUGAUGAG X CGAA IAGAGAG 




3697 


UCUCUCC C CAGGGAA 




UUCCCUG CUGAUGAG X CGAA 1GAGAGA 




3698 


CUCUCCC C AGGGAAG 




CUUCCCU CUGAUGAG X CGAA IGGAGAG 




3699 


UCUCCCC A GGGAAGA 




UCUUCCC CUGAUGAG X CGAA IGGGAGA 




3718 


GGUCGUC A AAGACGU 




ACGUCUU CUGAUGAG X CGAA IACGACC 




3732 


UUUUUGC C UUUGGGG 




CCCCAAA CUGAUGAG X CGAA ICAAAAA 




3733 


UUUUGCC U UUGGGGG 




CCCCCAA CUGAUGAG X CGAA IGCAAAA 




1 3744 


GGGGUGC C GUGGAGA 




UCUCCAC CUGAUGAG X CGAA ICACCCC 
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GGAGAAC C CCGAGUA 



GAGAACC C CGAGUAC 



AGAACCC C GAGUACU 



CGAGUAC U UGACACC 



: A CCCCAGG 

: c ccaggga" 



UGACACC C 
GACACCC C 



ACACCCC A GGGAGGA 



GAGGAGC U GCCCCUC 



GAGCUGC C CCUCAGC 



AGCUGCC C CUCAGCC 



GCUGCCC C UCAGCCC 



CUGCCCC U CAGCCCC 



GCCCCUC A GCCCCAC 



CCUCAGC C CCACCCU 



CUCAGCC C CACCCUC 



UCAGCCC C ACCCUCC 



CAGCCCC A CCCUCCU 



GCCCCAC C CUCCUCC 



CCCCACC C UCCUCCU 



UACUCGG CUGAUGAG X CGAA IUUCUCC 



GUACUCG CUGAUGAG X CGAA IGUUCUC 



AGUACUC CUGAUGAG X CGAA IGGUUCU 



GGUGUCA CUGAUGAG X CGAA IUACUCG 



CCUGGGG CUGAUGAG X CGAA IUCAAGU 



UCCCUGG CUGAUGAG X CGAA IUGUCAA 



CUCCCUG CUGAUGAG X CGAA IGUGUCA 



CCUCCCU CUGAUGAG X CGAA IGGUGUC 



UCCUCCC CUGAUGAG X CGAA IGGGUGU 



GAGGGGC CUGAUGAG X CGAA ICUCCUC 



GCUGAGG CUGAUGAG X CGAA ICAGCUC 



GGCUGAG CUGAUGAG X CGAA IGCAGCU 



GGGCUGA CUGAUGAG X CGAA IGGCAGC 



GGGGCUG CUGAUGAG X CGAA IGGGCAG 



GUGGGGC CUGAUGAG X CGAA IAGGGGC 



AGGGUGG CUGAUGAG X CGAA ICUGAGG 



GAGGGUG CUGAUGAG X CGAA IGCUGAG 



GGAGGGU CUGAUGAG X CGAA IGGCUGA 



AGGAGGG CUGAUGAG X CGAA IGGGCUG 



GGAGGAG CUGAUGAG X CGAA IUGGGGC 



AGGAGGA CUGAUGAG X CGAA 



IGUGGGG 
IGGUGGG 



CCCACCC I 
CACCCUC C 



ACCCUCC U CCUGCCU 



CCUCCUC C UGCCUUC 



CUCCUCC U GCCUUCA 



CUCCUGC C UUCAGCC 



UCCUGCC U UCAGCCC 



UGCCUUC A GCCCAGC 



CUUCAGC C CAGCCUU 



UUCAGCC C AGCCUUC 



UCAGCCC 
GCCCAGC 



C UUCGACA 
U UCGACAA 



CUUCGAC A ACCUCUA 



CGACAAC C UCUAUUA 



GACAACC I 
CAACCUC I 



CUAUUAC U GGGACCA 



CUGGGAC C AGGACCC 



UGGGACC A GGACCCA 



CCAGGAC C CACCAGA 



CAGGACC C AC C AG AG 



AGGACCC A CCAGAGC 



GACCCAC C AGAGCGG 



ACCCACC A GAGCGGG 



GGGGGGC U CCACCCA 



CAGGAGG CUGAUGAG X CGAA 



GGCAGGA CUGAUGAG X CGAA IAGGGUG 



AGGCAGG CUGAUGAG X CGAA IGAGGGU 



GAAGGCA CUGAUGAG X CGAA IAGGAGG 



UGAAGGC CUGAUGAG X CGAA IGAGGAG 



GGCUGAA CUGAUGAG X CGAA ICAGGAG 



GGGCUGA CUGAUGAG X CGAA IGCAGGA 
GCUGGGC CUGAUGAG X CGAA IAAGGCA 



AAGGCUG CUGAUGAG X CGAA ICUGAAG 
GAAGGCU CUGAUGAG X CGAA IGCUGAA 



CGAAGGC CUGAUGAG X CGAA IGGCUGA 



UGUCGAA CUGAUGAG X CGAA ICUGGGC 



UUGUCGA CUGAUGAG X CGAA IGCUGGG 



UAGAGGU CUGAUGAG X CGAA IUCGAAG 



UAAUAGA CUGAUGAG X CGAA IUUGUCG 



GUAAUAG CUGAUGAG X CGAA IGUUGUC 



CAGUAAU CUGAUGAG X CGAA I 
UGGUCCC CUGAUGAG X CGAA 1 



GGGUCCU CUGAUGAG X CGAA IUCCCAG 
; CUGAUGAG X CGAA IGUCCCA 



UGGGUCC C 
UCUGGUG CUGAUGAG X CGAA IUCCUGG 



CUCUGGU CUGAUGAG X CGAA IGUCCUG 



GCUCUGG CUGAUGAG X CGAA IGGUCCU 



CCGCUCU CUGAUGAG X CGAA IUGGGUC 



CCCGCUC CUGAUGAG X CGAA IGUGGGU 



UGGGUGG CUGAUGAG X CGAA ICCCCCC 
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GGGGCUC C ACCCAGC 
GGGCUCC A CCCAGCA 
GCUCCAC C CAGCACC 
CUCCACC C AGCACCU 
UCCACCC A GCACCUU 
ACCCAGC A CCUUCAA 
CCAGCAC C UUCAAAG 
CAGCACC U UCAAAGG 
CACCUUC A AAGGGAC 
AAGGGAC A CCUACGG 
GGGACAC C UACGGCA 
GGACACC U ACGGCAG 
CUACGGC A GAGAACC 
AGAGAAC C CAGAGUA 
GAGAACC C AGAGUAC 
AGAACCC A GAGUACC 
AGAGUAC C UGGGUCU 
GAGUACC U GGGUCUG 
CUGGGUC U GGACGUG 
GACGUGC C AGUGUGA 
ACGUGCC A GUGUGAA 
UGUGAAC C AGAAGGC 
GUGAACC A GAAGGCC 
AGAAGGC C AAGUCCG 
GAAGGCC A AGUCCGC 
CCAAGUC C GCAGAAG 
AGUCCGC A GAAGCCC 
CAGAAGC C CUGAUGU 
AGAAGCC C UGAUGUG 
GAAGCCC U GAUGUGU 
AUGUGUC C UCAGGGA 
UGUGUCC U CAGGGAG 
UGUCCUC A GGGAGCA 
AGGGAGC A GGGAAGG 
GGAAGGC C UGACUUC 
GAAGGCC U GACUUCU 
GCCUGAC U UCUGCUG 
UGACUUC U GCUGGCA 
CUUCUGC U GGCAUCA 
UGCUGGC A UCAAGAG 
UGGCAUC A AGAGGUG 
GGAGGGC C CUCCGAC 
GAGGGCC C UCCGACC 
AGGGCCC U CCGACCA 
GGCCCUC C GACCACU 
CUCCGAC C ACUUCCA 
UCCGACC A CUUCCAG 



GCUGGGU CUGAUGAG X CGAA IAGCCCC 
UGCUGGG CUGAUGAG X CGAA IGAGCCC 
GGUGCUG CUGAUGAG X CGAA IUGGAGC 
AGGUGCU CUGAUGAG X CGAA IGUGGAG 
AAGGUGC CUGAUGAG X CGAA IGGUGGA 
UUGAAGG CUGAUGAG X CGAA ICUGGGU 
CUUUGAA CUGAUGAG X CGAA IUGCUGG 
CCUUUGA CUGAUGAG X CGAA IGUGCUG 
GUCCCUU CUGAUGAG X CGAA IAAGGUG 
CCGUAGG CUGAUGAG X CGAA IUCCCUU 
UGCCGUA CUGAUGAG X CGAA IUGUCCC 
CUGCCGU CUGAUGAG X CGAA IGUGUCC 
GGUUCUC CUGAUGAG X CGAA ICCGUAG 
UACUCUG CUGAUGAG X CGAA IUUCUCU 
GUACUCU CUGAUGAG X CGAA IGUUCUC 
GGUACUC CUGAUGAG X CGAA IGGUUCU 
AGACCCA CUGAUGAG X CGAA IUACUCU 
CAGACCC CUGAUGAG X CGAA IGUACUC 
CACGUCC CUGAUGAG X CGAA IACCCAG 
UCACACU CUGAUGAG X CGAA ICACGUC 
UUCACAC CUGAUGAG X CGAA IGCACGU 
GCCUUCU CUGAUGAG X CGAA IUUCACA 
GGCCUUC CUGAUGAG X CGAA IGUUCAC 
CGGACUU CUGAUGAG X CGAA ICCUUCU 
GCGGACU CUGAUGAG X CGAA IGCCUUC 
CUUCUGC CUGAUGAG X CGAA IACUUGG 
GGGCUUC CUGAUGAG X CGAA ICGGACU 
ACAUCAG CUGAUGAG X CGAA ICUUCUG 
CACAUCA CUGAUGAG X CGAA IGCUUCU 
ACACAUC CUGAUGAG X CGAA IGGCUUC 
UCCCUGA CUGAUGAG X CGAA IACACAU 
CUCCCUG CUGAUGAG X CGAA IGACACA 
UGCUCCC CUGAUGAG X CGAA IAGGACA 
CCUUCCC CUGAUGAG X CGAA ICUCCCU 
GAAGUCA CUGAUGAG X CGAA ICCUUCC 
AGAAGUC CUGAUGAG X CGAA IGCCUUC 
CAGCAGA CUGAUGAG X CGAA IUCAGGC 
UGCCAGC CUGAUGAG X CGAA IAAGUCA 
UGAUGCC CUGAUGAG X CGAA ICAGAAG 
CUCUUGA CUGAUGAG X CGAA ICCAGCA 
CACCUCU CUGAUGAG X CGAA IAUGCCA 
GUCGGAG CUGAUGAG X CGAA ICCCUCC 
GGUCGGA CUGAUGAG X CGAA IGCCCUC 
UGGUCGG CUGAUGAG X CGAA IGGCCCU 
AGUGGUC CUGAUGAG X CGAA IAGGGCC 
UGGAAGU CUGAUGAG X CGAA IUCGGAG 
CUGGAAG CUGAUGAG X CGAA IGUCGGA 
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CGACCAC U UCCAGGG 



CCCUGGA CUGAUGAG X CGAA IUGGUCG 



CCACUUC C AGGGGAA 



CACUUCC A GGGGAAC 



UUCCCCU CUGAUGAG X CGAA IAAGUGG 



GUUCCCC CUGAUGAG X CGAA IGAAGUG 



GGGGAAC C UGCCAUG 



CAUGGCA CUGAUGAG X CGAA IUUCCCC 



GGGAACC U GCCAUGC 



GCAUGGC CUGAUGAG X CGAA IGUUCCC 



AACCUGC C AUGCCAG 



CUGGCAU CUGAUGAG X CGAA ICAGGUU 



ACCUGCC A UGCCAGG 



CCUGGCA CUGAUGAG X CGAA IGCAGGU 



GCCAUGC ( 
CCAUGCC 1 



GGUUCCU CUGAUGAG X CGAA ICAUGGC 



AGGUUCC CUGAUGAG X CGAA IGCAUGG 



CAGGAAC C UGUCCUA 



UAGGACA CUGAUGAG X CGAA IUUCCUG 



AGGAACC U GUCCUAA 



ACCUGUC C UAAGGAA 



CCUGUCC U AAGGAAC 



UUAGGAC CUGAUGAG X CGAA IGUUCCU 



UUCCUUA CUGAUGAG X CGAA IACAGGU 



GUUCCUU CUGAUGAG X CGAA IGACAGG 



AAGGAAC C UUCCUUC 



GAAGGAA CUGAUGAG X CGAA IUUCCUU 



AGGAACC U UCCUUCC 



GGAAGGA CUGAUGAG X CGAA IGUUCCU 



AACCUUC C UUCCUGC 



GCAGGAA CUGAUGAG X CGAA IAAGGUU 



ACCUUCC U UCCUGCU 



UUCCUUC C UGCUUGA 



AGCAGGA CUGAUGAG X CGAA IGAAGGU 



UCAAGCA CUGAUGAG X CGAA IAAGGAA 



UCCUUCC U GCUUGAG 



CUCAAGC CUGAUGAG X CGAA IGAAGGA 



UUCCUGC U UGAGUUC 



GAACUCA CUGAUGAG X CGAA ICAGGAA 



UGAGUUC C CAGAUGG 



CCAUCUG CUGAUGAG X CGAA IAACUCA 



GAGUUCC C AGAUGGC 



AGUUCCC A GAUGGCU 



GCCAUCU CUGAUGAG X CGAA IGAACUC 



AGCCAUC CUGAUGAG X CGAA IGGAACU 



AGAUGGC U GGAAGGG 



CCCUUCC CUGAUGAG X CGAA ICCAUCU 



AGGGGUC C AGCCUCG 



GGGGUCC A GCCUCGU 



CGAGGCU CUGAUGAG X CGAA IACCCCU 



ACGAGGC CUGAUGAG X CGAA IGACCCC 



GUCCAGC C UCGUUGG 



CCAACGA CUGAUGAG X CGAA ICUGGAC 



UCCAGCC U CGUUGGA 



UCCAACG CUGAUGAG X CGAA IGCUGGA 



GAGGAAC A GCACUGG 



CCAGUGC CUGAUGAG X CGAA IUUCCUC 



GAACAGC / 
ACAGCAC I 



GGGAGUC U UUGUGGA 



UCCCCAG CUGAUGAG X CGAA ICUGUUC 



ACUCCCC CUGAUGAG X CGAA IUGCUGU 



UCCACAA CUGAUGAG X CGAA IACUCCC 



GGGCCUC CUGAUGAG X CGAA IAAUCCA 



CUGAGGC C CUGCCCA 



UGAGGCC C UGCCCAA 



UGGGCAG CUGAUGAG X CGAA ICCUCAG 



UUGGGCA CUGAUGAG X CGAA IGCCUCA 



GAGGCCC U GCCCAAU 



AUUGGGC CUGAUGAG X CGAA IGGCCUC 



GCCCUGC C CAAUGAG 



CUCAUUG CUGAUGAG X CGAA ICAGGGC 



CCCUGCC C AAUGAGA 



UCUCAUU CUGAUGAG X CGAA IGCAGGG 



CCUGCCC A AUGAGAC 



AUGAGAC U CUAGGGU 



GAGACUC U AGGGUCC 



GUCUCAU CUGAUGAG X CGAA IGGCAGG 



ACCCUAG CUGAUGAG X CGAA IUCUCAU 



GGACCCU CUGAUGAG X CGAA IAGUCUC 



UAGGGUC C AGUGGAU 



AUCCACU CUGAUGAG X CGAA IACCCUA 



AGGGUCC A GUGGAUG 



UGGAUGC C ACAGCCC 



CAUCCAC CUGAUGAG X CGAA IGACCCU 



GGGCUGU CUGAUGAG X CGAA ICAUCCA 



GGAUGCC A CAGCCCA 



UGGGCUG CUGAUGAG X CGAA IGCAUCC 



AUGCCAC A GCCCAGC 



GCUGGGC CUGAUGAG X CGAA IUGGCAU 



CCACAGC C CAGCUUG 



CAAGCUG CUGAUGAG X CGAA ICUGUGG 
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CACAGCC C AGCUUGG 



CCAAGCU CUGAUGAG X CGAA IGCUGUG 



ACAGCCC A GCUUGGC 



GCCAAGC CUGAUGAG X CGAA IGGCUGU 



GCCCAGC U UGGCCCU 



AGGGCCA CUGAUGAG X CGAA ICUGGGC 



GCUUGGC C CUUUCCU 



AGGAAAG CUGAUGAG X CGAA ICCAAGC 



CUUGGCC C UUUCCUU 



AAGGAAA CUGAUGAG X CGAA IGCCAAG 



UUGGCCC U UUCCUUC 



GAAGGAA CUGAUGAG X CGAA IGGCCAA 



CCCUUUC C UUCCAGA 



UCUGGAA CUGAUGAG X CGAA IAAAGGG 



CCUUUCC U UCCAGAU 



AUCUGGA CUGAUGAG X CGAA IGAAAGG 



UUCCUUC C AGAUCCU 



AGGAUCU CUGAUGAG X CGAA IAAGGAA 



UCCUUCC A GAUCCUG 



CAGGAUC CUGAUGAG X CGAA IGAAGGA 



CCAGAUC C UGGGUAC 



GUACCCA CUGAUGAG X CGAA IAUCUGG 



CAGAUCC U GGGUACU 



AGUACCC CUGAUGAG X CGAA IGAUCUG 



UGGGUAC U GAAAGCC 



GGCUUUC CUGAUGAG X CGAA IUACCCA 



UGAAAGC C UUAGGGA 



UCCCUAA CUGAUGAG X CGAA ICUUUCA 



GAAAGCC U UAGGGAA 



UUCCCUA CUGAUGAG X CGAA IGCUUUC 



GGGAAGC U GGCCUGA 



UCAGGCC CUGAUGAG X CGAA ICUUCCC 



AGCUGGC C UGAGAGG 



CCUCUCA CUGAUGAG X CGAA ICCAGCU 



GCUGGCC U GAGAGGG 



CCCUCUC CUGAUGAG X CGAA IGCCAGC 



AAGCGGC C CUAAGGG 



CCCUUAG CUGAUGAG X CGAA ICCGCUU 



AGCGGCC C UAAGGGA 



UCCCUUA CUGAUGAG X CGAA IGCCGCU 



GCGGCCC U AAGGGAG 



CUCCCUU CUGAUGAG X CGAA IGGCCGC 



GAGUGUC U AAGAACA 



UGUUCUU CUGAUGAG X CGAA IACACUC 



UAAGAAC A AAAGCGA 



UCGCUUU CUGAUGAG X CGAA IUUCUUA 



AAGCGAC C CAUUCAG 



CUGAAUG CUGAUGAG X CGAA IUCGCUU 



AGCGACC C AUUCAGA 



UCUGAAU CUGAUGAG X CGAA IGUCGCU 



GCGACCC A UUCAGAG 



CUCUGAA CUGAUGAG X CGAA IGGUCGC 



CCCAUUC A GAGACUG 



CAGUCUC CUGAUGAG X CGAA IAAUGGG 



CAGAGAC U GUCCCUG 



CAGGGAC CUGAUGAG X CGAA IUCUCUG 



GACUGUC C CUGAAAC 



GUUUCAG CUGAUGAG X CGAA IACAGUC 



ACUGUCC C UGAAACC 



GGUUUCA CUGAUGAG X CGAA IGACAGU 



CUGUCCC U GAAACCU 



AGGUUUC CUGAUGAG X CGAA IGGACAG 



CUGAAAC C UAGUACU 



AGUACUA CUGAUGAG X CGAA IUUUCAG 



UGAAACC U AGUACUG 



CAGUACU CUGAUGAG X CGAA IGUUUCA 



CUAGUAC U GCCCCCC 



GGGGGGC CUGAUGAG X CGAA IUACUAG 



GUACUGC C CCCCAUG 



CAUGGGG CUGAUGAG X CGAA ICAGUAC 



UACUGCC C CCCAUGA 



UCAUGGG CUGAUGAG X CGAA IGCAGUA 



ACUGCCC C CCAUGAG 



CUCAUGG CUGAUGAG X CGAA IGGCAGU 



CUGCCCC C CAUGAGG 



CCUCAUG CUGAUGAG X CGAA IGGGCAG 



UGCCCCC C AUGAGGA 



UCCUCAU CUGAUGAG X CGAA IGGGGCA 



GCCCCCC A UGAGGAA 



UUCCUCA CUGAUGAG X CGAA IGGGGGC 



AAGGAAC A GCAAUGG 



CCAUUGC CUGAUGAG X CGAA IUUCCUU 



GAACAGC A AUGGUGU 



ACACCAU CUGAUGAG X CGAA ICUGUUC 



UGGUGUC A GUAUCCA 



CAGUAUC C AGGCUUU 



UGGAUAC CUGAUGAG X CGAA IACACCA 



AAAGCCU CUGAUGAG X CGAA IAUACUG 



AGUAUCC A GGCUUUG 



CAAAGCC CUGAUGAG X CGAA IGAUACU 



UCCAGGC U UUGUACA 



UGUACAA CUGAUGAG X CGAA ICCUGGA 



UUUGUAC A GAGUGCU 



AGCACUC CUGAUGAG X CGAA IUACAAA 



WO 01/16312 



PCT/US00/23998 



497 



AGAGUGC U UUUCUGU 



ACAGAAA CUGAUGAG X CGAA ICACUCU 



GCUUUUC U GUUUAGU 



ACUAAAC CUGAUGAG X CGAA IAAAAGC 



UUUUUAC U UUUUUUG 



CAAAAAA CUGAUGAG X CGAA IUAAAAA 



UAAAGAC C CAGGGGG 



CCCCCUG CUGAUGAG X CGAA IUCUUUA 



AAAGACC C AGGGGGA 



UCCCCCU CUGAUGAG X CGAA IGUCUUU 



AAGACCC A GGGGGAG 



CUCCCCC CUGAUGAG X CGAA IGGUCUU 



Seq Accession No. = HSERB2R (Human c-erb-B-2 mRNA; 4473 bp) 
Core Sequence = CUGAUGAG X CGAA (X = GCCGAAAGGC or other 
stem II) 
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Table 35 



Table 35: HBV Strains and Accession numbers 



Accession 
Number 


Name 


AF100308 . 1 


AF100308 Hepatitis B virus strain 2-18, complete 


AB026815 . 1 


AB ° 268 gg Hepatitis B virus DNa' complete genome' 


AB033559 . 1 


ABQ335 g8 Hepatitis B virul DMA complete genome', 


AB033558 . 1 




AB033557 . 1 


^033357 Hepatitis B virus DNa' complete genome' 


ABQ33556 - 1 


AB033556 Hepatitis B virus DNA, complete genome, 


AB03355S . 1 


A 5033555 Hepat ^^ ^ yjrul dna compete ^nome' 


AB033554 . 1 


AB033554 HeP ^^^ - ^us DNA ' conTlete — en ome' 


AB033553 .1 


A 3033553 Hepatitis B virus ^ comp^e genom , _ 


AB033552 - 1 


ABQ335S2 " e P atitis B virus DNa' com lete genome' 


AB033551 . 1 


A 6033551 Uepa ^^ s g-^j^j DNA c^iete genome — ' 


AB033550 . 1 


A 5033550 Hepatl ^ S fi v ^^3 3 clone WB ^ 2b4 — complete 


AF143308 . 1 


AF14330B Hepatitis virus c one _____ ' ___ 


AF143307 .1 


AF143307 Hepatitis B virus c one - c 


AF14330S . 1 


AF143306 Hepatitis B virus clone RMbi/, complete 


AF143305 . 1 


AF143305 Hepatitis B virus clone RM501, complete 


AF143304 . 1 


AF143304 Hepatitis B virus clone HUiiy, complete 


AF143303 .1 


AF1433 03 Hepatitis B virus clone HD14Q6, complete 


AF143302 .1 




AF143301 . 1 


AF143301 Hepatitis B virus clone BWT903, complete 


AFX43300 .1 


AF14330 ° Hep ^^ | lYZl clone 7744-Gg' colplete 


AF143299 . 1 


AF143299 HeP ^^^ — clone 7720-Gs' complete 


AF143298 . 1 


AF143298 "^^^^^^p"^ genome 1 "" 


AB026814 . 1 


A 3026814 Hepat ^^ g 1____ dna' complete ^nome' 


AB026813 .1 


n^atitis B virul DNa' complete glnome! 


AB026812 . 1 


AB02S812 Hepatitis B virus COmg__ 


AB026811.1 


A 3026811 Hepatitis B virus comp^e e , 


AJ131956 . 1 


. HBV131956 ^^ P ^V^"^ 3 — Tom le^e genome 


AF151735 .1 


^^^^^^^^-^^—"s^^in"^ 5t59§ complete 


AF090842 . 1 




AFO90841 . 1 


AF090841 Hepatitis B virus strain G4. 27241, complete 


AFO90840 . 1 


AF090840 Hepatitis B virus strain G3 . 27270, complete 


AF090839 . 1 


AF090839 Hepatitis B virus strain G2. 27246, complete 


AF090838 . 1 


AF090838 Hepatitis B virus strain PI. 27239, complete 


Y18858 . 1 


HBV18858 Hepatitis B virus complete genome, isolate 


Y18857. 1 


HBV18857 Hepatitis B virus complete genome, isolate 


D12980 . 1 


HPBCG Hepatitis B virus subtype adr(SRADR) DNA, 


Y13856 . 1 


HBV18856 Hepatitis B virus complete genome, isolate 


Y13855 . 1 


HBV18855 Hepatitis B virus complete genome, isolate 


AJ131133 .1 


HBV131133 Hepatitis B virus, complete genome, strain 


X80925 . 1 


HBVP6PCXX Hepatitis B virus (patient 6) complete 


X80926 . 1 


HBVP5PCXX Hepatitis B virus (patient 5) complete 


X80924 . 1 


HBVP4PCXX Hepatitis B virus (patient 4) complete 


AF100309.1 


Hepatitis B virus strain 56, complete genome 
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AF068756 .1 


AF068756 Hepatitis B virus, complete genome 


AF043593.1 


AF043593 Hepatitis B virus isolate 6/89, complete 


Y07587.1 


HBVAYWGEN Hepatitis B virus, complete genome 


D28880.1 


D28880 Hepatitis B virus DNA, complete genome, strain 


X98076 .1 


HBVDEFVP3 Hepatitis B virus complete genome with 


X98075 .1 


HBVDEFVP2 Hepatitis B virus complete genome with 


X98074 .1 


HBVDEFVP1 Hepatitis B virus complete genome with 


X98077.1 


HBVCGWITY Hepatitis B virus complete genome, wild type 


X98072.1 


HBVCGINSC Hepatitis B virus complete genome with 


X98073.1 


HBVCGINCX Hepatitis B virus complete genome with 


U95551 . 1 


U95551 Hepatitis B virus subtype ayw, complete genome 


D23684 . 1 


HPBC6T588 Hepatitis B virus (C6-TKB5S8) complete genome 


D23683.1 


HPBC5HK02 Hepatitis B virus (C5-HBVK02) complete genome 


D23682 . 1 


HPBB5HK01 Hepatitis B virus (B5-HBVK01) complete genome 


D23681.1 


HPBC4HST2 Hepatitis B virus (C4-HBVST2) complete genome 


D23680.1 


" HPBB4HST1 Hepatitis B virus (B4-HBVST1) complete genome 


DO0331.1 


HPBADW3 Hepatitis B virus genome, complete genome 


DO0330.1 


HPBADW2 Hepatitis B virus genome, complete genome 


D50489.1 


HPBA11A Hepatitis B virus DNA, complete genome 


D23679.1 


HPBA3HMS2 Hepatitis B virus ( A3 - HBVMS 2 } complete genome 


D23678 . 1 


HPBA2HYS2 Hepatitis B virus (A2-HBVYS2) complete genome 


D23677 . 1 


HPBA1HKK2 Hepatitis B virus (A1-HBVKK2) complete genome 


D16665.1 


HPBADRM Hepatitis B virus DNA, complete genome 


D00329.1 


HPBADW1 Hepatitis B virus (HBV) genome, complete genome 


X97851. 1 


HBVPSCSX Hepatitis B virus (patient 6) complete genome 


X97850.1 


HBVP4CSX Hepatitis B virus (patient 4) complete genome 


X97849.1 


HBVP3CSX Hepatitis B virus (patient 3) complete genome 


X97848.1 


HBVP2CSX Hepatitis B virus (patient 2) complete genome 


X51970 .1 


HVHEPB Hepatitis B virus (HBV 991) complete genome 


M38S36.1 


HPBCGADR Hepatitis o viiufa, &ujjuyi->c auL ' 3 


X59795.1 


HBVAYWMCG Hepatitis B virus (ayw subtype mutant) 


M38454 . 1 


HPBADR1CG Hepatitis B virus , complete genome 


M32138.1 


HPBHBVAA Hepatitis B virus variant HBV-alphal, complete 


J02203 .1 


HPBAYW Human hepatitis B virus (subtype ayw), complete 


M1290S .1 


HPBADRA Hepatitis B virus subtype adr, complete genome 


M54923 .1 


HPBADWZ Hepatitis B virus (subtype adw) , complete genome 


L27106.1 


HPBMUT Hepatitis B virus mutant complete genome 
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Table 36: HBV Substrate Sequence 



NT Position* 


Substrate 


Seq ID 


82 


CUAUCGUCCCCUUCUUCAUC 


1 


101 


CUACCGUUCCGGCC 


2 


159 


CUUCUCAUCU 


3 


184 


CUUCCCUUCACCAC 


4 


269 


GACUCUCAGAAUGUCAACGAC 


5 


381 


CUGUAGGCAUAAAUGGUCUG 


6 


401 


GUUCACCAGCACCAUGCAACUUUUU 


7 


424 


UUUCACGUCUGCCUAAUCAUC 


8 


524 


AUUUGGAGCUUC 


9 


562 


CUGACUUCUUUCCUUCUAUUC 


10 


649 


CUCAC CAUACCGCACUCA 


11 


667 


GGCAAGCUAUUCUGUG 


12 


717 


GGAAGUAAUUUGGAAGAC 


13 


758 


CAGCUAUGUCAAUGUUAA 


14 


783 


CUAAAAUCGGCCUAAAAUCAGAC 


15 


812 


CAUUUCCUGUCUCACUUUUGGAAGAG 


16 


887 


UCCUGCUUACAGAC 


17 


922 


CAACACUUCCGGAAACUACUGUUGUUAG 


18 


989 


CUCGCCUCGCAGACGAAGGUCUC 


19 


1009 


CAAUCGCCGCGUCGCAGAAG 


20 


1031 


AUCUCAAUCUCGGGAAUCUCAA 


21 


1052 


AUGUUAGUAUCCCUUGGACUC 


22 


1072 


CAUAAGGUGGGAAACUUUACUG 


23 


1109 


CUGUACCUAUUCUUUAAAUC C 




1127 


CUGAGUGGCAAACUCCC 


25 


1271 


CCAAAUAUCUGCCCUUGGACAA 


26 


1297 


AUUAAACCAUAUUAUCCUGAACA 


27 


1319 


' AUGCAGUUAAUCAUUACUUCAAAACUA 


28 


1340 


AAACUAGGCAUUA 


29 


1370 


AGGCGGGCAUUCUAUAUAAGAGAG 


30 


1393 


GAAACUACGCGCAGCGCCUCAUUUUGU 


31 


1412 


CAUUUUGUGGGUCACCAUA 


32 


1441 


CAAGAGCUACAGCAUGGG 


33 



LOCUS HPBADR1CG 3221 bp DNA circular VRL 

06-MAR-1995 

DEFINITION Hepatitis B virus , complete genome. 
ACCESSION M38454 



*The nucleotide number referred to in that table is the position of the 5' end of the 
oligo in this sequence. 
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Table 37: Human HBV Hammerhead Ribozyme and Target Sequence 



CCACCACT T TCCACCAA 



CACCACTT T CCACCAAA 



ACCACTTT C CACCAAAC 



ACCAAACT C TTCAAGAT 



CAAACTCT T CAAGATCC 



AAACTCTT C AAGATCCC 



TTCAAGAT C CCAGAGTC" 



CCCAGAGT C AGGGCCCT 



Ribozyme 



UUGGUGGA CUGAUGAG X CGAA AGUGGUGG 



GGCCCTGT A CTTTCCTG 



CCTGTACT T TCCTGCTG 



CTGTACTT T CCTGCTGG 



TGTACTTT C CTGCTGGT 



TGGTGGCT C CAGTTCAG 



GCTCCAGT T CAGGAACA 



CTCCAGTT C AGGAACAG 



GCCCTGCT C AGAATACT 



CTCAGAAT A CTGTCTCT 



AATACTGT C TCTGCCAT 



TACTGTCT C TGCCATAT 



TCTGCCAT A TCGTCAAT 



TGCCATAT C GTCAATCT 



CATATCGT C AATCTTAT 



UUUGGUGG CUGAUGAG X CGAA AAGUGGUG 



' GUUUGGUG CUGAUGAG X CGAA AAAGUGGU 



AUCUUGAA CUGAUGAG X CGAA AGUUUGGU 



GGAUCUUG CUGAUGAG X CGAA AGAGUUUG 



GGGAUCUU CUGAUGAG X CGAA AAGAGUUU 



GACUCUGG CUGAUGAG X CGAA AUCUUGAA 



AGGGCCCU CUGAUGAG X CGAA ACUCUGGG 



""' CAGGAAAG CUGAUGAG X CGAA ACAGGGCC 



CAGCAGGA CUGAUGAG X CGAA AGUACAGG 



CCAGCAGG CUGAUGAG X CGAA AAGUACAG 



~ ACCAGCA G CUGAUGAG X CGAA AAAGUACA 
CUGAACUG CUGAUGAG X CGAA AGCCACCA 



UGUUCCUG CUGAUGAG X CGAA ACUGGAGC 



CUGUUCCU CUGAUGAG X CGAA AACUGGAG 



" AGUAUUCU CUGAUGAG X CGAA AGCAGGGC 



AGAGACA G CUGAUGAG X CGAA AUUCUGAG 
AUGGCAGA CUGAUGAG X CGAA ACAGUAUU 



AUAUGGCA CUGAUGAG X CGAA AGACAGUA 



" AUUGACGA CUGAUGAG X CGAA AUGGCAGA 



AGAUUGAC CUGAUGAG X CGAA AUAUGGCA 



TCGTCAAT C TTATCGAA 



GTCAATCT T ATCGAAGA 



TCAATCTT A TCGAAGAC 



AATCTTAT C GAAGACTG 



GACCCTGT A CCGAACAT 



GAGAACAT C GCATCAGG~ 



CATCGCAT C AGGACTCC 



TCAGGACT C CTAGGACC~ 



GGACTCCT A GGACCCCF 



CCCCTGCT C GTGTTACA 



GCTCGTGT T ACAGGCGG 



CTCGTGTT A CAGGCGGG 



GGCGGGGT T TTTCTTGT 



GCGGGGTT T TTCTTGTT 



CGGGGTTT T TCTTGTTG 



GGGGTTTT T CTTGTTGA 



GGGTTTTT C TTGTTGAC 



GTTTTTCT T GTTGACAA 



TTTCTTGT T GACAAAAA 



ACAAAAAT C CTCAGAAT" 



AAAATCCT C ACAATACC 



CTCACAAT A CCACAGAG 



CACAGAGT C TAGACTCCT 



CAGAGTCT A GACTCGTG 



TCTAGACT C GTGGTGGA 



GGTGGACT T CTCTCAAT 



AUAAGAUU CUGAUGAG X CGAA ACGAUAUG 
UUCGAUAA CUGAUGAG X CGAA AUUGACGA 



UCUUCGAU CUGAUGAG X CGAA AGAUUGAC 



GUCUUCGA CUGAUGAG X CGAA AAGAUUGA 



CAGUCUUC CUGAUGAG X CGAA AUAAGAUU 



AUGUUCGG CUGAUGAG X CGAA ACAGGGUC 



CCUGAUGC CUGAUGAG X CGAA AUGUUCUC 



GGAGUCCU CUGAUGAG X CGAA AUGCGAUG 



GGUCCUAG CUGAUGAG X CGAA AGUCCUGA 



" AGGGGUCC CUGAUGAG X CGAA AGGAGUCC 



UGUAACAC CUGAUGAG X CGAA AGCAGGGG 



CCGC CUGU CUGAUGAG X CGAA ACACGAGC 
CCCGCCUG CUGAUGAG X CGAA AACACGAG 



ACAAGAAA CUGAUGAG X CGAA ACCCCGCC 



AACAAGAA CUGAUGAG X CGAA AACCCCGC 



CAACAAGA CUGAUGAG X CGAA AAACCCCG 



UCAACAAG CUGAUGAG X CGAA AAAACCCC 



GUCAACAA CUGAUGAG X CGAA AAAAACCC 



UUGUCAAC CUGAUGAG X CGAA AGAAAAAC 



UUUUUGUC CUGAUGAG X CGAA ACAAGAAA 



AUUGUGAG CUGAUGAG X CGAA AUUUUUGU 



GGUAUUGU CUGAUGAG X CGAA AGGAUUUU 



CUCUGUGG CUGAUGAG X CGAA AUUGUGAG 



CGAG UCUA CUGAUGAG X CGAA ACUCUGUG 
" CACGAGUC CUGAUGAG X CGAA AGACUCUG 



UCCACCAC CUGAUGAG X CGAA AGUCUAGA 



AUUGAGAG CUGAUGAG X CGAA AGUCCACC 



2576 
2577 



2532 
2583 



WO 01/16312 



PCT/US00/23998 



GTGGACTT C TCTCAATT 



AAUUGAGA CUGAUGAG X CGAA AAGUCCAC 



GGACTTCT C TCAATTTT 



AAAAUUGA CUGAUGAG X CGAA AGAAGUCC 



ACTTCTCT C AATTTTCT 



AGAAAAUU CUGAUGAG X CGAA AGAGAAGU 



CTCTCAAT T TTCTAGGG 



CCCUAGAA CUGAUGAG X CGAA AUUGAGAG 



CCCCUAGA CUGAUGAG X CGAA AAUUGAGA~ 



278 
279 



TCTCAATT T TCTAGGGG 



CCCCCUAG CUGAUG AG X CGAA AAAUUGAG 
UCCCCC UA CUGAUGAG X CGAA AAAAUUGA 
GUUCCCCC CUGAU GAG X CGAA AGAAAAUU 
UUGGCCAA CUGAUGAG X CGAA ACACACGG 
UUUUG GCC CUGAUGAG X CGAA AGACACAC 
GGACUGCG CUGAUGAG X CGAA AUUUUGGC 
GGGACUGC CU GAUGAG X CGAA AAUUUUGG 
AGAUUUGG CUGAUGAG X CGAA ACUGCGAA 
UGACUGGA CUGAUGA G X CGAA AUUUGGGA 
AGUGACUG CUGAUGAG X C GAA AGAUUUGG 
UGGUGAGU CUGAU GAG X CGAA ACUGGAGA ' 
AGGUUGGU CUGAUGAG X CGAA AGUGACUG 
UGGAGGAC CUGAUGAG X C GAA ACAGGUUG 
AAUUGGAG CUGAUG AG X CGAA ACAACAGG 
ACAAAUUG CUGAUG AG X CGAA AGGACAAC 
CCAGGACA CUGAUGAG X CGAA AUUGGAGG 
ACCAGGAC CUGAUGAG X CGAA AAUUGGAG 
A UAACCAG- CUGAUGAG X CGAA ACAAAUUG 
CCAGCGAU CUGAUG AG X CGAA ACCAGGAC ~ 
UCCAGCGA CUGAUGA G X CGAA AACCAGGA 
CAUCCAGC CUGA UGAG X CGAA AUAACCAG ~ 
ACGCCGCA CUGAUG AG X CGAA ACACAUCC~ 
GAUGAUAA CUGAUGAG X CGAA ACGCCGCA 
AGAUGA UA CUGAUGAG X CGAA AACGCCGC 
AAGAUGAU CUGA UGAG X CGAA AAACGCCG 
GAAGAUGA CUG AUGAG X CGAA AAAACGCC 
AGGAAGAU CUGAUG AG X CGAA AUAAAACG 
CAGAGGAA CUGAUGAG X CGAA AUGAUAAA 
UGCAGAGG CUGAUGAG X CG AA AGAUGAUA 
AUGCAGAG CUG AUGAG X CGAA AAGAUGAU ~ 
AGGAUGCA CUGAUGAG X CGAA AGGAAGAU 
AGCAGCAG CUGAUGAG X CGAA AUGCAGAG 
A UGAGGCA CUGAUGAG X CGAA AGCAGCAG 
A AGAAGAU CUGAUGAG X CGAA AGGCAUAG 
AACAA GAA CUGAUGAG X CGAA AUGAGGCA 
CCAACAAG CUGAUGAG X CGAA AGAUGAGG 
ACC AACAA CUGAUGAG X CGAA AAGAUGAG 
GAACCAAC CUGAU GAG X CGAA AGAAGAUG 
GAAGAACC CUGAUGAG X CGAA ACAAGAAG 
UCCAGAAG CUGAU GAG X CGAA ACCAACAA ~ 
GUCCAGAA CUGAUGAG X C GAA AACCAACA 
U AGUCCAG CUGAUGAG X CGAA AGAACCAA 
AUAGUCCA CUGAUGAG X CGAA AAGAACCA 
UAC CUUGA CUGAUGAG X CGAA AGUCCAGA 
CAU ACCUU CUGAUGAG X CGAA AUAGUCCA 
GGGCAACA CUGAUGAG X CGAA ACCUUGAU~ 



432 
434 



CTCAATTT T CTAGGGGG 
TCAATTTT C TAGGGGGA 
AATTTTC T A GGGGGAAC 
CCGTGTGT C TTGGCCAA 
GTGTGTCT T GGCCAAAA 
GCCAAAAT T CGCAGTCC 
CCAAAATT C GCAGTCCC 
TTCGCAGT C CCAAATCT 
TCCCAAA T C TCCAGTCA 
CCAAATCT C CAGTCACT 
TCTCCAGT C ACTCACCA 
CAGTCACT C ACCAACCT 
CAACCTGT T GTCCTCCA 
CCTGTTGT C CTCCAATT 
GTTGTCCT C CAATTTGT 
CCTCCAAT T TGTCCTGG 
CTCCAATT T GTCCTGGT 
CAATTTGT C CTGGTTAT 
GTCCTGGT T ATCGCTGG 
TCCTGGTT A TCGCTGGA 
CTGGTTA T C GCTGGATG 
GGATGTGT C TGCGGCGT 
TGCGGCGT T TTATCATC 
GCGGCGTT T TATCATCT 
CGGCGTT T T ATCATCTT 
GGCGTTTT A TCATCTTC 
CGTTTTAT C ATCTTCCT 
TTTATCAT C TTCCTCTG 
TATCATCT T CCTCTGCA 
ATCATCTT C CTCTGCAT 
ATCTTCCT C TGCATCCT 
CTCTGCAT C CTGCTGCT 
CTGCTGCT A TGCCTCAT 
CTATGCCT C ATCTTCTT 
TGCCTCAT C TTCTTGTT 
CCTCATCT T CTTGTTGG 
CTCATCTT C TTGTTGGT 
CATCTT CT T GTTGGTTC 
CTTCTTGT T GGTTCTTC 
TTGTTGGT T CTTCTGGA 
TGTTGGTT C TTCTGGAC 
TTGGTTCT T CTGGACTA 
TGGTTCTTtwT TGGACTAT 
TCTGGACT A TCAAGGTA 
TGGAC TAT C AAGGTATG 
ATCAAGGT A TGTTGCCC 
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AGGTATGT T GCCCGTTT 



AAACGGGC CUGAUGAG X CGAA ACAUACCU 



TTGCCCGT T TGTCCTCT 



AGAGGACA CUGAUGAG X CGAA ACGGGCAA 



TGCCCGTT T GTCCTCTA 



UAGAGGAC CUGAUGAG X CGAA AACGGGCA 



CCGTTTGT C CTCTAATT 



AAUUAGAG CUGAUGAG X CGAA ACAAACGG 



TTTGTCCT C TAATTCCA 



UGGAAUUA CUGAUGAG X CGAA AGGACAAA 



TGTCCTCT A ATTCCAGG 



CCUGGAAU CUGAUGAG X CGAA AGAGGACA 



CCTCTAAT T CCAGGATC 



GAUCCUGG CUGAUGAG X CGAA AUUAGAGG 



: CAGGATCA 



UGAUCCUG CUGAUGAG X CGAA AAUUAGAG 



TCCAGGAT C ATCAACAA 



UUGUUGAU CUGAUGAG X CGAA AUCCUGGA 



AGGATCAT C AACAACCA 



UGGUUGUU CUGAUGAG X CGAA AUGAUCCU 



GCACAACT C CTGCTCAA 



UUGAGCAG CUGAUGAG X CGAA AGUUGUGC 



CTCCTGCT C AAGGAACC 



GGUUCCUU CUGAUGAG X CGAA AGCAGGAG 



AGGAACCT C TATGTTTC 



GAAACAUA CUGAUGAG X CGAA AGGUUCCU 



GAACCTCT A TGTTTCCC 



GGGAAACA CUGAUGAG X CGAA AGAGGUUC 



CTCTATGT T TCCCTCAT 



AUGAGGGA CUGAUGAG X CGAA ACAUAGAG 



TCTATGTT T CCCTCATG 



CAUGAGGG CUGAUGAG X CGAA AACAUAGA 



CTATGTTT C CCTCATGT 



ACAUGAGG CUGAUGAG X CGAA AAACAUAG 



GTTTCCCT C ATGTTGCT 



AGCAACAU CUGAUGAG X CGAA AGGGAAAC 



CCTCATGT T GCTGTACA 



UGUACAGC CUGAUGAG X CGAA ACAUGAGG 



GTTGCTGT A CAAAACCT 



AGGUUUUG CUGAUGAG X CGAA ACAGCAAC 



CAAAACCT A CGGACGGA 



UCCGUCCG CUGAUGAG X CGAA AGGUUUUG 



GCACCTGT A TTCCCATC 



GAUGGGAA CUGAUGAG X CGAA ACAGGUGC 



606 
607 
612 



ACCTGTAT T CCCATCCC 



GGGAUGGG CUGAUGAG X CGAA AUACAGGU 



CCTGTATT C CCATCCCA 



UGGGAUGG CUGAUGAG X CGAA AAUACAGG 



ATTCCCAT C CCATCATC 



GAUGAUGG CUGAUGAG X ( 



CATCCCAT C ATCTTGGG 



CCCAAGAU CUGAUGAG X CGAA AUGGGAUG 



CCCATCAT C TTGGGCTT 



AAGCCCAA CUGAUGAG X CGAA AUGAUGGG 



CATCATCT T GGGCTTTC 



GAAAGCCC CUGAUGAG X CGAA AGAUGAUG 



CTTGGGCT T TCGCAAAA 



UUUUGCGA CUGAUGAG X CGAA AGCCCAAG 



TTGGGCTT T CGCAAAAT 



AUUUUGCG CUGAUGAG X CGAA AAGCCCAA 



TGGGCTTT C GCAAAATA 



UAUUUUGC CUGAUGAG X CGAA AAAGCCCA 



CGCAAAAT A CCTATGGG 



CCCAUAGG CUGAUGAG X CGAA AUUUUGCG 



AAATACCT A TGGGAGTG 



CACUCCCA CUGAUGAG X CGAA AGGUAUUU 



GTGGGCCT C AGTCCGTT 



AACGGACU CUGAUGAG X CGAA AGGCCCAC 



GCCTCAGT C CGTTTCTC 



GAGAAACG CUGAUGAG X CGAA ACUGAGGC 



CAGTCCGT T TCTCTTGG 



CCAAGAGA CUGAUGAG X CGAA ACGGACUG 



AGTCCGTT T CTCTTGGC 



GCCAAGAG CUGAUGAG X CGAA AACGGACU 



GTCCGTTT C TCTTGGCT 



AGCCAAGA CUGAUGAG X CGAA AAACGGAC 



CCGTTTCT C TTGGCTCA 



UGAGCCAA CUGAUGAG X CGAA AGAAACGG 



GTTTCTCT T GGCTCAGT 



ACUGAGCC CUGAUGAG X CGAA AGAGAAAC 



TCTTGGCT C AGTTTACT 



AGUAAACU CUGAUGAG X CGAA AGCCAAGA 



GGCTCAGT T TACTAGTG 



CACUAGUA CUGAUGAG X CGAA ACUGAGCC 



GCTCAGTT T ACTAGTGC 



GCACUAGU CUGAUGAG X CGAA AACUGAGC 



CTCAGTTT A CTAGTGCC 



GGCACUAG CUGAUGAG X CGAA AAACUGAG 



AGTTTACT A GTGCCATT 



AAUGGCAC CUGAUGAG X CGAA AGUAAACU 



AGTGCCAT T TGTTCAGT 



ACUGAACA CUGAUGAG X CGAA AUGGCACU 



GTGCCATT T GTTCAGTG 



CACUGAAC CUGAUGAG X CGAA AAUGGCAC 



CCATTTGT T CAGTGGTT 



AACCACUG CUGAUGAG X CGAA ACAAAUGG 



CATTTGTT C AGTGGTTC 



GAACCACU CUGAUGAG X CGAA AACAAAUG 



TCAGTGGT T CGTAGGGC 



GCCCUACG CUGAUGAG X CGAA ACCACUGA 



CAGTGGTT C GTAGGGCT 



AGCCCUAC CUGAUGAG X CGAA AACCACUG 
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Table 37 



708 


TGGTTCGT A GGGCTTTC 


184 


GAAAGCCC CUGAUGAG X CGAA ACGAACCA 


2693 


714 


GTAGGGCT T TCCCCCAC 


185 


GUGGGGGA CUGAUGAG X CGAA AGCCCUAC 


2694 


715 


TAGGGCTT T CCCCCACT 


186 


AGUGGGGG CUGAUGAG X CGAA AAGCCCUA 


2695 


716 


AGGGCTTT C CCCCACTG 


187 


CAGUGGGG CUGAUGAG X CGAA AAAGCCCU 


1 


725 


CCCACTGT C TGGCTTTC 


188 


GAAAGCCA CUGAUGAG X CGAA ACAGUGGG 


2697 


732 


GTCTGGCT T TCAGTTAT 


189 


AUAACUGA CUGAUGAG X CGAA AGCCAGAC 


2698 


733 


TCTGGCTT T CAGTTATA 


190 


UAUAACUG CUGAUGAG X CGAA AAGCCAGA 


2599 


734 


CTGGCTTT C AGTTATAT 


191 


AUAUAACU CUGAUGAG X CGAA AAAGCCAG 


2700 


738 


CTTTCAGT T ATATGGAT 


192 


AUCCAUAU CUGAUGAG X CGAA ACUGAAAG 


2701 


739 


TTTCAGTT A TATGGATG 


193 


CAUCCAUA CUGAUGAG X CGAA AACUGAAA 


2702 


741 


TCAGTTAT A TGGATGAT 


194 


AUCAUCCA CUGAUGAG X CGAA AUAACUGA 


2703 


755 


GATGTGGT T TTGGGGGC 


195 


GCCCCCAA CUGAUGAG X CGAA ACCACAUC 


2704 


756 


ATGTGGTT T TGGGGGCC 


196 


GGCCCCCA CUGAUGAG X CGAA AACCACAU 


2705 


757 


TGTGGTTT T GGGGGCCA 


197 


UGGCCCCC CUGAUGAG X CGAA AAACCACA 


iZ^ 


769 


GGCCAAGT C TGTACAAC 


198 


GUUGUACA CUGAUGAG X CGAA ACUUGGCC 


2707 


773 


AAGTCTGT A CAACATCT 


199 


AGAUGUUG CUGAUGAG X CGAA ACAGACUU 


2708 


780 


TACAACAT C TTGAGTCC 


200 


GGACUCAA CUGAUGAG X CGAA AUGUUGUA 


2 . 7 . 09 


782 


CAACATCT T GAGTCCCT 


201 


AGGGACUC CUGAUGAG X CGAA AGAUGUUG 


2710 


787 


TCTTGAGT C CCTTTATG 


202 


CAUAAAGG CUGAUGAG X CGAA ACUCAAGA 


2711 


791 


GAGTCCCT T TATGCCGC 


203 


GCGGCAUA CUGAUGAG X CGAA AGGGACUC 


2712 


792 


AGTCCCTT T ATGCCGCT 


204 


AGCGGCAU CUGAUGAG X CGAA AAGGGACU 


2713_ 


793 


GTCCCTTT A TGCCGCTG 


205 


CAGCGGCA CUGAUGAG X CGAA AAAGGGAC 


2714 


803 


GCCGCTGT T ACCAATTT 


206 


AAAUUGGU CUGAUGAG X CGAA ACAGCGGC 


2715 


804 


CCGCTGTT A CCAATTTT 


207 


AAAAUUGG CUGAUGAG X CGAA AACAGCGG 


2715 


810 


TTACCAAT T TTCTTTTG 


208 


CAAAAGAA CUGAUGAG X CGAA AUUGGUAA 


2717 


811 


TACCAATT T TCTTTTGT 


209 


ACAAAAGA CUGAUGAG X CGAA AAUUGGUA 


2718 


812 


ACCAATTT T CTTTTGTC 


210 


GACAAAAG CUGAUGAG X CGAA AAAUUGGU 


2719 


813 


CCAATTTT C TTTTGTCT 


211 


AGACAAAA CUGAUGAG X CGAA AAAAUUGG 


2720 


815 


AATTTTCT T TTGTCTTT 


212 


AAAGACAA CUGAUGAG X CGAA AGAAAAUU 


2721 


816 


ATTTTCTT T TGTCTTTG 


213 


CAAAGACA CUGAUGAG X CGAA AAGAAAAU 


2722 


817 


TTTTCTTT T GTCTTTGG 


214 


CCAAAGAC CUGAUGAG X CGAA AAAGAAAA 


2723 


820 


TCTTTTGT C TTTGGGTA 


215 


UACCCAAA CUGAUGAG X CGAA ACAAAAGA 


2724 


822 


TTTTGTCT T TGGGTATA 


216 


UAUACCCA CUGAUGAG X CGAA AGACAAAA 


2725 


823 


TTTGTCTT T GGGTATAC 


217 


GUAUACCC CUGAUGAG X CGAA AAGACAAA 


2726 


828 


CTTTGGGT A TACATTTA 


218 


UAAAUGUA CUGAUGAG X CGAA ACCCAAAG 


2727 


830 


TTGGGTAT A CATTTAAA 


219 


UUUAAAUG CUGAUGAG X CGAA AUACCCAA 


2728 


834 


GTATACAT T TAAACCCT 


220 


AGGGUUUA CUGAUGAG X CGAA AUGUAUAC 


2729 


835 


TATACATT T AAACCCTC 


221 


GAGGGUUU CUGAUGAG X CGAA AAUGUAUA 


2730 


836 


ATACATTT A AACCCTCA 


222 


UGAGGGUU CUGAUGAG X CGAA AAAUGUAU 





843 


TAAACCCT C ACAAAACA 


223 


UGUUUUGU CUGAUGAG X CGAA AGGGUUUA 


" 


865 


ATGGGGAT A TTCCCTTA 


224 


UAAGGGAA CUGAUGAG X CGAA AUCCCCAU 





867 


GGGGATAT T CCCTTAAC 


225 


GUUAAGGG CUGAUGAG X CGAA AUAUCCCC 


2^7T§ 


868 


GGGATATT C CCTTAACT 


226 


AGUUAAGG CUGAUGAG X CGAA AAUAUCCC 




872 


TATTCCCT T AACTTCAT 




AUGAAGUU CUGAUGAG X CGAA AGGGAAUA 


2736 


B73 


ATTCCCTT A ACTTCATG 


228 


CAUGAAGU CUGAUGAG X CGAA AAGGGAAU 


2737 


877 


CCTTAACT T CATGGGAT 


229 


AUCCCAUG CUGAUGAG X CGAA AGUUAAGG 


2738 


878 


CTTAACTT C ATGGGATA 


230 


UAUCCCAU CUGAUGAG X CGAA AAGUUAAG 


2739 


886 


CATGGGAT A TGTAATTG 


231 


CAAUUACA CUGAUGAG X CGAA AUCCCAUG 


2740 


890 


GGATATGT A ATTGGGAG 


232 


CUCCCAAU CUGAUGAG X CGAA ACAUAUCC 


2741 


893 


TATGTAAT T GGGAGTTG 


233 


CAACUCCC CUGAUGAG X CGAA AUUACAUA 


2742 


900 


TTGGGAGT T GGGGCACA 


234 


UGUGCCCC CUGAUGAG X CGAA ACUCCCAA 


2743 



WO 01/16312 



PCT/US00/23998 



GGGCACAT T GCCACAGG 
AGGAACAT A TTGTACAA 
GAACATAT T GTACAAAA 
CATATTGT A CAAAAAAT 
CAAAAAAT C AAAATGTG 
AAATGTGT T TTAGGAAA 
AATGTGTT T TAGGAAAC 
ATGTGTTT T AGGAAACT 
TGTGTTTT A GGAAACTT 
AGGAAACT T CCTGTAAA 
GGAAACTT C CTGTAAAC 
CTTCCTGT A AACAGGCC 
ACAGGCCT A TTGATTGG 
AGGCCTAT T GATTGGAA 
CTATTGAT T GGAAAGTA 
TGGAAAGT A TGTCAACG 
AAGTATGT C AACGAATT 
CAACGAAT T GTGGGTCT 
TTGTGGGT C TTTTGGGG 
GTGGGTCT T TTGGGGTT 
TGGGTCTT T TGGGGTTT 
GGGTCTTT T GGGGTTTG 
TTTGGGGT T TGCCGCCC 
TTGGGGTT T GCCGCCCC 
CCGCCCCT T TCACGCAA 
CGCCCCTT T CACGCAAT 
GCCCCTTT C ACGCAATG 
' ATGTGGAT A TTCTGCTT 
GTGGATAT T CTGCTTTA 
TGGATATT C TGCTTTAA 
ATTCTGCT T TAATGCCT 
TTCTGCTT T AATGCCTT 
TCTGCTTT A ATGCCTTT 
TAATGCCT T TATATGCA 
AATGCCTT T ATATGCAT 
ATGCCTTT A TATGCATG 
GCCTTTAT A TGCATGCA 
GCATGCAT A CAAGCAAA 
AACAGGCT T TTACTTTC 
ACAGGCTT T TACTTTCT 
CAGGCTTT T ACTTTCTC 
AGGCTTTT A CTTTCTCG 
CTTTTACT T TCTCGCCA 
TTTTACTT T CTCGCCAA 
TTTACTTT C TCGCCAAC 
TACTTTCT C GCCAACTT 
CGCCAACT T ACAAGGCC 
GCCAACTT A CAAGGCCT 
CAAGGCCT T TCTAAGTA 
AAGGCCTT T CTAAGTAA 
AGGCCTTT C TAAGTAAA 



CCUGUGGC CUGAUGAG X CGAA AUGUGCCC 
UUGUACAA CUGAUGAG X CGAA AUGUUCCU 
UUUUGUAC CUGAUGAG X CGAA AUAUGUUC 
AUUUUUUG CUGAUGAG X CGAA ACAAUAUG 
CACAUUUU CUGAUGAG X CGAA AUUUUUUG 
UUUCCUAA CUGAUGAG X CGAA ACACAUUU 
GUUUCCUA CUGAUGAG X CGAA AACACAUU 
AGUUUCCU CUGAUGAG X CGAA AAACACAU 
AAGUUUCC CUGAUGAG X CGAA AAAACACA 
UUUACAGG CUGAUGAG X CGAA AGUUUCCU 
GUUUACAG CUGAUGAG X CGAA AAGUUUCC 
GGCCUGUU CUGAUGAG X CGAA ACAGGAAG 
CCAAUCAA CUGAUGAG X CGAA AGGCCUGU 
UUCCAAUC CUGAUGAG X CGAA AUAGGCCU 
UACUUUCC CUGAUGAG X CGAA AUCAAUAG 
CGUUGACA CUGAUGAG X CGAA ACUUUCCA 
AAUUCGUU CUGAUGAG X CGAA ACAUACUU 
AGACCCAC CUGAUGAG X CGAA AUUCGUUG 
CCCCAAAA CUGAUGAG X CGAA ACCCACAA 
AACCCCAA CUGAUGAG X CGAA AGACCCAC 
AAACCCCA CUGAUGAG X CGAA AAGACCCA 
CAAACCCC CUGAUGAG X CGAA AAAGACCC 
GGGCGGCA CUGAUGAG X CGAA ACCCCAAA 
GGGGCGGC CUGAUGAG X CGAA AACCCCAA 
UUGCGUGA CUGAUGAG X CGAA AGGGGCGG 
AUUGCGUG CUGAUGAG X CGAA AAGGGGCG 
CAUUGCGU CUGAUGAG X CGAA AAAGGGGC 
AAGCAGAA CUGAUGAG X CGAA AUCCACAU 
UAAAGCAG CUGAUGAG X CGAA AUAUCCAC 
UUAAAGCA CUGAUGAG X CGAA AAUAUCCA 
AGGCAUUA CUGAUGAG X CGAA AGCAGAAU 
AAGGCAUU CUGAUGAG X CGAA AAGCAGAA 
AAAGGCAU CUGAUGAG X CGAA AAAGCAGA 
UGCAUAUA CUGAUGAG X CGAA AGGCAUUA 
AUGCAUAU CUGAUGAG X CGAA AAGGCAUU 
CAUGCAUA CUGAUGAG X CGAA AAAGGCAU 
UGCAUGCA CUGAUGAG X CGAA AUAAAGGC 
UUUGCUUG CUGAUGAG X CGAA AUGCAUGC~ 
GAAAGUAA CUGAUGAG X CGAA AGCCUGUU 
AGAAAGUA CUGAUGAG X CGAA AAGCCUGU 
GAGAAAGU CUGAUGAG X CGAA AAAGCCUG 
CGAGAAAG CUGAUGAG X CGAA AAAAGCCU 
UGGCGAGA CUGAUGAG X CGAA AGUAAAAG 
UUGGCGAG CUGAUGAG X CGAA AAGUAAAA 
GUUGGCGA CUGAUGAG X CGAA AAAGUAAA 
AAGUUGGC CUGAUGAG X CGAA AGAAAGUA 
GGCCUUGU CUGAUGAG X CGAA AGUUGGCG 
AGGCCUUG CUGAUGAG X CGAA AAGUUGGC 
UACUUAGA CUGAUGAG X CGAA AGGCCUUG 
UUACUUAG CUGAUGAG X CGAA AAGGCCUU 
UUUACUUA CUGAUGAG X CGAA AAAGGCCU 
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Table 37 



1124 


GCCTTTCT A AGTAAACA 


286 


UGUUUACU CUGAUGAG X CGAA AGAAAGGC 


2 -221 


1128 


TTCTAAGT A AACAGTAT 


287 


AUACUGUU CUGAUGAG X CGAA ACUUAGAA 




1135 


TAAACAGT A TGTGAACC 


288 


GGUUCACA CUGAUGAG X CGAA ACUGUUUA 


— wi — 




1145 


GTGAACCT T TACCCCGT 


289 


ACGGGGUA CUGAUGAG X CGAA AGGUUCAC 




1146 


TGAACCTT T ACCCCGTT 


290 


AACGGGGU CUGAUGAG X CGAA AAGGUUCA 


2799 


1147 


GAACCTTT A CCCCGTTG 


291 


CAACGGGG CUGAUGAG X CGAA AAAGGUUC 


2aQ 

— 


1154 


TACCCCGT T GCTCGGCA 


292 


UGCCGAGC CUGAUGAG X CGAA ACGGGGUA 


2801 


1158 


CCGTTGCT C GGCAACGG 


293 


CCGUUGCC CUGAUGAG X CGAA AGCAACGG 


02 


1173 


GGCCTGGT C TATGCCAA 


294 


UUGGCAUA CUGAUGAG X CGAA ACCAGGCC 


2803 


1175 


CCTGGTCT A TGCCAAGT 


295 


ACUUGGCA CUGAUGAG X CGAA AGACCAGG 


2804 


1185 


CCAAGTGT T TGCTGACG 


296 


CGUCAGCA CUGAUGAG X CGAA ACACUUGG 


2805 


1187 


CAAGTGTT T GCTGACGC 


297 


GCGUCAGC CUGAUGAG X CGAA AACACUUG 




1209 


CCACTGGT T GGGGCTTG 


298 


CAAGCCCC CUGAUGAG X CGAA ACCAGUGG 


2807 


1216 


TTGGGGCT T GGCCATAG 


299 


CUAUGGCC CUGAUGAG X CGAA AGCCCCAA 


2808 


1223 


TTGGCCAT A GGCCATCA 


300 


UGAUGGCC CUGAUGAG X CGAA AUGGCCAA 


280 9 


1230 


TAGGCCAT C AGCGCATG 


301 


CAUGCGCU CUGAUGAG X CGAA AUGGCCUA 


2810 .... 


1249 


TGGAACCT T TGTGTCTC 


302 


GAGACACA CUGAUGAG X CGAA AGGUUCCA 


2811 


1250 


GGAACCTT T GTGTCTCC 


303 


GGAGACAC CUGAUGAG X • CGAA AAGGUUCC 


2812 - 


1255 


CTTTGTGT C TCCTCTGC 


304 


GCAGAGGA CUGAUGAG X CGAA ACACAAAG 


2813,. 


1257 


TTGTGTCT C CTCTGCCG 


305 


CGGCAGAG CUGAUGAG X CGAA AGACACAA 


2814 


1260 


TGTCTCCT C TGCCGATC 


306 


GAUCGGCA CUGAUGAG X CGAA AGGAGACA 


2815 


1268 


CTGCCGAT C CATACCGC 


307 


GCGGUAUG CUGAUGAG X CGAA AUCGGCAG 


2816 


1272 


CGATCCAT A CCGCGGAA 


308 


UUCCGCGG CUGAUGAG X CGAA AUGGAUCG 


2817 


1283 


GCGGAACT C CTAGCCGC 


309 


GCGGCUAG CUGAUGAG X CGAA AGUUCCGC 


2813 


1286 


GAACTCCT A GCCGCTTG 


310 


CAAGCGGC CUGAUGAG X CGAA AGGAGUUC 


2819 


1293 


TAGCCGCT T GTTTTGCT 


311 


AGCAAAAC CUGAUGAG X CGAA AGCGGCUA 


2820 


1296 


CCGCTTGT T TTGCTCGC 


312 


GCGAGCAA CUGAUGAG X CGAA ACAAGCGG 


2821 


1297 


CGCTTGTT T TGCTCGCA 


313 


UGCGAGCA CUGAUGAG X CGAA AACAAGCG 


2822 


1298 


GCTTGTTT T GCTCGCAG 


314 


CUGCGAGC CUGAUGAG X CGAA AAACAAGC 


2823 


1302 


GTTTTGCT C GCAGCAGG 


315 


CCUGCUGC CUGAUGAG X CGAA AGCAAAAC 


2824 


1312 


CAGCAGGT C TGGGGCAA 


316 


UUGCCCCA CUGAUGAG X CGAA ACCUGCUG 


2825 


1325 


GCAAAACT C ATCGGGAC 


317 


GUCCCGAU CUGAUGAG X CGAA AGUUUUGC 


... 282 J? 


1328 


AAACTCAT C GGGACTGA 


318 


UCAGUCCC CUGAUGAG X CGAA AUGAGUUU 


2827 


1341 


CTGACAAT T CTGTCGTG 


319 


CACGACAG CUGAUGAG X CGAA AUUGUCAG 


2828 


1342 


TGACAATT C TGTCGTGC 


320 


GCACGACA CUGAUGAG X CGAA AAUUGUCA 


2829 


1346 


AATTCTGT C GTGCTCTC 


321 


GAGAGCAC CUGAUGAG X CGAA ACAGAAUU 


2830 


1352 


GTCGTGCT C TCCCGCAA 


322 


UUGCGGGA CUGAUGAG X CGAA AGCACGAC 


28 


1354 


CGTGCTCT C CCGCAAAT 


323 


AUUUGCGG CUGAUGAG X CGAA AGAGCACG 


— — 


1363 


CCGCAAAT A TACATCAT 


324 


AUGAUGUA CUGAUGAG X CGAA AUUUGCGG 




1365 


GCAAATAT A CATCATTT 


325 


AAAUGAUG CUGAUGAG X CGAA AUAUUUGC 


2834 


1369 


ATATACAT C ATTTCCAT 


326 


AUGGAAAU CUGAUGAG X CGAA AUGUAUAU 


2835 




1372 


TACATCAT T TCCATGGC 


327 


GCCAUGGA CUGAUGAG X CGAA AUGAUGUA 




1373 


ACATCATT T CCATGGCT 


328 


AGCCAUGG CUGAUGAG X CGAA AAUGAUGU 


2837 


1374 


CATCATTT C CATGGCTG 




CAGCCAUG CUGAUGAG X CGAA AAAUGAUG 


2838 


1385 


TGGCTGCT A GGCTGTGC 


330 


GCACAGCC CUGAUGAG X CGAA AGCAGCCA 


2839 


1406 


AACTGGAT C CTACGCGG 


331 


CCGCGUAG CUGAUGAG X CGAA AUCCAGUU 


2840 


1409 


TGGATCCT A CGCGGGAC 


332 


GUCCCGCG CUGAUGAG X CGAA AGGAUCCA 


2841 


1420 


CGGGACGT C CTTTGTTT 


333 


AAACAAAG CUGAUGAG X CGAA ACGUCCCG 


2842 


1423 


GACGTCCT T TGTTTACG 


334 


CGUAAACA CUGAUGAG X CGAA AGGACGUC 


2843 


1424 


ACGTCCTT T GTTTACGT 


335 


ACGUAAAC CUGAUGAG X CGAA AAGGACGU 


2844 


1427 


TCCTTTGT T TACGTCCC 


336 


GGGACGUA CUGAUGAG X CGAA ACAAAGGA 


2845. 



WO 01/16312 



PCT/US00/23998 



CCTTTGTT T ACGTCCCG 



CGGGACGU CUGAUGAG X CGAA AACAAAGG 



ACGGGACG CUGAUGAG X CGAA AAACAAAG 



CTTTGTTT A CGTCCCGT 



GCCGACGG CUGAUGAG X CGAA ACGUAAAC 



GTTTACGT C CCGTCGGC 



UCAGCGCC CUGAUGAG X CGAA ACGGGACG 



CGTCCCGT C GGCGCTGA 



GUCCGCGG CUGAUGAG X CGAA AUUCAGCG 



CGCTGAAT C CCGCGGAC 



CGACCCCT C CCGGGGCC 



GGCCCCGG CUGAUGAG X CGAA AGGGGUCG 



GGGCCGCT T GGGGCTCT 



AGAGCCCC CUGAUGAG X CGAA AGCGGCCcf 



TTGGGGC T C TACCGCCC 
GGGGCTCT A CCGCCCGC 
CGCCCGC T T CTCCGCCT 
GCCCGCTT C TCCGCCTA 
CCGCTTCT C CGCCTATT 
CTCCGCCT A TTGTACCG 
CCGCCTAT T GTACCGAC 
CCTATTGT A CCGACCGT 
CCGACCGT C CACGGGGC 
GCGCACCT C TCTTTACG 
GCACCTCT C TTTACGCG 



CCTCTCTT T ACGCGGAC 
CTCTCTTT A CGCGGACT 
CGCG GACT C CCCGTCTG 
CTCCCCGT C TGTGCCTT 
CTGTGCCT T CTCATCTG 
TGTGCCTT C TCATCTGC 
TGCCTTCT C ATCTGCCG 
CTTCTCAT C TGCCGGAC 
TGTGCACT T CGCTTCAC 
GTGCACTT C GCTTCACC 
ACTTCG CT T CACCTCTG 
CTTCGCTT C ACCTCTGC 
CTTCACCT C TGCACGTC 
CTGCACGT C GCATGGAG 
CCCAAGGT C TTGCATAA 
CAAGGT CT T GCATAAGA 
TCTTGCAT A AGAGGACT 
AGAGGACT C TTGGACTT 
AGGACTCT T GGACTTTC 
CTTGGACT T TCAGCAAT 
TTGGACTT T CAGCAATG 
TGGACTTT C AGCAATGT 
AGCAATGT C AACGACCG 
ACCGACCT T GAGGCATA 
TGAGGCAT A CTTCAAAG 
GGCATACT T CAAAGACT 
GCATACTT C AAAGACTG 
CTGTGTGT T TAATGAGT 
TGTGTGTT T AATGAGTG 
GTGTGTTT A ATGAGTGG 
GGAGGAGT T GGGGGAGG 
GAGGAGGT T AGGTTAAA 



GGGCGGUA CUGAUGAG X CGAA AGCCCCAA 
GCGGGCGG CUGAUGAG X CGAA AGAGCCCC 
AGGCG GAG CUGAUGAG X CGAA AGCGGGCG 
UAGGCGGA CUGAU GAG X CGAA AAGCGGGC 
AAUAGGCG CUGAUG AG X CGAA AGAAGCGG 
CGGUACAA CUGAUGAG X CGA A AGGCGGAG 
GUCGGUAC CUGAUGAG X CGAA AUAGGCGG 
ACGGUCGG CUGAUGA G X CGAA ACAAUAGG 
GCCCCGUG CUGAUGAG X CGAA ACGGUCGG 
CGUAA AGA CUGAUGAG X CGAA AGGUGCGC 
CGCGUAAA CUG AUGAG X CGAA AGAGGUGC 
UCCGCGUA CUGA UGAG X CGAA AGAGAGGU 
GUCCGCGU CUGAUGAG X CGAA AAGAGAGG 
AGUCCGCG CUGAUGAG X C GAA AAAGAGAG 
CAGACGGG CUGAUG AG X CGAA AGUCCGCG ~ 
AAGGCACA CUGAUG AG X CGAA ACGGGGAG 
CAGAUGAG CUGAUGAG X CGAA AGGCACAG 
GCAGAUGA CUGAUGAG X CGAA AAGGCACA ~ 
CGGCAGAU CUGAUGAG X CGAA AGAAGGCA 
GUCCGGCA CUGAU GAG X CGAA AUGAGAAG 
GUGAAGC G CUGAUGAG X CGAA AGUGCACA 
GGUGAAGC CUG AUGAG X CGAA AAGUGCAC 
CAGAGGUG CUGAUGAG X CGAA AGCGAAGU 
GCAGAGGU CUGAUGAG X CGAA AAGCGAAG 
GACGUGCA CUGAUGAG X CG AA AGGUGAAG 
CUCCAUGC CUG AUGAG X CGAA ACGUGCAG 
UUAUG CAA CUGAUGAG X CGAA ACCUUGGG 
UCUUAUGC CUGAUG AG X CGAA AGACCUUG 
AGUCCUCU CUGAUGAG X CGA A AUGCAAGA 
AAGUCCAA CUGAUGAG X C GAA AGUCCUCU 
GAAAGUCC CUGAUG AG X CGAA AGAGUCCU~ 
AUUGCUGA CUGAUGAG X CGAA AGUCCAAG 
CAUUGCUG CUGAUGAG X CGAA AAGUCCAA ~ 
ACAUUGCU CU GAUGAG X CGAA AAAGUCCA 
CGGUC GUU CUGAUGAG X CGAA ACAUUGCU 
UAUGCCUC CUGAUGAG X CGAA AGGUCGGU 
CUUUGAAG CUGAUGAG X CGAA AUGCCUCA~ 
AGUCUUUG CUGAUGAG X CGAA AGUAUGCC~ 
CAGUCUUU CUGAUGAG X CG AA AAGUAUGC 
ACUCAUUA CUGAUGAG X CGAA ACACACAG ~ 
CACUCAUU CUGAUGA G X CGAA AACACACA 
CCACUCAU CUGAUGAG X CGAA AAACACAC 
C CUCCCCC CUGAUGAG X CGAA ACUCCUCC 
UUUAACCU CUGAUGAG X CGAA ACCUCCUC~ 
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Table 37 



1757 


AGGAGGTT A GGTTAAAG 


388 


CUUUAACC CUGAUGAG X CGAA AACCUCC'J 


2897 


1761 


GGTTAGGT T AAAGGTCT 


389 


AGACCUUU CUGAUGAG X CGAA ACCUAACC 


2898 


17S2 


GTTAGGTT A AAGGTCTT 


390 


AAGACCUU CUGAUGAG X CGAA AACCUAAC 


2899 


1768 


TTAAAGGT C TTTGTACT 


391 


AGUACAAA CUGAUGAG X CGAA ACCUUUAA 


2900 


1770 


AAAGGTCT T TGTACTAG 


392 


CUAGUACA CUGAUGAG X CGAA AGACCUUU 


2901 


1771 


AAGGTCTT T GTACTAGG 


393 


CCUAGUAC CUGAUGAG X CGAA AAGACCUU 


2902 


1774 


GTCTTTGT A CTAGGAGG 


394 


CCUCCUAG CUGAUGAG X CGAA ACAAAGAC 


2903 


1777 


TTTGTACT A GGAGGCTG 


395 


CAGCCUCC CUGAUGAG X CGAA AGUACAAA 


2904 


1787 


GAGGCTGT A GG CAT AAA 


396 


UUUAUGCC CUGAUGAG X CGAA ACAGCCUC 


2905 


1793 


GTAGGCAT A AATTGGTG 


397 


CACCAAUU CUGAUGAG X CGAA AUGCCUAC 


2906 


1797 


GCATAAAT T GGTGTGTT 


398 


AACACACC CUGAUGAG X CGAA AUUUAUGC 


2907 


1805 


TGGTGTGT T CACCAGCA 


399 


UGCUGGUG CUGAUGAG X CGAA ACACACCA 


2908 


1806 


GGTGTGTT C ACCAGCAC 


400 


GUGCUGGU CUGAUGAG X CGAA AACACACC 


2909 


1824 


ATGCAACT T TTTCACCT 


401 


AGGUGAAA CUGAUGAG X CGAA AGUUGCAU 


2910 


1825 


TGCAACTT T TTCACCTC 


402 


GAGGUGAA CUGAUGAG X CGAA AAGUUGCA 


2911 


1826 


GCAACTTT T TCACCTCT 


403 


AGAGGUGA CUGAUGAG X CGAA AAAGUUGC 


2912 


1827 


CAACTTTT T CACCTCTG 


404 


CAGAGGUG CUGAUGAG X CGAA AAAAGUUG 






AACTTTTT C ACCTCTGC 




GCAGAGGU CUGAUGAG X CGAA AAAAAGUU 


2914 


1833 


TTTCACCT C TGCCTAAT 


406 


AUUAGGCA CUGAUGAG X CGAA AGGUGAAA 


2915 


1839 


CTCTGCCT A ATCATCTC 


407 


GAGAUGAU CUGAUGAG X CGAA AGGCAGAG 


2916 


1842 


TGCCTAAT C ATCTCATG 


408 


CAUGAGAU CUGAUGAG X CGAA AUUAGGCA 


2917 


1845 


CTAATCAT C TCATGTTC 


409 


GAACAUGA CUGAUGAG X CGAA AUGAUUAG 


2918 


1847 


AATCATCT C ATGTTCAT 


410 


AUGAACAU CUGAUGAG X CGAA AGAUGAUU 


2919 


1852 


TCTCATGT T CATGTCCT 


411 


AGGACAUG CUGAUGAG X CGAA ACAUGAGA 


2920 


1853 


CTCATGTT C ATGTCCTA 


412 
413 


UAGGACAU CUGAUGAG X CGAA AACAUGAG 
AACAGUAG CUGAUGAG X CGAA ACAUGAAC 


2922 


1858 
1861 


GTTCATGT C CTACTGTT 
CATGTCCT A CTGTTCAA 


414 


UUGAACAG CUGAUGAG X CGAA AGGACAUG 


2923 


1866 


CCTACTGT T CAAGCCTC 


415 


GAGGCUUG CUGAUGAG X CGAA ACAGUAGG 


2924 


1867 


CTACTGTT C AAGCCTCC 


416 


GGAGGCUU CUGAUGAG X CGAA AACAGUAG 


2925 


1874 


TCAAGCCT C CAAGCTGT 


417 


ACAGCUUG CUGAUGAG X CGAA AGGCUUGA 


2926 


1887 


CTGTGCCT T GGGTGGCT 


418 


AGCCACCC CUGAUGAG X CGAA AGGCACAG 


2927 


1896 


GGGTGGCT T TGGGGCAT 


419 


AUGCCCCA CUGAUGAG X CGAA AGCCACCC 


2928 


1897 


GGTGGCTT T GGGGCATG 


420 


CAUGCCCC CUGAUGAG X CGAA AAGCCACC 


2929 


1911 


ATGGACAT T GACCCGTA 


421 


UACGGGUC CUGAUGAG X CGAA AUGUCCAU 


2930 


1919 


TGACCCGT A TAAAGAAT 


422 


AUUCUUUA CUGAUGAG X CGAA ACGGGUCA 


2931 


1921 


ACCCGTAT A AAGAATTT 


423 


AAAUUCUU CUGAUGAG X CGAA AUACGGGU 


2932 


1928 


TAAAGAAT T TGGAGCTT 


424 


AAGCUCCA CUGAUGAG X CGAA AUUCUUUA 


2933 


1929 


AAAGAATT T GGAGCTTC 


425 


GAAGCUCC CUGAUGAG X CGAA AAUUCUUU 


2934 


1936 


TTGGAGCT T CTGTGGAG 


426 


CUCCACAG CUGAUGAG X CGAA AGCUCCAA 


2935 


1937 


TGGAGCTT C TGTGGAGT 


427 


ACUCCACA CUGAUGAG X CGAA AAGCUCCA 


2936 


1946 


TGTGGAGT T ACTCTCTT 


428 


AAGAGAGU CUGAUGAG X CGAA ACUCCACA 


2937 


1947 


GTGGAGTT A CTCTCTTT 


429 


AAAGAGAG CUGAUGAG X CGAA AACUCCAC 


2938_ 


1950 


GAGTTACT C TCTTTTTT 


430 


AAAAAAGA CUGAUGAG X CGAA AGUAACUC 




1952 


GTTACTCT C TTTTTTGC 




GCAAAAAA CUGAUGAG X CGAA AGAGUAAC 


2940 


1954 


TACTCTCT T TTTTGCCT 


432 


AGGCAAAA CUGAUGAG X CGAA AGAGAGUA 


2941 


1955 


ACTCTCTT T TTTGCCTT 


433 


AAGGCAAA CUGAUGAG X CGAA AAGAGAGU 


2942 


1956 


CTCTCTTT T TTGCCTTC 


434 


GAAGGCAA CUGAUGAG X CGAA AAAGAGAG 


2943 


1957 


TCTCTTTT T TGCCTTCT 


435 


AGAAGGCA CUGAUGAG X CGAA AAAAGAGA 


2944 


1958 


CTCTTTTT T GCCTTCTG 


436 


CAGAAGGC CUGAUGAG X CGAA AAAAAGAG 


2945 


1963 


TTTTGCCT T CTGACTTC 


437 


GAAGUCAG CUGAUGAG X CGAA AGGCAAAA 


2946 


1964 


TTTGCCTT C TGACTTCT 


438 


j AGAAGUCA CUGAUGAG X CGAA AAGGCAAA 


2947 
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TTCTGACT T CTTTCCTT 



AAGGAAAG CUGAUGAG X CGAA AGUCAGAA 



GAAGGAAA CUGAUGAG X CGAA AAGUCAGA 



TCTGACTT C TTTCCTTC 



TGACTTCT T TCCTTCTA 



GACTTCTT T CCTTCTAT 



UAGAAGGA CUGAUGAG X CGAA AGAAGUCA 
AUAGAAGG CUGAUGAG X CGAA AAGAAGUC 



ACTTCTTT C CTTCTATT 



AAUAGAAG CUGAUGAG X CGAA AAAGAAGU 



UCGAAUAG CUGAUGAG X CGAA AGGAAAGA 



2015 
2017 



TCTTTCCT T CTATTCGA 



CTTTCCTT C TATTCGAG 



CUCGAAUA CUGAUGAG X CGAA AAGGAAAG 



TTCCTTCT A TTCGAGAT 



AUCUCGAA CUGAUGAG X CGAA AGAAGGAA 



CCTTCTAT T CGAGATCT 



AGAUCUCG CUGAUGAG X CGAA AUAGAAGG 



CTTCTATT C GAGATCTC 
TTCGAGAT C TCCTCGAC 



GAGAUCUC CUGAUGAG X CGAA AAUAGAAG 



GUCGAGGA CUGAUGAG X CGAA AUCUCGAA 



CGAGATCT C CTCGACAC 
GATCTCCT C GACACCGC 



GUGUCGAG CUGAUGAG X CGAA AGAUCUCG~ 



GCGGUGUC CUGAUGAG X CGAA AGGAGAUC 



CACCGCCT C TGCTCTGT 



ACAGAGCA CUGAUGAG X CGAA AGGCGGUG 



CCTCTGCT C TGTATCGG 



CCGAUACA CUGAUGAG X CGAA AGCAGAGG 



TGCTCTGT A TCGGGGGG 



CCCCCCGA CUGAUGAG X CGAA ACAGAGCA 



CTCT GTAT C GGGGGGCC 
GGGGGCCT T AGAGTCTC 
GGGGCCTT A GAGTCTCC 



GGCCCCCC CUGAUGAG X CGAA AUACAGAG 



GAGACUCU CUGAUGAG X CGAA AGGCCCCC 



GGAGACUC CUGAUGAG X CGAA AAGGCCCC 



GUUCCGGA CUGAUGAG X CGAA ACUCUAAG 
AUGUUCCG CUGAUGAG X CGAA AGACUCUA 
AGGU GAAC CUGAUGAG X CGAA AUGUUCCG 
GU GAGGUG CUGAUGAG X CGAA ACAAUGUU 
GGUGA GGU CUGAUGAG X CGAA AACAAUGU 
CGUAUGGU CUGAUGA G X CGAA AGGUGAAC 
GAGUGCCG CUGA UGAG X CGAA AUGGUGAG 
GCUUGCCU CUGAUGAG X CGAA AGUGCCGU 
ACAC AGAA CUGAUGAG X CGAA AGCUUGCC 
CA ACACAG CUGAUGAG X CGAA AUAGCUUG 
CCAACACA CUGAUGAG X C GAA AAUAGCUU~ 
CUCACCCC CUGAUGAG X CGAA ACACAGAA 
GAUUCAUC CUG AUGAG X CGAA ACUCACCC~ 
GGUGGCUA CUGAUGAG X CGAA AUUCAUCA ~ 
CAGGUGGC CUGAUGAG X CGAA AGAUUCAU~~ 
UUCCAAAU CUGAUGAG X CGAA ACUUCCCA 
AUCUUCCA CUGAU GAG X CGAA AUUACUUC~ 
GAUCUUCC CUGAUGAG X CGAA AAUUACUU~ 
GGAUGCUG CUGAUGAG X CGAA AUCUUCCA ~ 
AUUCCCUG CUGAUGAG X CGAA AUGCUGGA 
UGACU ACU CUGAUGAG X CGAA AUUCCCUG 
CUGACUAC CUGAUGAG X CGAA AAUUCCCU~ 
UAGCUGAC CUGAUGAG X CGAA ACUAAUUC~ 
ACAUAGCU CUGAUGA G X CGAA ACUACUAA~ 
CGUUGACA CUGAUGA G X CGAA AGCUGACU~ 
UUAACGUU CUGAUGAG X CGA A ACAUAGCU 
^ ACGUUGAC 
GCC CAUAU CUGAUGAG X CGAA~ AACGUUGA ~ 
UAG GCCCA CUGAUGAG X CGAA AUUAACGU " 
CUGAUUUU CUGAUGAG X CGAA AGGCCCAU 
AUUUUUAG 
AGUUGUCU 



21S9 
2175 



CTTAG AGT C TCCGGAAC 
TAGAGTCT C CGGAACAT 
CGGAACAT T GTTCACCT 
AACATT GT T CACCTCAC 
ACATTGTT C ACCTCACC 
GTTCACCT C ACCATACG 
CTCACCAT A CGGCACTC 
ACGGC ACT C AGGCAAGC 
GGCAAGCT A TTCTGTGT 
CAAGCTAT T CTGTGTTG 
AAGCTATT C T GTGTTGG 
TTCTGTGT T GGGGTGAG 
GGGTGAGT T GATGAATC 
TGATGA AT C TAGCCACC 
ATGAATCT A GCCACCTG 
TGGGAAGT A ATTTGGAA 
GAAGTAAT T TG GAAGAT 
AAGTAATT T GGAAGATC 
TGGAAGAT C CAGCATCC 
TCCAGCAT C CAGGGAAT 
CAGGGAAT T AGTAGTCA 
AGGGAAT T A GTAGTCAG 
GAATT AGT A GTCAGCTA 
TTAGTAGT C AGCTATGT 
AGTCAGCT A TGTCAACG 
AGCTATGT C AACGTTAA 
GTCAACGT T AATATGGG 
TCAACGTT A ATATGGGC 
ACGTTAAT A TGGGCCTA 
ATGGGCCT A AAAATCAG 
CTAAAAAT C AGACAACT 
AGACAACT A TTGTGGTT~ 



CCCAUAUU CUGAUGAG X CGAA I 



AGUUGUCU CU GAUGAG X CGAA 
AACCACAA CUGAUGAG X CGAA 
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510 



Tauie j/ 



2204 


ACAACTAT T GTGGTTTC 


490 


GAAACCAC CUGAUGAG X CGAA AUAGUUGU 


2999 


2210 


ATTGTGGT T TCACATTT 


491 


AAAUGUGA CUGAUGAG X CGAA ACCACAAU 


. 3000 


2211 


TTGTGGTT T CACATTTC 


492 


GAAAUGUG CUGAUGAG X CGAA AACCACAA 


3001 


2212 


TGTGGTTT C ACATTTCC 


493 


GGAAAUGU CUGAUGAG X CGAA AAACCACA 


..3002 


2217 


TTTCACAT T TCCTGTCT 


494 


AGACAGGA CUGAUGAG X CGAA AUGUGAAA 


3003 


2218 


TTCACATT T CCTGTCTT 


495 


AAGACAGG CUGAUGAG X CGAA AAUGUGAA 


3004 _ 


2219 


TCACATTT C CTGTCTTA 


496 


UAAGACAG CUGAUGAG X CGAA AAAUGUGA 


3005 


2224 


TTTCCTGT C TTACTTTT 


497 


AAAAGUAA CUGAUGAG X CGAA ACAGGAAA 


3006 


2226 


TCCTGTCT T ACTTTTGG 


498 


CCAAAAGU CUGAUGAG X CGAA AGACAGGA 


3007 


2227 


CCTGTCTT A CTTTTGGG 


499 


CCCAAAAG CUGAUGAG X CGAA AAGACAGG 


3008 


2230 


GTCTTACT T TTGGGCGA 


500 


UCGCCCAA CUGAUGAG X CGAA AGUAAGAC 


3009 


2231 


TCTTACTT T TGGGCGAG 


501 


CUCGCCCA CUGAUGAG X CGAA AAGUAAGA 


3010 


2232 


CTTACTTT T GGGCGAGA 


502 


UCUCGCCC CUGAUGAG X CGAA AAAGUAAG 


3011 


2247 


GAAACTGT T CTTGAATA 




UAUUCAAG CUGAUGAG X CGAA ACAGUUUC 


3012 


2248 


AAACTGTT C TTGAATAT 


504 


AUAUUCAA CUGAUGAG X CGAA AACAGUUU 


3013 


2250 


ACTGTTCT T GAATATTT 


505 


AAAUAUUC CUGAUGAG X CGAA AGAACAGU 


3014 


2255 


TCTTGAAT A TTTGGTGT 


506 


AC AC C AAA CUGAUGAG X CGAA AUUCAAGA 


3015 


2257 


TTGAATAT T TGGTGTCT 


507 


AGACACCA CUGAUGAG X CGAA AUAUUCAA 


3016 


2258 


TGAATATT T GGTGTCTT 


508 


AAGACACC CUGAUGAG X CGAA AAUAUUCA 


3017 


2264 


TTTGGTGT C TTTTGGAG 


509 


CUCCAAAA CUGAUGAG X CGAA ACACCAAA 


3018 


2266 


TGGTGTCT T TTGGAGTG 


510 


CACUCCAA CUGAUGAG X CGAA AGACACCA 


3019 


2267 


GGTGTCTT T TGGAGTGT 


511 1 ACACUCCA CUGAUGAG X CGAA AAGACACC 


3020 


2268 


GTGTCTTT T GGAGTGTG 


512 


CACACUCC CUGAUGAG X CGAA AAAGACAC 


3021 


2280 


GTGTGGAT T CGCACTCC 


513 


GGAGUGCG CUGAUGAG X CGAA AUCCACAC 


3022 


2281 


TGTGGATT C GCACTCCT 


514 


AGGAGUGC CUGAUGAG X CGAA AAUCCACA 


3023 


2287 


TTCGCACT C CTCCTGCA 


515 


UGCAGGAG CUGAUGAG X CGAA AGUGCGAA 


3024 


N 2290 


GCACTCCT C CTGCATAT 


516 


AUAUGCAG CUGAUGAG X CGAA AGGAGUGC 


3025 


2297 


TCCTGCAT A TAGACCAC 


517 


GUGGUCUA CUGAUGAG X CGAA AUGCAGGA 


3025 


2299 


CTGCATAT A GACCACCA 


518 


UGGUGGUC CUGAUGAG X CGAA AUAUGCAG 


3027 


2317 


ATGCCCCT A TCTTATCA 


519 


UGAUAAGA CUGAUGAG X CGAA AGGGGCAU 


3028 


2319 


GCCCCTAT C TTATCAAC 


520 


GUUGAUAA CUGAUGAG X CGAA AUAGGGGC 


3029 


2321 


CCCTATCT T ATCAACAC 


521 


GUGUUGAU CUGAUGAG X CGAA AGAUAGGG 


3030 


2322 


CCTATCTT A TCAACACT 


522 


AGUGUUGA CUGAUGAG X CGAA AAGAUAGG 


3031 


2324 


TATCTTAT C AACACTTC 


523 


GAAGUGUU CUGAUGAG X CGAA AUAAGAUA 


3032 


2331 


TCAACACT T CCGGAAAC 


524 


GUUUCCGG CUGAUGAG X CGAA AGUGUUGA 


3033 


2332 


CAACACTT C CGGAAACT 


525 


AGUUUCCG CUGAUGAG X CGAA AAGUGUUG 


3034 


2341 


CGGAAACT A CTGTTGTT 


526 


AACAACAG CUGAUGAG X CGAA AGUUUCCG 


3035 


2346 


ACTACTGT T GTTAGACG 


527 


CGUCIJAAC CUGAUGAG X CGAA ACAGUAGU 


3036 


2349 


ACTGTTGT T AGACGAAG 


528 


CUUCGUCU CUGAUGAG X CGAA ACAACAGU 


3037 


2350 


CTGTTGTT A GACGAAGA 


529 


UCUUCGUC CUGAUGAG X CGAA AACAACAG 


3038 ... . 


2366 


AGGCAGGT C CCCTAGAA 


530 


UUCUAGGG CUGAUGAG X CGAA ACCUGCCU 


1^21 


2371 


GGTCCCCT A GAAGAAGA 


531 


UCUUCUUC CUGAUGAG X CGAA AGGGGACC 


3040 


2383 


GAAGAACT C CCTCGCCT 


532 


AGGCGAGG CUGAUGAG X CGAA AGUUCUUC 


^ 








UGCGAGGC CUGAUGAG X CGAA AGGGAGUU 




2392 


CCTCGCCT C GCAGACGA 


534 


UCGUCUGC CUGAUGAG X CGAA AGGCGAGG 


3043 


2405 


ACGAAGGT C TCAATCGC 


535 


GCGAUUGA CUGAUGAG X CGAA ACCUUCGU 


3044 


2407 


GAAGGTCT C AATCGCCG 


536 


CGGCGAUU CUGAUGAG X CGAA AGACCUUC 


3045 


2411 


GTCTCAAT C GCCGCGTC 


537 


GACGCGGC CUGAUGAG X CGAA AUUGAGAC 


3046 


2419 


CGCCGCGT C GCAGAAGA 


538 


UCUUCUGC CUGAUGAG X CGAA ACGCGGCG 


3047 


2429 


CAGAAGAT C TCAATCTC 


539 


GAGAUUGA CUGAUGAG X CGAA AUCUUCUG 


304B 


2431 


GAAGATCT C AATCTCGG 


540 


CCGAGAUU CUGAUGAG X CGAA AGAUCUUC 


3049 
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ATCTCAAT C TCGGGAAT 



AUUCCCGA CUGAUGAG X CGAA AUUGAGAU 



CTCAATCT C GGGAATCT 



AGAUUCCC CUGAUGAG X CGAA AGAUUGAG 



TCGGGAAT C TCAATGTT 



AACAUUGA CUGAUGAG X CGAA AUUCCCGA 



GGGAATCT C AATGTTAG 



CUAACAUU CUGAUGAG X CGAA AGAUUCCC 



CTCAATGT T AGTATTCC 



GGAAUACU CUGAUGAG X CGAA ACAUUGAG 



TCAATGTT A GTATTCCT 



AGGAAUAC CUGAUGAG X CGAA AACAUUGA 



ATGTTAGT A TTCCTTGG 



CCAAGGAA CUGAUGAG X CGAA ACUAACAU 



GTTAGTAT T CCTTGGAC 



GUCCAAGG CUGAUGAG X CGAA AUACUAAC 



TTAGTATT C CTTGGACA 



UGUCCAAG CUGAUGAG X CGAA AAUACUAA 



GTATTCCT T GGACACAT 



AUGUGUCC CUGAUGAG X CGAA AGGAAUAC 



GGACACAT A AGGTGGGA 



UCCCACCU CUGAUGAG X CGAA AUGUGUCC 



GGGAAACT T TACGGGGC 



GCCCCGUA CUGAUGAG X CGAA AGUUUCCC 



GGAAACTT T ACGGGGCT 



AGCCCCGU CUGAUGAG X CGAA AAGUUUCC 



GAAACTTT A CGGGGCTT 



AAGCCCCG CUGAUGAG X CGAA AAAGUUUC 



ACGGGGCT T TATTCTTC 



GAAGAAUA CUGAUGAG X CGAA AGCCCCGU 



CGGGGCTT T ATTCTTCT 



AGAAGAAU CUGAUGAG X CGAA AAGCCCCG 



GGGGCTTT A TTCTTCTA 



UAGAAGAA CUGAUGAG X CGAA AAAGCCCC 



GGCTTTAT T CTTCTACG 



CGUAGAAG CUGAUGAG X CGAA AUAAAGCC 



GCTTTATT C TTCTACGG 



CCGUAGAA CUGAUGAG X CGAA AAUAAAGC 



TTTATTCT T CTACGGTA 



UACCGUAG CUGAUGAG X CGAA AGAAUAAA 



TTATTCTT C TACGGTAC 



GUACCGUA CUGAUGAG X CGAA AAGAAUAA 



ATTCTTCT A CGGTACCT 



AGGUACCG CUGAUGAG X CGAA AGAAGAAU 



TCTACGGT A CCTTGCTT 



AAGCAAGG CUGAUGAG X CGAA ACCGUAGA 



CGGTACCT T GCTTTAAT 



AUUAAAGC CUGAUGAG X CGAA AGGUACCG 



ACCTTGCT T TAATCCTA 



UAGGAUUA CUGAUGAG X CGAA AGCAAGGU 



CCTTGCTT T AATCCTAA 



UUAGGAUU CUGAUGAG X CGAA AAGCAAGG 



CTTGCTTT A ATCCTAAA 



UUUAGGAU CUGAUGAG X CGAA AAAGCAAG 



GCTTTAAT C CTAAATGG 



CCAUUUAG CUGAUGAG X CGAA AUUAAAGC 



TTAATCCT A AATGGCAA 



UUGCCAUU CUGAUGAG X CGAA AGGAUUAA 



GGCAAACT C CTTCTTT' 



AAAAGAAG CUGAUGAG X CGAA AGUUUGCC 



AAACTCCT T CTTTTCCT 



AGGAAAAG CUGAUGAG X CGAA AGGAGUUU 



AACTCCTT C TTTTCCTG 



CAGGAAAA CUGAUGAG X CGAA AAGGAGUU 



CTCCTTCT T TTCCTGAC 



GUCAGGAA CUGAUGAG X CGAA AGAAGGAG 



TCCTTCTT T TCCTGACA 



UGUCAGGA CUGAUGAG X CGAA AAGAAGGA 



CCTTCTTT T CCTGACAT 



AUGUCAGG CUGAUGAG X CGAA AAAGAAGG 



CTTCTTTT C CTGACATT 



AAUGUCAG CUGAUGAG X CGAA AAAAGAAG 



CCTGACAT T CATTTGCA 



UGCAAAUG CUGAUGAG X CGAA AUGUCAGG 



CTGACATT C ATTTGCAG 



CUGCAAAU CUGAUGAG X CGAA AAUGUCAG 



ACATTCAT T TGCAGGAG 



CUCCUGCA CUGAUGAG X CGAA AUGAAUGU 



CATTCATT T GCAGGAGG 



CCUCCUGC CUGAUGAG X CGAA AAUGAAUG 



GAGGACAT T GTTGATAG 



CUAUCAAC CUGAUGAG X CGAA AUGUCCUC 



GACATTGT T GATAGATG 



CAUCUAUC CUGAUGAG X CGAA ACAAUGUC 



TTGTTGAT A GATGTAAG 



CUUACAUC CUGAUGAG X CGAA AUCAACAA 



ATAGATGT A AGCAATTT 



AAAUUGCU CUGAUGAG X CGAA ACAUCUAU 



TAAGCAAT T TGTGGGGC 



GCCCCACA CUGAUGAG X CGAA AUUGCUUA 



AAGCAATT T GTGGGGCC 



GGCCCCAC CUGAUGAG X CGAA AAUUGCUU 



GGGCCCCT T ACAGTAAA 



UUUACUGU CUGAUGAG X CGAA AGGGGCCC 



GGCCCCTT A CAGTAAAT 



AUUUACUG CUGAUGAG X CGAA AAGGGGCC 



CTTACAGT A AATGAAAA 



UUUUCAUU CUGAUGAG X CGAA ACUGUAAG 



AGGAGACT T AAATTAAC 



GUUAAUUU CUGAUGAG X CGAA AGUCUCCU 



GGAGACTT A AATTAACT 



AGUUAAUU CUGAUGAG X CGAA AAGUCUCC 
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ACTTAAAT T AACTATGC 



GCAUAGUU CUGAUGAG X CGAA AUUUAAGU 



CTTAAATT A ACTATGCC 



GGCAUAGU CUGAUGAG X CGAA AAUUUAAG 



AATTAACT A TGCCTGCT 



AGCAGGCA CUGAUGAG X CGAA AGUUAAUU 



TGCCTGCT A GGTTTTAT 



AUAAAACC CUGAUGAG X CGAA AGCAGGCA 



TGCTAGGT T TTATCCCA 



UGGGAUAA CUGAUGAG X CGAA ACCUAGCA 



GCTAGGTT T TATCCCAA 



UUGGGAUA CUGAUGAG X CGAA AACCUAGC 



CTAGGTTT T ATCCCAAT 



AUUGGGAU CUGAUGAG X CGAA AAACCUAG 



TAGGTTTT A TCCCAATG 



CAUUGGGA CUGAUGAG X CGAA AAAACCUA 



GGTTTTAT C CCAATGTT 



AACAUUGG CUGAUGAG X CGAA AUAAAACC 



CCCAATGT T ACTAAATA 



UAUUUAGU CUGAUGAG X CGAA ACAUUGGG 



CCAATGTT A CTAAATAT 



AUAUUUAG CUGAUGAG X CGAA AACAUUGG 



ATGTTACT A AATATTTG 



CAAAUAUU CUGAUGAG X CGAA AGUAACAU 



TACTAAAT A TTTGCCCT 



AGGGCAAA CUGAUGAG X CGAA AUUUAGUA 



CTAAATAT T TGCCCTTA 



UAAGGGCA CUGAUGAG X CGAA AUAUUUAG 



TAAATATT T GCCCTTAG 



CUAAGGGC CUGAUGAG X CGAA AAUAUUUA 



TTTGCCCT T AGATAAAG 



CUUUAUCU CUGAUGAG X CGAA AGGGCAAA 



TTGCCCTT A GATAAAGG 



CCUUUAUC CUGAUGAG X CGAA AAGGGCAA 



CCTTAGAT A AAGGGATC 



GAUCCCUU CUGAUGAG X CGAA AUCUAAGG 



AAAGGGAT C AAACCGTA 



UACGGUUU CUGAUGAG X CGAA AUCCCUUU 



CAAACCGT A TTATCCAG 



CUGGAUAA CUGAUGAG X CGAA ACGGUUUG 



AACCGTAT T ATCCAGAG 



CUCUGGAU CUGAUGAG X CGAA AUACGGUU 



ACCGTATT A TCCAGAGT 



ACUCUGGA CUGAUGAG X CGAA AAUACGGU 



CGTATTAT C CAGAGTAT 



AUACUCUG CUGAUGAG X CGAA AUAAUACG 



TCCAGAGT A TGTAGTTA 



UAACUACA CUGAUGAG X CGAA ACUCUGGA 



GAGTATGT A GTTAATCA 



UGAUUAAC CUGAUGAG X CGAA ACAUACUC 



TATGTAGT T AATCATTA 



UAAUGAUU CUGAUGAG X CGAA ACUACAUA 



ATGTAGTT A ATCATTAC 



TAGTTAAT C ATTACTTC 



GUAAUGAU CUGAUGAG X CGAA AACUACAU 
GAAGUAAU CUGAUGAG X CGAA AUUAACUA 



TTAATCAT T ACTTCCAG 



CUGGAAGU CUGAUGAG X CGAA AUGAUUAA 



TAATCATT A CTTCCAGA 



UCUGGAAG CUGAUGAG X CGAA AAUGAUUA 



TCATTACT T CCAGACGC 



GCGUCUGG CUGAUGAG X CGAA AGUAAUGA 



CATTACTT C CAGACGCG 



CGCGUCUG CUGAUGAG X CGAA AAGUAAUG 



CGCGACAT T ATTTACAC 



GUGUAAAU CUGAUGAG X CGAA AUGUCGCG 



GCGACATT A TTTACACA 



UGUGUAAA CUGAUGAG X CGAA AAUGUCGC 



GACATTAT T TACACACT 



AGUGUGUA CUGAUGAG X CGAA AUAAUGUC 



ACATTATT T ACACACTC 



GAGUGUGU CUGAUGAG X CGAA AAUAAUGU 



CATTATTT A CACACTCT 



AGAGUGUG CUGAUGAG X CGAA AAAUAAUG 



TACACACT C TTTGGAAG 



CUUCCAAA CUGAUGAG X CGAA AGUGUGUA 



CACACTCT T TGGAAGGC 



GCCUUCCA CUGAUGAG X CGAA AGAGUGUG 



ACACTCTT T GGAAGGCG 



CGCCUUCC CUGAUGAG X CGAA AAGAGUGU 



GCGGGGAT C TTATATAA 



UUAUAUAA CUGAUGAG X CGAA AUCCCCGC 



GGGGATCT T ATATAAAA 



UUUUAUAU CUGAUGAG X CGAA AGAUCCCC 



GGGATCTT A TATAAAAG 



CUUUUAUA CUGAUGAG X CGAA AAGAUCCC 



GATCTTAT A TAAAAGAG 



CUCUUUUA CUGAUGAG X CGAA AUAAGAUC 



TCTTATAT A AAAGAGAG 



CUCUCUUU CUGAUGAG X CGAA AUAUAAGA 



AAGAGAGT C CACACGTA 



UACGUGUG CUGAUGAG X CGAA ACUCUCUU 



CCACACGT A GCGCCTCA 



UGAGGCGC CUGAUGAG X CGAA ACGUGUGG 



TAGCGCCT C ATTTTGCG 



CGCAAAAU CUGAUGAG X CGAA AGGCGCUA 



CGCCTCAT T TTGCGGGT 



ACCCGCAA CUGAUGAG X CGAA AUGAGGCG 



GCCTCATT T TGCGGGTC 



GACCCGCA CUGAUGAG X CGAA AAUGAGGC 



CCTCATTT T GCGGGTCA 



UGACCCGC CUGAUGAG X CGAA AAAUGAGG 
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TTGCGGGT C AC CAT ATT 
GTCACCAT A TTCTTGGG 
CACCATAT T CTTGGGAA 



AAUAUGGU CU GAUGAG X CGAA ACCCGCAA 
CCCAAGAA CUGAUGAG X CGAA AUGGUGAC 
UUCCCAAG CUGAUGAG X CGAA AUAUGGUG 



GUUCCCAA CUGAUGAG X CGAA AAUAUGGU 



ACCATATT C TTGGGAAC 



CATATTCT T GGGAACAA 



UUGUUCCC CUGAUGAG X CGAA AGAAUAUG 



AACAAGAT C TACAGCAT 



AUGCUGUA CUGAUGAG X CGAA AUCUUGUU 



CAAGATCT A CAGCATGG 



CCAUGCUG CUGAUGAG X CGAA AGAUCUUG 



TGGGAGGT T GGTCTTCC 



GGAAGACC CUGAUGAG X CGAA ACCUCCCA 



AGGTTGGT C TTCCAAAC 



GUUUGGAA CUGAUGAG X CGAA ACCAACCU 



GTTGGTCT T CCAAACCT 



AGGUUUGG CUGAUGAG X CGAA AGACCAAC 



GAGGUUUG CUGAUGAG X CGAA AAGACCAA 



TTGGTCTT C CAAACCTC 
CCAAACCT C GAAAAGGC 



GCCUUUUC CUGAUGAG X CGAA AGGUUUGG 



GGACAAAT C TTTCTGTC 



GACAGAAA CUGAUGAG X CGAA AUUUGUCC 



GGGACAGA CUGAUGAG X CGAA AGAUUUGU 



ACAAATCT T TCTGTCCC 



GGGGACAG CUGAUGAG X CGAA AAGAUUUG 



CAAATCTT T CTGTCCCC 



UGGGGACA CUGAUGAG X CGAA AAAGAUUU 



AAATCTTT C TGTCCCCA 



GGAUUGGG CUGAUGAG X CGAA ACAGAAAG 



2920 
2921 



CTTTCTGT C CCCAATCC 



TCCCCAAT C CCCTGGGA 



UCCCAGGG CUGAUGAG X CGAA AUUGGGGA 



CCTGGGAT T CTTCCCCG 



CGGGGAAG CUGAUGAG X CGAA AUCCCAGG 



CTGGGATT C TTCCCCGA 



UCGGGGAA CUGAUGAG X CGAA AAUCCCAG 



GAUCGGGG CUGAUGAG X CGAA AGAAUCCC 



GGGATTCT T CCCCGATC 



UGAUCGGG CUGAUGAG X CGAA AAGAAUCC 



GGATTCTT C CCCGATCA 



TCCCCGAT C ATCAGTTG 



CAACUGAU CUGAUGAG X CGAA AUCGGGGA 



CCGATCAT C AGTTGGAC 



GUCCAACU CUGAUGAG X CGAA AUGAUCGG 



TCATCAGT T GGACCCTG 



CAGGGUCC CUGAUGAG X CGAA ACUGAUGA 



! CUGAUGAG X CGAA AUGCAGGG 



CCCTGCAT T CAAAGCCA 



UUGGCUUU CUGAUGAG X CGAA AAUGCAGG 



CCTGCATT C AAAGCCAA 



GAUUUACU CUGAUGAG X CGAA AGUUGGCU 



C AGTAAATC 
A AATCCAGA 



AACTCAGT 
CAGTAAAT C CAGATTGG 



UCUGGAUU CUGAUGAG X CGAA ACUGAGUU 



CCAAUCUG CUGAUGAG X CGAA AUUUACUG 



GAGGUCCC CUGAUGAG X CGAA AUCUGGAU 



ATCCAGAT T GGGACCTC 



TGGGACCT C AACCCGCA 



UGCGGGUU CUGAUGAG X CGAA AGGUCCCA 



CUGGCCCG CUGAUGAG X CGAA AUGCUCCC 



GGGAGCAT T CGGGCCAG 
GGAGCATT C GGGCCAGG 



CCUGGCCC CUGAUGAG X CGAA AAUGCUCC 



GCCAGGGT T CACCCCTC 



GAGGGGUG CUGAUGAG X CGAA ACCCUGGC 



CCAGGGTT C ACCCCTCC 



GGAGGGGU CUGAUGAG X CGAA AACCCUGG 



TCACCCCT C CCCATGGG 



CCCAUGGG CUGAUGAG X CGAA AGGGGUGA 



UCCACCCC CUGAUGAG X CGAA ACAGUCCC 



GGGACTGT T GGGGTGGA 



GGAGCCCT C ACGCTCAG 



CUGAGCGU 



CUGAUGAG X CGAA AGGGCUCC 



CTCACGCT C AGGGCCTA 



J CUGAUGAG X CGAA AGCGUGAG 



CAGGGCCT A CTCACAAC 
GGCCTACT C ACAACTGT 



GUUGUGAG CUGAUGAG X CGAA AGGCCCUG 



J CUGAUGAG X CGAA AGUAGGCC 



CAGCAGCT C CTCCTCCT 



AGGAGGAG CUGAUGAG X CGAA AGCUGCUG 



CAGCTCCT C CTCCTGCC 



GGCAGGAG CUGAUGAG X CGAA AGGAGCUG 



CTCCTCCT C CTGCCTCC 
TCCTGCCT C CACCAATC 



GGAGGCAG CUGAUGAG X CGAA AGGAGGAG 



GAUUGGUG CUGAUGAG X CGAA AGGCAGGA 



CCACCAAT C GGCAGTCA 
TCGGCAGT C AGGAAGGC 



UGACUGCC CUGAUGAG X CGAA AUUGGUGG 



GCCUUCCU CUGAUGAG X CGAA ACUGCCGA 



GGCAGCCT A CTCCCTTA 



UAAGGGAG CUGAUGAG X CGAA AGGCUGCC 



AGAUAAGG CUGAUGAG X CGAA AGUAGGCU 



AGCCTACT C CCTTATCT 



GUGGAGAU CUGAUGAG X CGAA AGGGAGUA 



TACTCCCT T ATCTCCAC 
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ACTCCCTT A TCTCCACC 



TCCCTTAT C TCCACCTC 



GGUGGAGA CUGAUGAG X CGAA AAGGGAGU 



GAGGUGGA CUGAUGAG X CGAA AUAAGGGA 



3177 
3183 



CCTTATCT C CACCTCTA 



UAGAGGUG CUGAUGAG X CGAA AGAUAAGG 



CTCCACCT C TAAGGGAC 



GUCCCUUA CUGAUGAG X CGAA AGGUGGAG 



CCACCTCT A AGGGACAC 



GUGUCCCU CUGAUGAG X CGAA AGAGGUGG 



GGGACACT C ATCCTCAG 
ACACTCAT C CTCAGGCC 



CUGAGGAU CUGAUGAG X CGAA AGUGUCCC 



GGCCUGAG CUGAUGAG X CGAA AUGAGUGU 



CTCATCCT C AGGCCATG 



CAUGGCCU CUGAUGAG X CGAA AGGAUGAG 



Input Sequence = AF100308. Cut Site = UH/ . 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC c 
AF100308 (Hepatitis B virus strain 2-18, 3215 bp) 



other stem II) 
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Table 38 

Table 38: Human HBV Inozyme and Substrate Sequence 



146 
147 



AACTCCAC C ACTTTCCA" 



ACTCCACC A GTTTCCAC 



TCCACCAG T TTCCACCA~ 



CCACTTTC C ACCAAACf" 



CACTTTCC A CCAAACTC 



CTTTCCAC C AAACTCTT 



TTTCCACC A AACTCTTC 



CACCAAAC T CTTCAAGA~ 



CCAAACTC T TCAAGATC 



AACTCTTC A AGATCCCA 



TCAAGATC C CAGAGTCA~ 



CAAGATCC C AGAGTCAG 



AAGATCCC A GAGTCAGG 



CCAGAGTC A GGGCCCTG" 



GTCAGGGC C CTGTACTT 



TCAGGGCC C TGTACTTT 



CAGGGCCC T GTACTTTC 



CCCTGTAC T TTCCTGCT 



GTACTTTC C TGCTGGTtT 



TACTTTCC T GCTGGTGG~ 



TTTCCTGC T GGTGGCTC 



CTGGTGGC T CCAGTTCA 



GGTGGCTC C AGTTCAGG 



GTGGCTCC A GTTCAGGA 



TCCAGTTC A GGAACAGT 



TCAGGAAC A GTGAGCCC 



CAGTGAGC C CTGCTCAG 



AGTGAGCC C TGCTCAGA 



GTGAGCCC T GCTCAGAA 



AGCCCTGC T CAGAATAC 



CCCTGCTC A GAATACTG" 



CAGAATAC T GTCTCTGC 



ATACTGTC T CTGCCATA 



ACTGTCTC T GCCATATC 



Ribozyme 

UGGAAAGU CUGAUGAG X CGAA IUGGAGUU 



GUGGAAAG CUGAUGAG X CGAA IGUGGAGU 



UGGUGGAA CUGAUGAG X CGAA IUGGUGGA 



AGUUUGGU CUGAUGAG X CGAA IAAAGUGG~ 



GAGUUUGG CUGAUGAG X CGAA IGAAAGUG 



AAGAGUUU CUGAUGAG X CGAA IUGGAAAG 



GAAGAGUU CUGAUGAG X CGAA IGUGGAAA 



UCUUGAAG CUGAUGAG X CGAA IUUUGGUG 



GAUCUUGA CUGAUGAG X CGAA IAGUUUGG 



UGGGAUCU CUGAUGAG X CGAA IAAGAGUU~ 



UGACUCUG CUGAUGAG X CGAA IAUCUUGA~ 



CUGACUCU CUGAUGAG X CGAA IGAUCUUG 



CCUGACUC CUGAUGAG X CGAA IGGAUCUU 



CAGGGCCC CUGAUGAG X CGAA IACUCUGG 



AAGUACAG CUGAUGAG X CGAA ICCCUGAC 



AAAGUACA CUGAUGAG X CGAA IGCCCUGA 



GAAAGUAC CUGAUGAG X CGAA IGGCCCUG 



AGCAGGAA CUGAUGAG X CGAA IUACAGGG 



CACCAGCA CUGAUGAG X CGAA IAAAGUAC 



CCACCAGC CUGAUGAG X CGAA IGAAAGUA 



GAGCCACC CUGAUGAG X CGAA ICAGGAAA 



UGAACUGG CUGAUGAG X CGAA ICCACCAG 



CCUGAACU CUGAUGAG X CGAA IAGCCACC~ 



GTCTCTGC C ATATCGTC 



TCTCTGCC A TATCGTCA 



ATATCGTC A ATCTTATC~ 



CGTCAATC T TATCGAAG 



TCGAAGAC T GGGGACCC 



CTGGGGAC C CTGTACCG 



TGGGGACC C TGTACCGA 



GGGGACCC T GTACCGAA 



CCCTGTAC C GAACATGG 



TACCGAAC A TGGAGAAC 



TGGAGAAC A TCGCATCa" 



AACATCGC A TCAGGACT 



ATCGCATC A ggactcct" 



UCCUGAAC CUGAUGAG X CGAA IGAGCCAC 



ACUGUUCC CUGAUGAG X CGAA IAACUGGA 



GGGCUC AC CUGAUGAG X CGAA IUUCCUGA 
CUGAGCAG CUGAUGAG X CGAA ICUCACUG 



UCUGAGCA CUGAUGAG X CGAA IGCUCACU 



UUCUGAGC CUGAUGAG X CGAA IGGCUCA~ 



GUAUUCUG CUGAUGAG X CGAA ICAGGGCU 



CAGUAUUC CUGAUGAG X CGAA IAGCAGGG 



GCAGAGAC CUGAUGAG X CGAA IUAUUCUG 



UAUGGCAG CUGAUGAG X CGAA IACAGUAU 



GAUAUGGC CUGAUGAG X CGAA IAGACAGU 



GACGAUAU CUGAUGAG X CGAA ICAGAGAC 



UGACGAUA CUGAUGAG X CGAA IGCAGAGA 



GAUAAGAU CUGAUGAG X CGAA IACGAUAU~ 



CUUCGAUA CUGAUGAG X CGAA IAUUGACG 



GGGUCCCC CUGAUGAG X CGAA IUCUUCGA 



CGGUACAG CUGAUGAG X CGAA IUCCCCAG 



UCGGUACA CUGAUGAG X CGAA IGUCCCCA 



UUCGGUAC CUGAUGAG X CGAA IGGUCCCC 



CCAUGUUC CUGAUGAG X CGAA IUACAGGG 



GUUCUCCA CUGAUGAG X CGAA IUUCGGUA 



UGAUGCGA CUGAUGA G X CGAA IUUCUCC A~ 
AGUCCUGA CUGAUGAG X CGAA ICGAUGUU 



AGGAGUCC CUGAUGAG X CGAA IAUGCGAU 



3242 
3243 
3244 



3248 
3249 



3255 
3256 
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ATCAGGAC T CCTAGGAC 



GUCCUAGG CUGAUGAG X CGAA IUCCUGAU 



CAGGACTC C TAGGACCC 



GGGUCCUA CUGAUGAG X CGAA IAGUCCUG 



AGGACTCC T AGGACCCC 



GGGGUCCU CUGAUGAG X CGAA IGAGUCCU 



CCTAGGAC C CCTGCTCG 



CGAGCAGG CUGAUGAG X CGAA IUCCUAGG 



CTAGGACC C CTGCTCGT 
TAGGACCC C TGCTCGTG 



ACGAGCAG CUGAUGAG X CGAA IGUCCUAG 



CACGAGCA CUGAUGAG X CGAA IGGUCCUA 



AGGACCCC T GCTCGTGT 



ACACGAGC CUGAUGAG X CGAA IGGGUCCU 



ACCCCTGC T CGTGTTAC 



GUAACACG CUGAUGAG X CGAA ICAGGGGU 



CGTGTTAC A GGCGGGGT 



ACCCCGCC CUGAUGAG X CGAA IUAACACG 



GGTTTTTC T TGTTGACA 



UGUCAACA CUGAUGAG X CGAA IAAAAACC 



TTGTTGAC A AAAATCCT 



AGGAUUUU CUGAUGAG X CGAA IUCAACAA 



CAAAAATC C TCACAATA 



UAUUGUGA CUGAUGAG X CGAA IAUUUUUG 



AAAAATCC T CACAATAC 



GUAUUGUG CUGAUGAG X CGAA IGAUUUUU 



AAATCCTC A CAATACCA 



UGGUAUUG CUGAUGAG X CGAA IAGGAUUU 



ATCCTCAC A ATACCACA 



UGUGGUAU CUGAUGAG X CGAA IUGAGGAU 



CACAATAC C ACAGAGTC 



GACUCUGU CUGAUGAG X CGAA IUAUUGUG 



ACAATACC A CAGAGTCT 



AGACUCUG CUGAUGAG X CGAA IGUAUUGU 



AATACCAC A GAGTCTAG 



CUAGACUC CUGAUGAG X CGAA IUGGUAUU 



ACAGAGTC T AGACTCGT 



ACGAGUCU CUGAUGAG X CGAA IACUCUGU 



GTCTAGAC T CGTGGTGG 



CCACCACG CUGAUGAG X CGAA IUCUAGAC 



TGGTGGAC T TCTCTCAA 



UUGAGAGA CUGAUGAG X CGAA IUCCACCA 



TGGACTTC T CTCAATTT 
GACTTCTC T CAATTTTC 



AAAUUGAG CUGAUGAG X CGAA IAAGUCCA 



GAAAAUUG CUGAUGAG X CGAA IAGAAGUC 



CTTCTCTC A ATTTTCTA 
CAATTTTC T AGGGGGAA 



UAGAAAAU CUGAUGAG X CGAA IAGAGAAG 



UUCCCCCU CUGAUGAG X CGAA IAAAAUUG 



GGGGGAAC A CCCGTGTG 



CACACGGG CUGAUGAG X CGAA IUUCCCCC 



GGGAACAC C CGTGTGTC 



GACACACG CUGAUGAG X CGAA IUGUUCCC 



GGAACACC C GTGTGTCT 



AGACACAC CUGAUGAG X CGAA IGUGUUCC 



307 
308 
317 
321 



CGTGTGTC T TGGCCAAA 



UUUGGCCA CUGAUGAG X CGAA IACACACG 



GTCTTGGC C AAAATTCG 



CGAAUUUU CUGAUGAG X CGAA ICCAAGAC 



TCTTGGCC A AAATTCGC 



GCGAAUUU CUGAUGAG X CGAA I GCCAAGA 



AAATTCGC A GTCCCAAA 



UUUGGGAC CUGAUGAG X CGAA ICGAAUUU 



TCGCAGTC C CAAATCTC 



GAGAUUUG CUGAUGAG X CGAA IACUGCGA 



CGCAGTCC C AAATCTCC 



GGAGAUUU CUGAUGAG X CGAA IGACUGCG 



GCAGTCCC A AATCTCCA 



UGGAGAUU CUGAUGAG X CGAA IGGACUGC 



CCCAAATC T CCAGTCAC 



GUGACUGG CUGAUGAG X CGAA IAUUUGGG 



CAAATCTC C AGTCACTC 



GAGUGACU CUGAUGAG X CGAA IAGAUUUG 



AAATCTCC A GTCACTCA 



UGAGUGAC CUGAUGAG X CGAA IGAGAUUU 



CTCCAGTC A CTCACCAA 



UUGGUGAG CUGAUGAG X CGAA IACUGGAG 



CCAGTCAC T CACCAACC 



GGUUGGUG CUGAUGAG X CGAA IUGACUGG 



339 
341 
342 
345 
346 



AGTCACTC A CCAACCTG 



CAGGUUGG CUGAUGAG X CGAA IAGUGACU 



TCACTCAC C AACCTGTT 



AACAGGUU CUGAUGAG X CGAA IUGAGUGA 



CACTCACC A ACCTGTTG 



CAACAGGU CUGAUGAG X CGAA IGUGAGUG 



TCACCAAC C TGTTGTCC 



GGACAACA CUGAUGAG X CGAA IUUGGUGA 



CACCAACC T GTTGTCCT 



AGGACAAC CUGAUGAG X CGAA IGUUGGUG 



CTGTTGTC C TCCAATTT 



AAAUUGGA CUGAUGAG X CGAA IACAACAG 



TGTTGTCC T CCAATTTG 



CAAAUUGG CUGAUGAG X CGAA IGACAACA 



TTGTCCTC C AATTTGTC 



GACAAAUU CUGAUGAG X CGAA IAGGACAA 



TGTCCTCC A ATTTGTCC 
AATTTGTC C TGGTTATC 



GGACAAAU CUGAUGAG X CGAA IGAGGACA 



GAUAACCA CUGAUGAG X CGAA IACAAAUU 



ATTTGTCC T GGTTATCG 



CGAUAACC CUGAUGAG X CGAA IGACAAAU 
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GTTATCGC T GGATGTGT 
GATGTGTC T GCGGCGTT 
GTTTTATC A TCTTCCTC 
TTATCATC T TCCTCTGC 
TCATCTTC C TCTGCATC 
CATCTTCC T CTGCATCC 
TCTTCCTC T GCATCCTG 
TCCTCTGC A TCCTGCTG 
TCTGCATC C TGCTGCTA 
CTGCATCC T GCTGCTAT 
CATCCTGC T GCTATGCC 
CCTGCTGC T ATGCCTCA 
TGCTATGC C TCATCTTC 
GCTATGCC T CATCTTCT 
TATGCCTC A TCTTCTTG 
GCCTCATC T TCTTGTTG 
TCATCTTC T TGTTGGTT 
GTTGGTTC T TCTGGACT 
GGTTCTTC T GGACTATC 
TTCTGGAC T ATCAAGGT 
GGACTATC A AGGTATGT 
TATGTTGC C CGTTTGTC 
ATGTTGCC C GTTTGTCC 
CGTTTGTC C TCTAATTC 
GTTTGTCC T CTAATTCC 
TTGTCCTC T AATTCCAG 
TCTAATTC C AGGATCAT 
CTAATTCC A GGATCATC 
CCAGGAT C A TCAACAAC 
GGATCATC A ACAACCAG 
TCATCAAC A ACCAGCAC 
~~ TCAAC AAC C AGCACCGG 
CAACAAC C A GCACCGGA 
CAACCAGC A CCGGACCA 
ACCAGCAC C GGACCATG 
CACCGGAC C ATGCAAAA 
ACCGGACC A TGCAAAAC 



TGCAAAAC C TGCACAAC 
GCAAAACC T GCACAACT 
AAACCT GC A CAACTCCT 
ACCTGCAC A ACTCCTGC 
TGCACAAC T CCTGCTCA 
CACAACTC C TGCTCAAG 
ACAACTCC T GCTCAAGG 
ACTCCTGC T CAAGGAAC 
TCCTGCTC A AGGAACCT 
CAAGGA AC C TCTATGTT 
AAGGAACC T CTATGTTT 
GGAACCTC T ATGTTTCC 
TATGTTTC C CTCATGTT 



ACACAUCC C UGAUGAG X CGAA ICGAUAAC 
AACGCCGC CUGAUGAG X CGAA IACACAUC 
GAGGAAGA CUGAUGAG X CGAA IAUAAAAC 
GCAGAGGA CUGAUGAG X CG AA IAUGAUAA 
GAUGCAGA CUGAUGAG X CG AA IAAGAUGA 
GGAUGCAG CUGAUGA G X CGAA IGAAGAUG 
CAGGAUGC CUG AUGAG X CGAA IAGGAAGA 
CAGCAGGA CUGAUG AG X CGAA ICAGAGGA 
UAGCAG CA CUGAUGAG X CGAA IAUGCAGA 
AUAGCAGC CUGAUG AG X CGAA IGAUGCAG 
GGCAUAGC CUGAUGAG X CGAA ICAGGAUG 
UGAGGCAU CUGAUGAG X CGAA ICAGCAGG 
GAAGAUGA CUGAUG AG X CGAA ICAUAGCA 
AGAAGAUG CUGAUGAG X CGAA IGCAUAGC 
CAAGAAGA C UGAUGAG X CGAA IAGGCAUA 
CAACAAGA CUGAUG AG X CGAA IAUGAGGC 
AACCAACA CUGAUG AG X CGAA IAAGAUGA 
AG UCCAGA CUGAUGAG X CGAA IAACCAAC 
GAUAGUCC CUGAU GAG X CGAA IAAGAACC 
ACCUUGAU CUGAUGA G X CGAA IUCCAGAA 
A CAUACCU CUGAUGAG X CGAA IAUAGUCC 
GACAA ACG CUGAUGAG X CGAA ICAACAUA 
GGA CAAAC CUGAUGAG X CGAA IGCAACAU 
GAAUUAGA CUGAUGAG X CGAA IACAAACG 
GGAAUUAG CUGAU GAG X CGAA IGACAAAC 
CUGGAAUU CUGAUGAG X CGAA IAGGACAA 
AUGAUCCU CUGAUGAG X CGAA I AAUU AGA 
GAUGAUCC CUGAUGAG X CGAA IGAAUUAG 
G UUGUUGA CUGAUGAG X CGAA IAUCCUGG 
CUGGUUGU CUGAUGAG X CGAA IAUGAUCC 
GUGCUGGU CUGAUGA G X CGAA IUUGAUGA~ 
CCGGUGCU CUGAUGAG X CGAA IUUGUUGA 
UCCGGUGC CUGAUGAG X CGAA IGUUGUUG~ 
UGGU CCGG CUGAUGAG X CGAA ICUGGUUG 
CAUGGU CC CUGAUGAG X CGAA IUGCUGGU 
UUUUGCAU CUGAUG AG X CGAA IUCCGGUG~ 
GUUUUGCA CUGAUGAG X CGAA IGUCCGGU~ 
GCAGGU UU CUGAUGAG X CGAA ICAUGGUC 
GUUGUGCA CUGAUGAG X CGAA IUUUUGCA 
AGUUGUGC CUG AUGAG X CGAA IGUUUUGC~ 
AGGAGUUG CUGAUGAG X C GAA ICAGGUUU~ 
GCAGGAGU CUGAUGAG X C GAA IUGCAGGU~ 
UGAGCAGG CUGAUG AG X CGAA IUUGUGCA~ 
CUUGA GCA CUGAUGAG X CGAA IAGUUGUG 
CCUUGAGC CUGAUGAG X CGAA 
GUUCCUUG CUGAUGAG X CGAA 
AGGUUCCU CUGAUGAG X CGAA IAGCAGGA 
AACAUAGA CUGAUGA G X CGAA IUUCCUUG~ 
AAACAUAG CUGAUGAG X CG AA IGUUCCUU~ 
GGA AACAU CUGAUGAG X CGAA IAGGUUCC 
AACAUGAG CUGAUGAG X CGAA IAAACAUA 



IGAGUUGU 
ICAGGAGU 
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Table 38 



ATGTTTCC C TCATGTTG 



CAACAUGA CUGAUGAG X CGAA IGAAACAU 



TGTTTCCC T CATGTTGC 



GCAACAUG CUGAUGAG X CGAA IGGAAACA 



TTTCCCTC A TGTTGCTG 



CAGCAACA CUGAUGAG X CGAA IAGGGAAA 



CATGTTGC T GTACAAAA 



UUUUGUAC CUGAUGAG X CGAA ICAACAUG 



TGCTGTAC A AAACCTAC 



GUAGGUUU CUGAUGAG X CGAA IUACAGCA 



TACAAAAC C TACGGACG 



CGUCCGUA CUGAUGAG X CGAA IUUUUGUA 



ACAAAACC T ACGGACGG 



CCGUCCGU CUGAUGAG X CGAA IGUUUUGU 



ACGGAAAC T GCACCTGT 



ACAGGUGC CUGAUGAG X CGAA IUUUCCGU 



GAAACTGC A CCTGTATT 



AAUACAGG CUGAUGAG X CGAA ICAGUUUC 



AACTGCAC C TGTATTCC 



GGAAUACA CUGAUGAG X CGAA IUGCAGUU 



ACTGCACC T GTATTCCC 



GGGAAUAC CUGAUGAG X CGAA IGUGCAGU 



CTGTATTC C CATCCCAT 



AUGGGAUG CUGAUGAG X CGAA IAAUACAG 



TGTATTCC C ATCCCATC 



GAUGGGAU CUGAUGAG X CGAA IGAAUACA 



GTATTCCC A TCCCATCA 



UGAUGGGA CUGAUGAG X CGAA IGGAAUAC 



TTCCCATC C CATCATCT 



AGAUGAUG CUGAUGAG X CGAA IAUGGGAA 



; ATCATCTT 
I TCATCTTG 



AAGAUGAU CUGAUGAG X CGAA IGAUGGGA 



CAAGAUGA CUGAUGAG X CGAA IGGAUGGG 



ATCCCATC A TCTTGGGC 



GCCCAAGA CUGAUGAG X CGAA IAUGGGAU 



CCATCATC T TGGGCTTT 



AAAGCCCA CUGAUGAG X CGAA IAUGAUGG 



TCTTGGGC T TTCGCAAA 



UUUGCGAA CUGAUGAG X CGAA ICCCAAGA 



GCTTTCGC A AAATACCT 



AGGUAUUU CUGAUGAG X CGAA ICGAAAGC 



CUCCCAUA CUGAUGAG X CGAA IUAUUUUG 



AAAATACC T ATGGGAGT 



ACUCCCAU CUGAUGAG X CGAA IGUAUUUU 



GAGTGGGC C TCAGTCCG 



CGGACUGA CUGAUGAG X CGAA ICCCACUC 



AGTGGGCC T CAGTCCGT 



ACGGACUG CUGAUGAG X CGAA IGCCCACU 



TGGGCCTC A GTCCGTTT 



AAACGGAC CUGAUGAG X CGAA IAGGCCCA 



CCTCAGTC C GTTTCTCT 



TCCGTTTC T CTTGGCTC 



AGAGAAAC CUGAUGAG X CG AA IACUGAGG 
i IAAACGGA 



GAGCCAAG CUGAUGAG X CGAA I 



S90 
698 
713 



CGTTTCTC T TGGCTCAG 



CUGAGCCA CUGAUGAG X CGAA IAGAAACG 



CTCTTGGC T CAGTTTAC 



GUAAACUG CUGAUGAG X CGAA ICCAAGAG 



CTTGGCTC A GTTTACTA 



UAGUAAAC CUGAUGAG X CGAA IAGCCAAG 



CAGTTTAC T AGTGCCAT 



AUGGCACU CUGAUGAG 



I CGAA IUAAACUG 



ACTAGTGC C ATTTGTTC 



GAACAAAU CUGAUGAG X CGAA ICACUAGU 



CTAGTGCC A TTTGTTCA 



ATTTGTTC A GTGGTTCG 



CGTAGGGC T TTCCCCCA 



GGGCTTTC C CCCACTGT 



GGCTTTCC C CCACTGTC 



GCTTTCCC C CACTGTCT 



CTTTCCCC C ACTGTCTG 



UGAACAAA CUGAUGAG X CGAA IGCACUAG 



CGAACCAC CUGAUGAG X CGAA IAACAAAU 



UGGGGGAA CUGAUGAG X CGAA ICCCUACG 



ACAGUGGG CUGAUGAG X CGAA IAAAGCCC 



GACAGUGG CUGAUGAG X CGAA 



AGACAGUG CUGAUGAG X CGAA 



CAGACAGU CUGAUGAG X CGAA 



IGAAAGCC 
IGGAAAGC 
IGGGAAAG 



TTTCCCCC A CTGTCTGG 



TCCCCCAC T GTCTGGCT 



i CUGAUGAG X CGAA IGGGGAAA 
: CUGAUGAG X CGAA IUGGGGGA 



CCACTGTC T GGCTTTCA 



UGAAAGCC CUGAUGAG X CGAA IACAGUGG 



TGTCTGGC T TTCAGTTA 



UAACUGAA CUGAUGAG X CGAA ICCAGACA 



TGGCTTTC A GTTATATG 



TTGGGGGC C AAGTCTGT 
TGGGGGCC A AGTCTGTA 
GCCAAGTC T GTACAACA 



, IAAAGCCA 
ACAGACUU CUGAUGAG X CGAA ICCCCCAA 



CAUAUAAC CUGAUGAG X CGAA I 



UACAGACU CUGAUGAG X CGAA IGCCCCCA 



UGUUGUAC CUGAUGAG X CGAA IACUUGGC 



GTCTGTAC A ACATCTTG 



CAAGAUGU CUGAUGAG X ( 



TGTACAAC A TCTTGAGT 



ACUCAAGA CUGAUGAG X CGAA IUUGUACA 



ACAACATC T TGAGTCCC 



GGGACUCA CUGAUGAG X CGAA IAUGUUGU 
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CTTGAGTC C CTTTATGC 
TTGAGT CC C TTTATGCC ~ 
TGAGTCCC T TTATGCCG 



GCAUAAAG CU GAUGAG X CGAA IACUCAAG 
GGCAUAAA CUGAUGAG X C GAA IGACUCAA~ 
CGGCAUAA CUGAUGAG X CGAA IGGACUCA 



CTTTATGC C GCTGTTAC 



GUAACAGC CUGAUGAG X CGAA ICAUAAAG 



TATGCCGC T GT TACCAA 
GCTGTTAC C AATTTTCT 
CTGTTACC A AT TTTCTT 
CAATTTTC T TTTGTCTT 



UUGGUAAC CUGAUGAG X CGAA ICGGCAUA 



CTTTTGTC T TTGGGTAT 



GGGTATAC A TTTAAACC 
ATTTAAAC C CTCACAAA 
TTTAAACC C TCACAAAA 
TTAAACCC T CACAAAAC 
AAACCCTC A CAAAACAA 
ACCCTCAC A AAACAAAA 
CACAAAAC A AAAAGATG 
GGATATTC C CTTAACTT 
GATAT TCC C TTAACTTC 
ATATTCCC T TAACTTCA 
CCCTTAAC T TCATGGGA 
TTAACTTC A TGGGATAT 
GTTGGGGC A CATTGCCA 
TGGGGC AC A TTGCCACA 
CACATTGC C ACAGGAAC 
ACATTGC C A CAGGAACA 
ATTGCCAC A GGAACATA 
ACAGGAAC A TATTGTAC 
TATTGTAC A AAAAATCA 
AAAAAAT C A AAATGTGT 
TAGGAAAC T TCCTGTAA 
GAAACTT C C TGTAAACA 
AAACTTCC T GT AAACAG 
CTGTAAAC A GGCCTATT 
AAACAGGC C TATTGATT 
AACAGGCC T ATTGATTG 
AGTATGTC A ACGAATTG 
TGTGGGTC T TTTGGGGT 
GGGTTTGC C GCCCCTTT 
TTTGCCGC C CCTTTCAC 
TTGCCGCC C CT TTCACG 
TGCCGCCC C TTTCACGC 
GCCGCCCC T TTCACGCA 
CCCCTTTC A CGCAATGT 
TTTCACGC A ATGTGGAT 
GGATATTC T GCTTTAAT 
TATTCTGC T TTAATGCC 
TTTAATGC C TTTATATG 
TTAATGCC T TTATATGC 
TTATAT GC A TGCATACA 
ATGCA TGC A TACAAGCA 
ATGCATAC A AGCAAAAC 



AGAAAAUU CUGAUGAG X CGAA IUAACAGC 



AAGAAAAU CUGAUGAG X CGAA IGUAACAG 



AAGACAAA CUGAUGAG X CGAA IAAAAUUG 



AUACCCAA CUGAUGAG X CGAA IACAAAAG 



GGUUUAAA CUGAUGAG X CGAA IUAUACCC 
UUUGUGAG CUGAUGAG X CGAA IUUUAAAU 
UUUUGUGA CUGAUGAG X CGAA IGUUUAAA 
GUUUUGUG CUGAU GAG X CGAA IGGUUUAA 
UUGUUUUG CUGAUGAG X CGAA IAGGGUUU 
UUUUGUUU CUGAUGAG X CG AA IUGAGGGU 
CAUCUUUU CUGAUGAG X CGAA IUUUUGUG 
A AGUUAAG CUGAUGAG X CGAA IAAUAUCC 
GA AGUUAA CUGAUGAG X CGAA IGAAUAUC 
UGAAGUUA CUG AUGAG X CGAA IGGAAUAU 
UCCCAUGA CUGAUGAG X CGAA IUUAAGGG 
AUAUCCCA CUGAUGAG X CGAA IAAGUUAA 
UGGC AAUG CUGAUGAG X CGAA ICCCCAAC 
UGUGGCAA CUGAU GAG X CGAA IUGCCCCA 
GUUCCUGU CUGAUGAG X CGAA ICAAUGUG 
UGUUCCUG CUG AUGAG X CGAA IGCAAUGU 
UAUGUUCC CUGAU GAG X CGAA IUGGCAAU 
GUACAAUA CUGAUGAG X CGAA IUUCCUGU 
UGAUUUUU CUGAUGAG X C GAA IUACAAUA 
ACACAUUU CUG AUGAG X CGAA IAUUUUUU 
UUACA GGA CUGAUGAG X CGAA IUUUCCUA 
UGUU UACA CUGAUGAG X CGAA IAAGUUUC 
CUGUUUAC CUGAUGAG X CGAA IGAAGUUU~ 
AAUAGGCC CUGAUGAG X CGAA IUUUACAG~ 
AAUCAAUA CUGAUGAG X CGAA ICCUGUUU~ 
CAAUCAAU CUGAUG AG X CGAA IGCCUGUU 
CAAUUCGU CUGAU GAG X CGAA IACAUACU 
ACCCCAAA CUGAUGAG X CGAA IACCCACA~ 
AAAGGGGC CUGAUGAG X CGAA ICAAACCC~ 
GUGAAAGG CUGAUGAG X CGAA ICGGCAAA 
CGUGAAAG CUGAUG AG X CGAA IGCGGCAA 
GCGUGAAA CUGAUGAG X CGAA IGGCGGCA 
UG CGUGAA CUGAUGAG X CGAA IGGGCGGC 
ACAUUGCG CUGAUGAG X CGAA IAAAGGGG 
AUCCACAU CUGAUGAG X CGAA ICGUGAAA 
AUUAAAGC CUGAUGAG X CGAA IAAUAUCC 
GGCAUUAA CUGAUGAG X CGAA ICAGAAUA 
CAUAUAAA CUGAUGAG X CGAA ICAUUAAA~ 
GCAUAUAA CUGAUG AG X CGAA IGCAUUAA~ 
UGUAUGCA CUGAUGAG X CGAA ICAUAUAA 
UG CUUGUA CUGAUGAG X CGAA ICAUGCAU 
GUUUUGCU CUGAUGAG X CGAA IUAUGCAU~ 
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ATACAAGC A AAACAGGC 
AGCAAA AC A GGCTTTTA 
AAACAGGC T TTTACTTT 



GCCUGUUU C UGAUGAG X CGAA ICUUGUAU 
UAAAAGCC CUGAUG AG X CGAA IUUUUGCU~ 
AAAGUAAA CUGAUGAG X CGAA ICCUGUUU 



GCTTTTAC T TTCTCGCC 



GGCGAGAA CUGAUGAG X CGAA IUAAAAG~C~ 



AGUUGGCG CUGAUGAG X CGAA IAAAGUAA 



TTACTTTC T CGCCAACT 



TTTCTCGC C AACTTACA 
TTCTCGCC A ACTTACAA 



UGUAAGUU CUGAUGAG X CGAA ICGAGAAA 



UUGUAAGU CUGAUGAG X CGAA IGCGAGAA 



TCGCCAAC T TACAAGGC 



GCCUUGUA CUGAUGAG X CGAA IUUGGCGA 



CAACTTAC A AGGCCTTT 



AAAGGCCU CUGAUGAG X CGAA IUAAGUUG 



1119 
1123 



TACAAGGC C TTTCTAAG 



CUUAGAAA CUGAUGAG X CGAA ICCUUGUA 



ACAAGGCC T TTCTAAGT 



ACUUAGAA CUGAUGAG X CGAA IGCCUUGU 



GGCCTTTC T AAGTAAAC 
AAGTAAAC A GTATGTGA 
ATGT GAAC C TTTACCCC 
TGTGAACC T TTACCCCG 
ACCTTTAC C CC GTTGCT 
CCTTTACC C CGTTGCTC 
CTTTACC C C GTTGCTCG 
CCCGTTGC TCGGCAACG 
TGCTCGGC A ACGGCCTG 
GCAACGGC C TGGTCTAT 
CAACGGCC T GGTCTATG 
GCCTGGTC T ATGCCAAG 
GTCTATGC C AAGTGTTT 
TCTATGCC A AGTGTTTG 
GTGTTTGC T GACGCAAC 
GCTGACGC A ACCCCCAC 
GACG CAAC C CCCACTGG 
ACGCAACC C CCACTGGT 
CGCAACCC C CACTGGTT 
GCAACCCC C ACTGGTTG 
CAACCCCC A CTGGTTGG 
ACCCCCAC T GGTTGGGG 
GTTGGGGC T TGGCCATA 
GGCTTGGC C ATAGGCCA 
GCTTGGC C A TAGGCCAT 
CCATA GGC C ATCAGCGC 
CATAGGCC A TCAGCGCA 
AGGCCATC A G CGCATGC 
ATCAGCG C A TGCGTGGA 
CGTGGAA C C TTTGTGTC 
GTGGAACC T TTGTGTCT 
TTTGTGTC T CCTCTGCC 
TGTGTCTC C TCTGCCGA 
GTGTCTCC T CTGCCGAT 
GTCTCCTC T GCC GATCC 
TCCTCTGC C GATCCATA 
T GCCGATC C ATACCGCG 
GCCGATC C A TACCGCGG 
ATCCATAC C GCGGAACT 
CGCGGAAC T CCTAGCCG 



GUUUACUU CUGAUGAG X CGAA IAAAGGCC 



10Q1 
1002 



UCACAUAC CUGAUGAG X CGAA IUUUACUU 



GGGGUAAA CUGAUGA G X CGAA IUUCACAU 
CGGGGUAA CUGAUGAG X CGAA IGUUCACA 
AGCAACGG CUGAUGAG X CGAA IUAAAGGU 
GAGCAACG CUGAU GAG X CGAA IGUAAAGG 
CG AGCAAC CUGAUGAG X CGAA IGGUAAAG 
CG UUGCCG CUGAUGAG X CGAA I CAACGGG 
CAGGC CGU CUGAUGAG X CGAA ICCGAGCA 
AUAGACCA CUGAUGA G X CGAA ICCGUUGC 
CAUA GACC CUGAUGAG X CGAA IGCCGUUG 
CUU GGCAU CUGAUGAG X CGAA IACCAGGC 
AAACACUU CUGAUGA G X CGAA ICAUAGAC 
CAAACACU CUGAUGAG X CGAA IGCAUAGA 
GUUGCGUC CUGAUGA G X CGAA ICAAACAC 
GUGGGGGU CUGAUGAG X CGAA ICGUCAGC~ 
CCAGUGGG CUGAUGA G X CGAA IUUGCGUC~ 
ACC AGUGG CUGAUGAG X CGAA IGUUGCGU 
AACC AGUG CUGAUGAG X CGAA IGGUUGCG 
CAA CCAGU CUGAUGAG X CGAA IGGGUUGC 
CCAACCAG CUG AUGAG X CGAA IGGGGUUG 
CCCCAACC CUGAUGAG X CGA A IUGGGGGU ~ 
ICCCCAAC 
ICCAAGCC 



UAUGGCCA CUGAUGAG X CGAA 
UGGCCUAU CUGAUGAG X CGAA 



AUGGCCUA CUG AUGAG X CGAA IGCCAAGC 
GCGCUGAU CUGAUGAG X CGAA ICCUAUGG~ 



UGCGCUGA CUGAUGAG X CGAA IGC CUAUG 
GCAUGCGC CUGAUGAG X CGAA ] 



UCCACGCA CUGAUGAG X CGAA 



IAUGGCCU 
ICGCUGAU 



GACACAAA CUGAUGA G X CGAA IUUCCACG 
AGACACAA CUGAUGAG X CGAA IGUUCCAC 
GGCAGAGG CUGAUGAG X CGAA IACACAAA 
UCGGCAGA CUGAUGA G X CGAA IAGACACA 
AUCGGCAG CUGAUGAG X CG AA IGAGACAC 
GGAUCGGC CUGAUGA G X CGAA IAGGAGAC 
UAUGGAUC CUGAU GAG X CGAA ICAGAGGA 
CGCGGUAU CUGA UGAG X CGAA IAUCGGC~A~ 
CCGCGGUA CUGAUGAG X CGAA IGAUCGG~C~ 
AGUUCCGC CUGAUGAG X CGAA IUAUGGAU 
CGGCUAGG CUGAUGAG X CGAA IUUCCGCG 
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521 



CGGAACTC C TAGCCGCT 
GGAACTC C T AGCCGCTT 
CTCCTAGC C GCTTGTTT 
CTAG CCGC T TGTTTTGC 
TGTTTTGC T CGCAGCAG 
TTGCTCGC A GCAGGTCT 
CTCGCAGC A GGTCTGGG 



AGCGGCUA CUGAUGAG X CGA A IAGUUCCG 
AAGCGGCU CUGAUGAG X CGA A IGAGUUCCT" 
AAACAAGC CUGAUGAG X CGA A ICUAGGA~ 
GCAAA ACA CUGAUGAG X CGAA ICGGCUAG 
CUGC UGCG CUGAUGAG X CGAA ICAAAACA 
AGAC CUGC CUGAUGAG X CGAA ICGAGCAA 
CCCAGACC CUGAUGAG X CGAA ICUGCGAG 



AGCAGGTC T GGGGCAAA 



UUUGCCCC CUGAUGAG X CGAA IACCUGCU 



TCTGGGGC A AAACTCAT 



AUGAGUUU CUGAUGAG X CGAA ICCCCAGA 



GGCAAAAC T CATCGGGA 



UCCCGAUG CUGAUGAG X CGAA IUUUUGCC 



CAAAACTC A TCGGGACT 



AGUCCCGA CUGAUGAG X CGAA IAGUUUUG 



ATCGGGAC T GACAATTC 



GAAUUGUC CUGAUGAG X CGAA IUCCCGAU 



GGACTGAC A ATTCTGTC 



GACAGAAU CUGAUGAG X CGAA IUCAGUCC 



AGCACGAC CUGAUGAG X CGAA IAAUUGUC 



: T GTCGTGCT 
: T CTCCCGCA 



UGCGGGAG CUGAUGAG X CGAA ICACGACA 



TCGTGCTC T CCCGCAAA 



UUUGCGGG CUGAUGAG X CGAA IAGCACGA 



UAUUUGCG CUGAUGAG X CGAA IAGAGCAC 



GTGCTCTC C CGCAAATA 



TGCTCTCC C GCAAATAT 



AUAUUUGC CUGAUGAG X CGAA IGAGAGCA 



UGUAUAUU CUGAUGAG X CGAA I CGGGAGA 



TCTCCCGC A AATATACA 
AAATATAC A TCATTTCC 



GGAAAUGA CUGAUGAG X CGAA IUAUAUUU 



CAUGGAAA CUGAUGAG X CGAA IAUGUAUA 



T AT AC AT C A TTTCCATG 



ATCATTTC C ATGGCTGC 



GCAGCCAU CUGAUGAG X CGAA IAAAUGAU 



TCATTTCC A TGGCTGCT 



k CUGAUGAG X CGAA IGAAAUGA 



TCCATGGC T GCTAGGCT 



AGCCUAGC CUGAUGAG X CGAA ICCAUGGA 



CACAGCCU CUGAUGAG X CGAA ICAGCCAU 



ATGGCTGC T AGGCTGTG 
TGCTAGGC T GTGCTGCC 



GGCAGCAC CUGAUGAG X CGAA ICCUAGCA 



GGCTGTGC T GCCAACTG 



CAGUUGGC CUGAUGAG X CGAA ICACAGCC 



TGTGCTGC C AACTGGAT 



AUCCAGUU CUGAUGAG X CGAA ICAGCACA 



GTGCTGCC A ACTGGATC 



GAUCCAGU CUGAUGAG X CGAA IGCAGCAC 



UAGGAUCC CUGAUGAG X CGAA IUUGGCAG 



CTGCCAAC T GGATCCTA 



CCCGCGUA CUGAUGAG X CGAA IAUCCAGU 



ACTGGATC C TACGCGGG 



CTGGATCC T ACGCGGGA 
GGGACGTC C TTTGTTTA 



UCCCGCGU CUGAUGAG X CGAA IGAUCCAG 



UAAACAAA CUGAUGAG X CGAA IACGUCCC 



GUAAACAA CUGAUGAG X CGAA IGACGUCC 



GGACGTCC T TTGTTTAC 



CGCCGACG CUGAUGAG X CGAA IACGUAAA 



TTTACGTC C CGTCGGCG 



GCGCCGAC CUGAUGAG X CGAA IGACGUAA 



TTACGTCC C GTCGGCGC 



CGGGAUUC CUGAUGAG X CGAA ICGCCGAC 



GTCGGCGC T GAATCCCG 



GCTGAATC C CGCGGACG 



CGUCCGCG CUGAUGAG X CGAA IAUUCAGC 



CTGAATCC C GCGGACGA 
CGGACGAC C CCTCCCGG 



1042 
1043 



UCGUCCGC CUGAUGAG X CGAA IGAUUCAG 



CCGGGAGG CUGAUGAG X CGAA IUCGUCCG 



GGACGA CC C CTCCCGGG 
GACGAC CC C TCCCGGGG 
ACGACCCC T CCCGGGGC 



CCCGGGAG CUGAUGAG X CGAA IGUCGUCC~ 



CCCCGGGA CUGAUGAG X CGAA IGGUCGUC 



GCCCCGGG CUGAUGAG X CGAA IGGGUCGU 



GACCCCTC C CGGGGCCG 



CGGCCCCG CUGAUGAG X CGAA IAGGGGUC 



ACCCCTCC C GGGGCCGC 



GCGGCCCC CUGAUGAG X CGAA IGAGGGGU 



CCCGGGGC C GCTTGGGG 



CCCCAAGC CUGAUGAG X CGAA ICCCCGGG 



GGGGCCGC T TGGGGCTC 



CTTGGGGC T CTACCGCC 
TGGGGCTC T ACCGCCCG 



GAGCCCCA 
GGCGGUAG 



CUGAUGAG X CGAA ICGGCCCC 



CUGAUGAG X CGAA ICCCCAAG 



CGGGCGGU 



CUGAUGAG X CGAA IAGCCCCA 



GGCTCTAC C GCCCGCTT 
TCTACCGC C CGCTTCTC 



AAGCGGGC CUGAUGAG X CGAA IUAGAGCC 



GAGAAGCG CUGAUGAG X CGAA I CGGUAGA 
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CTACCG CC C GCTTCTCC 
CCGCCCGC T TCTCCGCC 
CCCGCTTC T CCGCCTAT 
CGCTTCTC C GCCTATTG 
TTCTCCGC C TATTGTAC 
TCTCCGCC T ATTGTACC 
TATTGTAC C GACCGTCC 
GTACCGAC C GTCCACGG 
CGACCGTC C ACGGGGCG 
GACCGTCC A CGGGGCGC 
CGGGGCGC A CCTCTCTT 
GGGCGCAC C TCTCTTTA 
GGCGCAC C T CTCTTTAC 
CGCACCTC T CTTTACGC 
CACCTCTC T TTACGCGG 
ACGCGGAC T~CCCCGTCT 
GCGGACTC C CCGTCTGT 
CG GACTCC C CGTCTGTG 
GGACTCCC C GTCTGTGC 
TCCCCGTC T GTGCCTTC 
GTCTGTGC C TTCTCATC 
TCTGTGCC T TCTCATCT 
GTGCCTTC T CATCTGCC 
GCCTTCTC A TCTGCCGG 
TTCTCATC T GCCGGACC 
TCATCTGC C GGACCGTG 
TGCCGGAC C GTGTGCAC 
CCGTGTGC A CTTCGCTT 
GTGT GCAC T TCGCTTCA 
CACTTCGC T TCACCTCT 
TTCGCTTC A CCTCTGCA 
CGCTTCAC C TCTGCACG 
GCTTCACC T CTGCACGT 
TTCACCTC T GCACGTCG 
ACCTCTG C A CGTCGCAT 
CACGTCGC A TGGAGACC 
ATGGAGAC C ACCGTGAA 
TGGAGACC A CCGTGAAC 
GAGACCAC C GTGAACGC 
GTGAACGC C CACAGGAA 
TGAACGCC C ACAGGAAC 
GAACGCCC A CAGGAACC 
ACGCCCAC A GGAACCTG 
ACAGGAAC C TGCCCAAG 
CAGGAACC T GCCCAAGG 
GAACCTGC C CAAGGTCT 
AACCTGC C C AAGGTCTT 
ACCTGCCC A AGGTCTTG 
CCAAGGTC T TGCATAAG 
GGTCTTGC A TAAGAGGA 
AAGAGGAC T CTTGGACT 



GGAGAAGC CUGAUGAG X CGAA IGCGGUAG 
GGCGGAGA CUGAUGAG X CGAA ICGGGCGG 
AUAGGCGG CUGAUGAG X CGAA IAAGCGGG 
CAAUAGGC CUGAUGAG X CGAA IAGAAGCG 
GUACAAUA CU GAUGAG X CGAA ICGGAGAA 
GGUACAAU CUGAUGAG X CGAA IGCGGAGA 
GGACGGUC CUGAUGAG X CGAA IUACAAUA 
CCGUGGAC CUGAUGAG X CGAA IUCGGUAC 
CGCCCCGU CUGAUGAG X CGAA IACGGUCG 
GCGCCCCG CUGAUGAG X CGAA IGACGGUC 
AAGAGAGG CUGAUGAG X CGAA ICGCCCCG 
UAAAGAGA CUGAUGAG X CGAA IUGCGCCC 
GUAAAGAG' CUG AUGAG X CGAA IGUGCGCC 
GCGUAAAG CUGAUGAG X CGAA IAGGUGCG 
CCGCGUAA CUGAUGAG X CGAA IAGAGGUG 
AGACGGGG CUGAUGAG X CGAA IUCCGCGU 
ACAGACGG CUGAUGAG X CGAA IAGUCCGC 
CACAGACG CUGAUGAG X CGAA IGAGUCCG 
GCACAGAC CU GAUGAG X CGAA IGGAGUCC 
GAAGGCAC CUGAUGAG X CGAA IACGGGGA 
GAUGAGAA CUGAUGAG X CGAA ICACAGAC 
AGAUGAGA CUGAUGAG X CGAA I GCACAGA 
GGCAGAUG CUGAUGAG X CGAA IAAGGCAC 
CCGGCAGA CUGAUGAG X CGAA IAGAAGGC 
GGUCCGGC CUGA UGAG X CGAA IAUGAGAA 
CACGGUCC CUGAUGAG X CGAA I CAGAUGA 
GUGCACAC CUGAUGAG X CGAA IUCCGGCA 
AAGCGAAG CUGAUGAG X CGAA ICACACGG 
UGAAGCGA CUGAUGAG X CGAA IUGCACAC 
AGAGGUGA CUGAUG AG X CGAA ICGAAGUG 
UGCAGAGG C UGAUGAG X CGAA IAAGCGAA 
CGUGCAGA CUGAUGAG X CGAA IUGAAGCG 
ACGUGCAG CUGAUGAG X CGAA IGUGAAGC 
CGACGUGC CUGAUGAG X CGAA IAGGUGAA 
A UGCGACG CUGAUGAG X CGAA ICAGAGGU 
GGUCUCCA CUGAUGAG X C GAA ICGACGUG 
UUCACGGU CUGAUGAG X CGAA IUCUCCAU 
GUUCACGG CUGAUGAG X CGAA IGUCUCCA 
GCGUUCAC CUGAUGAG X CGAA IUGGUCUC 
UUCCUGUG CUGAUGAG X CGAA ICGUUCAC 
GUUCCUGU CUGAUGAG X CGAA IGCGUUCA 
GGUUC CUG CUGAUGAG X CGAA IGGCGUUC ~ 
CAGGUUCC CUGAUGAG X CGAA IUGGGCGU 
CUUGGGCA CUGAUGAG X CGAA IUUCCUGU 
CCUUGGGC CUGAUGAG X CGAA IGUUCCUG 
AGACCUUG CUGAUGAG X CGAA ICAGGUUC 
AAGACCUU CUGAUGAG X CGAA IGCAGGUU 
CAAGACCU CUGAUGAG X CGAA IGGCAGGU ~ 

X CGAA IACCUUGG 
UCCUC UUA CUGAUGAG X CGAA ICAAGACC 
AGUCCAAG CUGAUGAG X CGAA IUCCUCUU 
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Table 38 



1668 


GAGGACTC T TGGACTTT ■ 


1106 


AAAGUCCA CUGAUGAG X CGAA IAGUCCUC 


3615 


1674 


TCTTGGAC T TTCAGCAA 


1107 


UUGCUGAA CUGAUGAG X CGAA IUCCAAGA 


3616 


1678 


GGACTTTC A GCAATGTC 


1108 


GACAUUGC CUGAUGAG X CGAA IAAAGUCC 




1681 


CTTTCAGC A ATGTCAAC 


1109 


GUUGACAU CUGAUGAG X CGAA ICUGAAAG 




1687 


GCAATGTC A ACGACCGA 


1110 


UCGGUCGU CUGAUGAG X CGAA IACAUUGC 


3619 


1693 


TCAACGAC C GACCTTGA 


1111 


UCAAGGUC CUGAUGAG X CGAA IUCGUUGA 


3620 


1697 


CGACCGAC C TTGAGGCA 


1112 


UGCCUCAA CUGAUGAG X CGAA IUCGGUCG 


3621 


1698 


GACCGACC T TGAGGCAT 


1113 


AUGCCUCA CUGAUGAG X CGAA IGUCGGUC 


3622 


1705 


CTTGAGGC A TACTTCAA 


1114 


UUGAAGUA CUGAUGAG X CGAA ICCUCAAG 


3623 


1709 


AGGCATAC T TCAAAGAC 


1115 


GUCUUUGA CUGAUGAG X CGAA IUAUGCCU 


3S24 


1712 


CATACTTC A AAGACTGT 


1116 


ACAGUCUU CUGAUGAG X CGAA IAAGUAUG 


3525 


1718 


TCAAAGAC T GTGTGTTT 


1117 


AAACACAC CUGAUGAG X CGAA IUCUUUGA 


3626 


1769 


TAAAGGTC T TTGTACTA 


1118 


UAGUACAA CUGAUGAG X CGAA IACCUUUA 


3627 


1776 


CTTTGTAC T AGGAGGCT 


1119 


AGCCUCCU CUGAUGAG X CGAA IUACAAAG 




1784 


TAGGAGGC T GTAGGCAT 


1120 


AUGCCUAC CUGAUGAG X CGAA ICCUCCUA 


36 . 29 


1791 


CTGTAGGC A TAAATTGG 


1121 


CCAAUUUA CUGAUGAG X CGAA ICCUACAG 


3630 


1807 


GTGTGTTC A CCAGCACC 


1122 


GGUGCUGG CUGAUGAG X CGAA IAACACAC 


3631 


1809 


GTGTTCAC C AGCACCAT 


1123 


AUGGUGCU CUGAUGAG X CGAA IUGAACAC 


3632 


1810 


TGTTCACC A GCACCATG 


1124 


CAUGGUGC CUGAUGAG X CGAA IGUGAACA 


3633 


1813 


TCACCAGC A CCATGCAA 


1125 


UUGCAUGG CUGAUGAG X CGAA ICUGGUGA 


3634 


1815 


ACCAGCAC C ATGCAACT 


1126 


AGUUGCAU CUGAUGAG X CGAA IUGCUGGU 


3635 


1816 


CCAGCACC A TGCAACTT 


1127 


AAGUUGCA CUGAUGAG X CGAA IGUGCUGG 


... 3636 


1820 


CACCATGC A ACTTTTTC 


1128 


GAAAAAGU CUGAUGAG X CGAA ICAUGGUG 


3637 


1823 


CATGCAAC T TTTTCACC 


1129 


GGUGAAAA CUGAUGAG X CGAA IUUGCAUG 


3638 


1829 


ACTTTTTC A CCTCTGCC 


1130 


GGCAGAGG CUGAUGAG X CGAA IAAAAAGU 


3639 


1831 


TTTTTCAC C TCTGCCTA 


1131 


UAGGCAGA CUGAUGAG X CGAA IUGAAAAA 


3640 


1832 


TTTTCACC T CTGCCTAA 


1132 


UUAGGCAG CUGAUGAG X CGAA IGUGAAAA 


3641 


1834 


TTCACCTC T GCCTAATC 


1133 


GAUUAGGC CUGAUGAG X CGAA IAGGUGAA 


3642 


1837 


ACCTCTGC C TAATCATC 


1134 


GAUGAUUA CUGAUGAG X CGAA I CAGAGGU 


3643 


1838 


CCTCTGCC T AATCATCT 


1135 


AGAUGAUU CUGAUGAG X CGAA IGCAGAGG 


3644 


1843 


GCCTAATC A TCTCATGT 


1136 


ACAUGAGA CUGAUGAG X CGAA IAUUAGGC 


3645 


1846 


TAATCATC T CATGTTCA 


1137 


UGAACAUG CUGAUGAG X CGAA IAUGAUUA 


3645 


1848 


ATCATCTC A TGTTCATG 


1138 


CAUGAACA CUGAUGAG X CGAA IAGAUGAU 


3647 


1854 


TCATGTTC A TGTCCTAC 


1139 


GUAGGACA CUGAUGAG X CGAA IAACAUGA 


3648 


1859 


TTCATGTC C TACTGTTC 


1140 


GAACAGUA CUGAUGAG X CGAA IACAUGAA 


3649 


1860 


TCATGTCC T ACTGTTCA 


1141 


UGAACAGU CUGAUGAG X CGAA IGACAUGA 


., 365 -° 


1863 


TGTCCTAC T GTTCAAGC 


1142 


GCUUGAAC CUGAUGAG X CGAA IUAGGACA 


. 3651 


1868 


TACTGTTC A AGCCTCCA 


1143 


UGGAGGCU CUGAUGAG X CGAA IAACAGUA 


3652 


1872 


GTTCAAGC C TCCAAGCT 


1144 


AGCUUGGA CUGAUGAG X CGAA ICUUGAAC 


3653 


1873 


TTCAAGCC T CCAAGCTG 


1145 


CAGCUUGG CUGAUGAG X CGAA IGCUUGAA 


3654 


1875 


CAAGCCTC C AAGCTGTG 


1146 


CACAGCUU CUGAUGAG X CGAA IAGGCUUG 


3655 


1876 


AAGCCTCC A AGCTGTGC 


1147 


GCACAGCU CUGAUGAG X CGAA IGAGGCUU 


3656 


1880 


CTCCAAGC T GTGCCTTG 


1148 


CAAGGCAC CUGAUGAG X CGAA ICUUGGAG 






AGCTGTGC C TTGGGTGG 




CCACCCAA CUGAUGAG X CGAA ICACAGCU 


3658 


1886 


GCTGTGCC T TGGGTGGC 


1150 


GCCACCCA CUGAUGAG X CGAA IGCACAGC 


3659 


1895 


TGGGTGGC T TTGGGGCA 


1151 


UGCCCCAA CUGAUGAG X CGAA ICCACCCA 


3660 


1903 


TTTGGGGC A TGGACATT 


1152 


AAUGUCCA CUGAUGAG X CGAA ICCCCAAA 


3661 


1909 


GCATGGAC A TTGACCCG 


1153 


CGGGUCAA CUGAUGAG X CGAA IUCCAUGC 


3662 


1915 


ACATTGAC C CGTATAAA 


1154 


UUUAUACG CUGAUGAG X CGAA IUCAAUGU 


3663 


1916 


CATTGACC C GTATAAAG 


1155 


CUUUAUAC CUGAUGAG X CGAA IGUCAAUG 


3664 


1935 


TTTGGAGC T TCTGTGGA 


1156 


UCCACAGA CUGAUGAG X CGAA ICUCCAAA 


3665 
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1953 
1961 



2072 
2076 



GGAGCTTC T G TGGAGTT 
GGAGTTAC T CTCTTTTT 
AGTTACTC T CTTTTTTG 
TTACTCTC T TTTTTGCC 
TTTTTTGC C TTCTGACT 
TTTTTGCC T TCTGACTT 
TTGCCTTC T GACTTCTT 
CTTCTGA C T TCTTTCCT 
CTGACTTC T TTCCTTCT 
CTTCTTTC C TTCTATTC 
TTCTTTCC T TCTATTCG 
TTTCCTTC T ATTCGAGA 
TCGAGAT C T CCTCGACA 
GAGATCTC C TCGACACC 
AGATCTCC T CGACACCG 
TCCTCGA C A CCGCCTCT 
CTCGACAC C GCCTCTGC 
GACACCG C C TCTGCTCT 
ACACCGCC T CTGCTCTG 
ACCGCCTC T GCTCTGTA 
GCCTCTGC T CTGTATCG 
CTCTGCTC T GTATCGGG 
CGGGGGGC C TTAGAGTC 
GGGGGGCC T TAGAGTCT 
TTAGAGTC T CCGGAACA 
AGAGTCTC C GGAACATT 
TCCGGAAC A TTGTTCAC 
CATTGTTC A CCTCACCA 
TTGTTCAC C TCACCATA 
TGTTCACC T CACCATAC 
TTCACCTC A CCATACGG 
CACCTCAC C ATACGGCA 
ACCTCACC A TACGGCAC 
CATACGGC A CTCAGGCA 
TACGGCAC T CAGGCAAG 
CGGCACTC A GGCAAGCT 
ACTCAG GC A AGCTATTC 
AGGCAAGC T ATTCTGTG 
AGCTATTC T GTGTTGGG 
GATGAATC T AGCCACCT 



1162 
1163 



1169 
1170 



1173 
1174 



AACUCCAC CUGA UGAG X CGAA IAAGCUCC 
AAAAAGAG CUGA UGAG X CGAA IUAACUCC 
CAAAAAAG CUGAUGAG X CGAA IAGUAACU 
GGCAAAAA CUGA UGAG X CGAA IAGAGUAA 
AGU CAGAA CUGAUGAG X CGAA ICAAAAAA 
AAGUCAGA CUGAUGAG X CGAA IGCAAAAA 
AAGAAGUC CUGA UGAG X CGAA IAAGGCAA 
AGGAAAGA CUGAUGAG X CGAA IUCAGAAG 
AGAAG GAA CUGAUGAG X CGAA IAAGUCAG 
GAAUAGAA CUGAUGAG X CGAA IAAAGAAG 
CGAAUAGA CUGA UGAG X CGAA IGAAAGAA 
UCUCGAAU CUGAUG AG X CGAA IAAGGAAA 
UGU CGAGG CUGAUGAG X CGAA IAUCUCGA 
GGUGUCGA CUGAUGAG X CGAA IAGAUCUC~" 
CGGUGUCG CUGA UGAG X CGAA IGAGAUCU~ 
AGAGGCGG CUGAU GAG X CGAA I UCGAGGA 
GCAGAGGC CUGAUGAG X CGAA IUGUCGAG~ 
AGAGCAG A CUGAUGAG X CGAA ICGGUGUC 
C AGAGCAG CUGAUGAG X CGAA IGCGGUGU 
UACA GAGC CUGAUGAG X CGAA IAGGCGGU 
CGA UACAG CUGAUGAG X CGAA I CAGAGGC 
CCCGAU AC CUGAUGAG X CGAA IAGCAGAG 
GACUCUAA CUGAUGAG X CGAA ICCCCCCG~ 
AGACUCUA CUGAUGA G X CGAA IGCCCCCC~ 
UGUUCCGG CUGAUGAG X CGAA IACUCUAA~ 
AAUGUUCC CUGAU GAG X CGAA IAGACUCU ~ 
GUGAACAA CUGAUGAG X CGAA IUUCCGGA 



5 CUGAUGAG X 
i CUGAUGAG X 



CGAA IAACAAUG 
CGAA IUGAACAA 



GUAUGGUG CUGAU GAG X CGAA IGUGAACA 
CC GUAUGG CUGAUGAG X CGAA IAGGUGAA 
UGCCG UAU CUGAUGAG X CGAA IUGAGGUG 
GUGCCGUA CUGAUGAG X CGAA IGUGAGGU~~ 
UGCC UGAG CUGAUGAG X CGAA ICCGUAUG 
CUUGCCUG CUGAUGAG X CGAA IUGCCGU~ 
AGCUUGCC CUGAUGAG X CGAA IAGUGCCG~ 
GAAUAGCU CUGAUGAG X CGAA ICCUGAGU 
CACA GAAU CUGAUGAG X CGAA ICUUGCCU 
CCCAACAC CUGAUGAG X CGAA IAAUAGCU 
AGGUGGCU CUGAUGAG X CGAA IAUUCAUC 



AATCTAGC C ACCTGGGT 



T CUGAUGAG X CGAA ICUAGAUU 



ATCTAGCC A CCTGGGTG 



CACCCAGG CUGAUGAG X CGAA IGCUAGAU~ 



CTAGCCAC C TGGGTGGG 
TAGCCACC T GGGTGGGA 



CCCACCCA CUGAUGAG X CGAA IUGGCUAG~ 



: CUGAUGAG X CGAA IGUGGCUA 



GGAAGATC C AGCATCCA 
GAAGATCC A GCATCCAG 



UGGAUGCU CUGAUGAG X CGAA IAUCUUCCT" 



: CUGAUGAG X CGAA IGAUCUUC 



GATCCAGC A TCCAGGGA 



CCAGCATC C AGGGAATT 



UCCCUGGA 
AAUUCCCU 



< CGAA ICUGGAUC 



CUGAUGAG X CGAA IAUGCUGG 



CAGCATCC A GGGAATTA 
TAGTAGTC A GCTATGTC 



: CUGAUGAG X CGAA IGAUGCUG 



TAGTCAGC T ATGTCAAC 



GACAUAGC 
GUUGACAU 



CUGAUGAG X CGAA IACUACUA 



CUGAUGAG X CGAA ICUGACUA 
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2330 
2333 



Table 38 



GCTATGTC A ACGTTAAT 
ATATGGGC C TAAAAATC 



TATGGGCC T AAAAATCA 



TAAAAATC A GACAACTA 



AATCAGAC A ACTATTGT 



CAGACAAC T ATTGTGGT 



GTGGTTTC A CATTTCCT 



GGTTTCAC A TTTCCTGT 



CACATTTC C TGTCTTAC 



ACATTTCC T GTCTTACT 



TTCCTGTC T TACTTTTG 



TGTCTTAC T TTTGGGCG 



CGAGAAAC T GTTCTTGA 



AACTGTTC T TGAATATT 



TTGGTGTC T TTTGGAGT 



GGATTCGC A CTCCTCCT" 



ATTCGCAC T CCTCCTGC 



TCGCACTC C TCCTGCAT 



CGCACTCC T CCTGCATA 



CACTCCTC C TGCATATA 



ACTCCTCC T GCATATAG 



CCTCCTGC A TATAGACC 



ATATAGAC C ACCAAATG 



TATAGACC A CCAAATGC 



TAGACCAC C AAATGCCc" 



AUUAACGU CUGAUGAG X CGAA IACAUAGC~ 



GAUUUUUA CUGAUGAG X CGAA ICCCAUAU 



UGAUUUUU CUGAUGAG X CGAA IGCCCAUA~ 



UAGUUGUC CUGAUGAG X CGAA IAUUUUUA 



ACAAUAGU CUGAUGAG X CGAA IUCUGAUU 



ACCACAAU CUGAUGAG X CGAA IUUGUCUG 
AGGAAAUG CUGAUGAG X CGAA IAAACCAC~ 



ACAGGAAA CUGAUGAG X CGAA IUGAAACC~ 



GUAAGACA CUGAUGAG X CGAA IAAAUGUG 



AGUAAGAC CUGAUGAG X CGAA IGAAAUGTj 



CAAAAGUA CUGAUGAG X CGAA IACAGGAA ~ 



CGCCCAAA CUGAUGAG X CGAA IUAAGACA 



UCAAGAAC CUGAUGAG X CGAA IUUUCUCG 



AAUAUUC A CUGAUGAG X CGAA IAACAGUU 
ACUCCAAA CUGAUGAG X CGAA IACACCAA 



AGGAGGAG CUGAUGAG X CGAA ICGAAUCCT" 



GCAGGAGG CUGAUGAG X CGAA IUGCGAAU 



AUGCAGGA CUGAUGAG X CGAA IAGUGCGA 



UAUGCAGG CUGAUGAG X CGAA IGAGUGCG~ 



UAUAUGCA CUGAUGAG X CGAA IAGGAGUG 



AGACCACC A AATGCCCC 



CCAAATGC C CCTATCTT 



CAAATGCC C CTATCTTA 



AAATGCCC C TATCTTAT 



AATGCCCC T ATCTTATC 



CCCCTATC T TATCAACA 



ATCTTATC A ACACTTCC 



TTATCAAC A CTTCCGGA 



ATCAACAC T TCCGGAAA 



AACACTTC C GGAAACTA 



CCGGAAAC T ACTGTTGT 



GAAACTAC T GTTGTTAG 



GAAGAGGC A GGTCCCCT 



GGCAGGTC C CCTAGAAG 



GCAGGTCC C CTAGAAGA 



CAGGTCCC C TAGAAGAA 



AGGTCCCC T AGAAGAAG 



AGAAGAAC T CCCTCGCC 



AAGAACTC C CTCGCCTC 



AGAACTCC C TCGCCTCG 



GAACTCCC T CGCCTCGC 



TCCCTCGC C TCGCAGAC 



CCCTCGCC T CGCAGACG 



CGCCTCGC A GACGAAGG 



CGAAGGTC T CAATCGCC 



CUAUAUGC CUGAUGAG X CGAA I GAGGAGU 



GGUCUAUA CUGAUGAG X CGAA ICAGGAGG 



CAUUUGGU CUGAUGAG X CGAA IUCUAUAU 



GCAUUUGG CUGAUGAG X CGAA IGUCUAU~ 



GGGCAUUU CUGAUGAG X CGAA IUGGUCUA~ 



GGGGCAUU CUGAUGAG X CGAA IGUGGUCU~ 



AAGAUAGG CUGAUGAG X CGAA ICAUUUGG 



UAAGAUAG CUGAUGAG X CGAA IGCAUUUG 



AUAAGAUA CUGAUGAG X CGAA IGGCAUUU 



GAUAAGAU CUGAUGAG X CGAA IGGGCAUU 



UGUUGAUA CUGAUGAG X CGAA I AUAGGGG 



GGAAGUGU CUGAUGAG X CGAA IAUAAGAU 



UCCGGAAG CUGAUGAG X CGAA IUUGAUAA 



UUUCCGGA CUGAUGAG X CGAA IUGUUGAU 



UAGUUUCC CUGAUGAG X CGAA IAAGUGUU~ 



ACAACAGU CUGAUGAG X CGAA IUUUCCGG~ 



CUAACAAC CUGAUGAG X CGAA IUAGUUUC ~ 



AGGGGACC CUGAUGAG X CGAA ICCUCUUC 



CUUCUAGG CU GAUGAG X CGAA IACCUGCC 
UCUUCUAG CUGAUGAG X CGAA IGACCUGC 



UUCUUCUA CUGAUGAG X CGAA IGGACCUG 



CUUCUUCU CUGAUGAG X CGAA IGGGACCU 



GGCGAGGG CUGAUGAG X CGAA IUUCUUCU~ 



AAGGTCTC A ATCGCCGC | 1258 



GAGGCG AG CUGAUGAG X CGAA IAGUUCUU 
CGAGGCGA CUGAUGAG X CGAA IGAGUUCU~ 



GCGAGGCG CUGAUGAG X CGAA IGGAGUUC 



GUCUGCGA CUGAUGAG X CGAA ICGAGGGA 



CGUCUGCG CUGAUGAG X CGAA IGCGAGGG~ 



CCUUCGUC CUGAUGAG X CGAA I CGAGGCG 



GGCGAUUG CUGAUGAG X CGAA IACCUUCG~ 



GCGGCGAU CUGAUGAG X CGAA IAGACCUU~ 
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TCAATCGC C GCGTCGCA 
CGCGTCGC A GAAGATCT 



UGCGACGC CUGAUGAG X CGA A ICGAUUGA 
AGAUCUUC CUGAUGAG X CGAA ICGACGCG 



CGAGAUUG CUGAUGAG X CGAA IAUCUUCU 



AGAAGATC T CAATCTCG 



AAGATCT C A ATCTCGGG 
TCTCAATC T CGGGAATC 
CGGGAATC T CAATGTTA 
GGAATCT C A ATGTTAGT 
TAGTATTC C TTGGACAC 
AGTATTCC T TGGACACA 
CCTTGGAC A CATAAGGT 
TTGGA CAC A TAAGGTGG 
TGGGAAAC T TTACGGGG 
TACGGGGC T TTATTCTT 
CTTTATTC T TCTACGGT 
TATTCTTC T ACGGTACC 
TACGGTAC C TTGCTTTA 
ACGGTACC T TGCTTTAA 
TACCTTGC T TTAATCCT 
CTTTAATC C TAAATGGC 
TTTAATCC T AAATGGCA 
TAAATGGC A AACTCCTT 
TGGCAAAC T C CTTCTTT 
GCAAACTC C TTCTTTTC 
CAAACTCC T TCTTTTCC 
ACTCCTTC T TTTCCTGA 
TTCTTTTC C TGACATTC 
TCTTTTCC T GA CATTCA 
TTCCTGAC A TTCATTTG 
TGACATTC A TTTGCAGG 
TCATTTGC A GGAGGACA 
AGGAGGAC A TTGTTGAT 
ATGTAAGC A ATTTGTGG 
TGTGGGGC C CCTTACAG 
GTGGGGCC C CTTACAGT 
TGGGGCCC C TTACAGTA 
GGGGCCCC T T ACAGTAA 
CCCCTTAC A GTAAATGA 
ATGAAAAC A GGAGACTT 
CAGGAGAC T TAAATTAA 
AAATTAAC T ATGCCTGC 
AACTATGC C TGCTAGGT 
ACTATGCC T GCTAGGTT 
ATGCCTGC T AGGTTTTA 
GTTTTATC C CAATGTTA 
TTTTATCC C AATGTTAC 
TTTATCCC A ATGTTACT 
AATGTTAC T AAATATTT 
ATATTTGC C CTTAGATA 
TATTTGCC C TTAGATAA 
ATTTGCCC T TAGATAAA 
AAGGGATC A AACCGTAT 



1305 
1306 



CCCGAGAU CUGAUG AG X CGAA IAGAUCUU 
GAUUCCCG CUGAUGAG X CGAA IAUUGAGA 
UAACAUUG CUGA UGAG X CGAA IAUUCCCG 
ACUAACAU CUGAUGAG X CGAA IAGAUUCC 
GUGUCCAA CUGAUGA G X CGAA IAAUACUA 
UGUGUCCA CUGAUGAG X CGAA IGAAUACU 
ACCUUAUG CUGAUG AG X CGAA IUCCAAGG 
CCACCUUA CUGAU GAG X CGAA IUGUCCAA 
CCCCGUAA CUGAUGAG X CGAA IUUUCCCA 
AAGAAUAA CUGAUG AG X CGAA ICCCCGUA 
ACCGUAGA CUGAU GAG X CGAA IAAUAAAG 
GGUACCGU CUGAUGAG X CG AA IAAGAAUA 
UAAAGCAA CUGAUGAG X CGAA IUACCGUA 
UUAAAGCA CUGAUG AG X CGAA IGUACCGU 
AGGAUUAA CU GAUGAG X CGAA ICAAGGUA~~ 
GCCAUUUA CU GAUGAG X CGAA IAUUAAAG 
UGCCAUUU CUGA UGAG X CGAA IGAUUAAA 
AAGGAGUU CUGAUG AG X CGAA ICCAUUUA 
AAAGAAGG CUGAUG AG X CGAA IUUUGCCA 
GAAAAGAA CUGAUGAG X CGAA IAGUUUGC 
GGAAAAGA CUGA UGAG X CGAA IGAGUUUG 
UCAGGAAA CU GAUGAG X CGAA IAAGGAGU 
GAAUGUCA CUGAUG AG X CGAA IAAAAGAA 
UGAAUGUC CUGAUG AG X CGAA IGAAAAGA 
CAAAUGAA CUGAUGAG X CGAA I UCAGGAA 
CCUGCAAA CUGAUGAG X CGAA IAAUGUCA 
UGUCCUCC CUGAUGAG X CGAA I CAAAUGA 
AUCAACAA CUGA UGAG X CGAA IUCCUCCU 
CCACAAAU CUGAUGA G X CGAA ICUUACAU 
CUGUAAGG CUGAUGAG X CGAA ICCCCACA~ 
ACUGUAAG CUGAUGAG X CGA A IGCCCCAC~ 
UACUGUAA CUGAUG AG X CGAA IGGCCCCA~ 
UUACUGUA CUGAUGA G X CGAA IGGGCCCC~ 
UCAUUUAC CUGAUGA G X CGAA IUAAGGGG 
AA GUCUCC CUGAUGAG X CGAA IUUUUCAU 
UUAAUU UA CUGAUGAG X CGAA IUCUCCUG 
GCAGGCA U CUGAUGAG X CGAA IUUAAUUU 
ACCUAGCA CUGAUGAG X CGAA ICAUAGUU ~ 
AACCUAGC CUGAUGAG X CG AA IGCAUAGU~ 
UAAAACCU CUGAU GAG X CGAA ICAGGCAU~ 
UAACAUUG CUGAUGAG X C GAA IAUAAAAC 
GUAACAUU CUGAUGAG X CGAA IGAUAAAA 
AGUAACAU CUGAUGA G X CGAA IGGAUAAA 
AAAUAUUU CUGAUGAG X CGAA IUAACAUU 
UAUCUAAG CUGAUGAG X CGAA ICAAAUAU~ 
UUAUCUAA CUGAUGAG X CG AA IGCAAAUA 
UUUAUCUA CUGAUGAG X CGAA IGGCAAAU 
AUACGGUU CUGAUGAG X CGAA IAUCCCUU~ 
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GATCAAAC C GTATTATC 



GAUAAUAC CUGAUGAG X CGAA IUUUGAUC 



GTATTATC C AGAGTATG 



CAUACUCU CUGAUGAG X CGAA IAUAAUAC 



TATTATCC A GAGTATGT 



ACAUACUC CUGAUGAG X CGAA IGAUAAUA 



AGTTAATC A TTACTTCC 



GGAAGUAA CUGAUGAG X CGAA IAUUAACU 



ATCATTAC T TCCAGACG 



CGUCUGGA CUGAUGAG X CGAA IUAAUGAU 



ATTACTTC C AGACGCGA 



UCGCGUCU CUGAUGAG X CGAA IAAGUAAU 



2756 
2758 



TTACTTCC A GACGCGAC 



GACGCGAC A TTATTTAC 



TTATTTAC A CACTCTTT 



ATTTACAC A CTCTTTGG 



GUCGCGUC CUGAUGAG X CGAA IGAAGUAA 



GUAAAUAA CUGAUGAG X CGAA IUCGCGUC 



AAAGAGUG CUGAUGAG X CGAA IUAAAUAA 



CCAAAGAG CUGAUGAG X CGAA IUGUAAAU 



TTACACAC T CTTTGGAA 



UUCCAAAG CUGAUGAG X CGAA IUGUGUAA 



ACACACTC T TTGGAAGG 



CCUUCCAA CUGAUGAG X CGAA IAGUGUGU 



CGGGGATC T TATATAAA 



UUUAUAUA CUGAUGAG X CGAA IAUCCCCG 



AGAGAGTC C ACACGTAG 



CUACGUGU CUGAUGAG X CGAA IACUCUCU 



GAGAGTCC A CACGTAGC 



GCUACGUG CUGAUGAG X CGAA IGACUCUC 



GAGTCCAC A CGTAGCGC 
CGTAGCGC C TCATTTTG 



GCGCUACG CUGAUGAG X CGAA IUGGACUC 



CAAAAUGA CUGAUGAG X CGAA ICGCUACG 



GTAGCGCC T CATTTTGC 



GCAAAAUG CUGAUGAG X CGAA IGCGCUAC 



AGCGCCTC A TTTTGCGG 



CCGCAAAA CUGAUGAG X CGAA IAGGCGCU 



TGCGGGTC A CCATATTC 



GAAUAUGG CUGAUGAG X CGAA IACCCGCA 



CGGGTCAC C ATATTCTT 



AAGAAUAU CUGAUGAG X CGAA IUGACCCG 



GGGTCACC A TATTCTTG 



CAAGAAUA CUGAUGAG X CGAA IGUGACCC 



CCATATTC T TGGGAACA 



UGUUCCCA CUGAUGAG X CGAA IAAUAUGG 



TTGGGAAC A AGATCTAC 



GUAGAUCU CUGAUGAG X CGAA IUUCCCAA 



ACAAGATC T ACAGCATG 



CAUGCUGU CUGAUGAG X CGAA IAUCUUGU 



AGATCTAC A GCATGGGA 



UCCCAUGC CUGAUGAG X CGAA IUAGAUCU 



TCTACAGC. A TGGGAGGT 



ACCUCCCA CUGAUGAG X CGAA ICUGUAGA 



GGTTGGTC T TCCAAACC 



GGUUUGGA CUGAUGAG X CGAA IACCAACC 



TGGTCTTC C 
GGTCTTCC I 



CGAGGUUU CUGAUGAG X CGAA IAAGACCA 



UCGAGGUU CUGAUGAG X CGAA IGAAGACC 



2896 
2900 



TTCCAAAC C TCGAAAAG 



CUUUUCGA CUGAUGAG X CGAA IUUUGGAA 



TCCAAACC T CGAAAAGG 



CCUUUUCG CUGAUGAG X CGAA IGUUUGGA 



GAAAAGGC A TGGGGACA 



UGUCCCCA CUGAUGAG X CGAA ICCUUUUC 



ATGGGGAC A AATCTTTC 



GAAAGAUU CUGAUGAG X CGAA IUCCCCAU 



GACAAATC T TTCTGTCC 



GGACAGAA CUGAUGAG X CGAA IAUUUGUC 



AATCTTTC T GTCCCCAA 



TTTCTGTC C CCAATCCC 



UUGGGGAC CUGAUGAG X CGAA IAAAGAUU 
GGGAUUGG CUGAUGAG X CGAA IACAGAAA~ 



TTCTGTCC C CAATCCCC 



GGGGAUUG CUGAUGAG X CGAA IGACAGAA 



TCTGTCCC C AATCCCCT 



AGGGGAUU CUGAUGAG X CGAA IGGACAGA 



CTGTCCCC A ATCCCCTG 



CAGGGGAU CUGAUGAG X CGAA IGGGACAG 



CCCCAATC C CCTGGGAT 



AUCCCAGG CUGAUGAG X CGAA IAUUGGGG 



CCCAATCC C CTGGGATT 



CCAATCCC C TGGGATTC 



AAUCCCAG CUGAUGAG X 
GAAUCCCA 



CUGAUGAG X CGAA IGGAUUGG 



CAATCCCC T GGGATTCT 



AGAAUCCC CUGAUGAG X CGAA IGGGAUUG 



TGGGATTC T TCCCCGAT 



AUCGGGGA CUGAUGAG X CGAA IAAUCCCA 



GATTCTTC C CCGATCAT 



AUGAUCGG CUGAUGAG X CGAA IAAGAAUC 



ATTCTTCC C CGATCATC 



GAUGAUCG CUGAUGAG X CGAA IGAAGAAU 



TTCTTCCC C GATCATCA 



UGAUGAUC CUGAUGAG X CGAA IGGAAGAA 



CCCCGATC A TCAGTTGG 



CCAACUGA CUGAUGAG X CGAA IAUCGGGG 



CGATCATC A GTTGGACC 



GGUCCAAC CUGAUGAG X CGAA IAUGAUCG 



AGTTGGAC C CTGCATTC 



GAAUGCAG CUGAUGAG X CGAA IUCCAACU 
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GTTGGACC C TGCATTCA 
TTGGACCC T GCATTCAA 
GACCCTGC A TTCAAAGC 
CTGCATTC A AAGCCAAC 
TTCAAAGC C AACTCAGT 
TCAAAGCC A ACTCAGTA 
AAGCCA AC T CAGTAAAT 
GCCAACTC A GTAAATCC 



UGAAU GCA CUGAUGAG X CGAA IGUCCAAC 
UUGAAUGC CUGA UGAG X CGAA IGGUCCAA~ 
GCUUUGAA CUGAUGAG X CGAA ICAGGGUC~ 



GUUGGCUU CUGAUGAG : 



CGAA IAAUGCAG 



ACUGAGUU CUGAUGAG X C GAA ICUUUGAA 

UACUGAGU CUGAUGA G X CGAA IGCUUUG~A~ 

AUUUACUG CUGAUGAG X CGAA IUUGGCUU 

GGAUUUAC CUGAUGAG X CGAA IAGUUGGC ~ 



CCCAAUCU CUGAUGAG X CGAA IAUUUACU"~ 



AGTAAATC C AGATTGGG 



GTAAATCC A GATTGGGA 
ATTGGGAC C TCAACCCG 



UCCCAAUC CUGAUGAG X CGAA IGAUUUAC 



CGGGUUGA CUGAUGAG X CGAA IUCCCAAU 



TTGGGACC T CAACCCGC 



GCGGGUUG CUGAUGAG X CGAA IGUCCCAA 



GGGACCTC A ACCCGCAC 



GUGCGGGU CUGAUGAG X CGAA IAGGUCCC 



ACCTCAAC C CGCACAAG 



CUUGUGCG CUGAUGAG X CGAA IUUGAGGU 



CCTCAACC C GCACAAGG 



CCUUGUGC CUGAUGAG X CGAA IGUUGAGG 



CAACCCGC A CAAGGACA 



UGUCCUUG CUGAUGAG X CGAA ICGGGUUG 



3060 
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ACCCGCAC A AGGACAAC 
ACAAGGAC A ACTGGCCG 



GUUGUCCU CUGAUGAG X CGAA IUGCGGGU 



CGGCCAGU CUGAUGAG X CGAA IUCCUUGU 



AGGACAAC T GGCCGGAC 



GUCCGGCC CUGAUGAG X CGAA IUUGUCCU 



CAACTGGC C GGACGCCA 



UGGCGUCC CUGAUGAG X CGAA ICCAGUUG 



CCGGACGC C AA CAAGGT 
CGGACGCC A ACAAGGTG 
ACGCCAAC A AGGTGGGA 
GTGGGAGC A TTCGGGCC 
ATTCGGG C C AGGGTTCA 
TTCGGGCC A GGGTTCAC 
CAGGGTTC A CCCCTCCC 
GG GTTCAC C CCTCCCCA 
GGTTCACC C CTCCCCAT 
GTTCACCC C TCCCCATG 
TTCACCCC T CCCCATGG 
CACCCCTC C CCATGGGG 
ACCCCTCC C CATGGGGG 
CCCCTCCC C ATGGGGGA 
CCCTCCCC A TGGGGGAC 
TGGGGGAC T GTTGGGGT 
GGTGGAGC C CTCACGCT 
GTGGAGCC C TCACGCTC 
TGGAGCCC T CACGCTCA 
GAG CCCTC A CGCTCAGG 
CCTCACGC T CAGGGCCT 
TCACGCTC A GGGCCTAC 
CTCAGGGC C TACTCACA 
TCAGGGCC T ACTCACAA 
GGGCCTAC T CACAACTG 
GCCTACTC A CAACTGTG 
CTACTCAC A ACTGTGCC 
CTCACAAC T GTGCCAGC 
AACTGTGC C AGCAGCTC 
ACTGTGCC A GCAGCTCC 
GTGCCAGC A GCTCCTCC 



ACCUUGUU CUGAUGAG X CGAA ICGUCCGG 
CACC UUGU CUGAUGAG X CGAA IGCGUCCG 
UCCCACCU C UGAUGAG X CGAA IUUGGCGU 
GGCCCGAA CUGA UGAG X CGAA ICUCCCAC 
UGAACCCU CUG AUGAG X CGAA ICCCGAAU~ 
GUGAACCC CUGAUGAG X CGAA IGCCCGAA~ 
GGGAGGGG CUGAUGAG X CGAA IAACCCUG~ 
UGGGGAGG CUGAUGAG X CG AA IUGAACCC ~ 
AUGGGGAG CUGAUGAG X CGAA IGUGAACC 



CAUGGGGA CUGAUGAG X CGAA 

CCAU GGGG CUGAUGAG X CGAA~ 

CCCCAUGG CUGAUGAG X CGAA 

CCCCCAUG CUGAUGAG X CGAA 



IGGUGAAC 
"iGGGUGAA 
IAGGGGUG ~ 
IGAGGGGU 



UCCCCCAU CUGAUGAG X CG AA IGGAGGGG 
GUCCCCCA CUGAUG AG X CGAA IGGGAGGG 
ACCCCAAC CUGA UGAG X CGAA IUCCCCCA 
AGCGUGAG CUGAUGAG X C GAA ICUCCACC 
GAG CGUGA CUGAUGAG X CGAA IGCUCCAC 
UGAGCGUG CUGAUGAG X CGAA IGGCUCCA 
CCUGAGCG CUGAUGAG X CGAA IAGGGCUC 
AGGCCCUG CUGAUGAG X CGAA ICGUGAGG 
GUAGGCCC CUG AUGAG X CGAA IAGCGUGA~~ 
UGUGAGUA CUGAUGAG X CGAA ICCCUGAG~ 
UUG UGAGU CUGAUGAG X CGAA IGCCCUGA 
CAGUUGUG CUGAUGAG X CGAA IUAGGCCC~ 
CACAGUUG CUGAUGAG X C GAA IAGUAGG"C~ 
GGCACAGU CUGAUGAG X CGAA IUGAGUAG 
GCUGGCAC CUGAUGAG X CGAA IUUGUGAG~ 
GAGCUGCU CUGAUG AG X CGAA ICACAGUU~ 
GGAGCUGC CUG AUGAG X CGAA IGCACAGU~ 
GGAGGAGC CUGAUGAG X CGAA ICUGGCAC 
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Table 38 



3122 | CCAGCAGC T CCTCCTCC 


1412 


GGAGGAGG CUGAUGAG X CGAA ICUGCUGG 




3124 AGCAGCTC C TCCTCCTG 


1413 


CAGGAGGA CUGAUGAG X CGAA IAGCUGCU 


3922 


3125 


GCAGCTCC T CCTCCTGC 


1414 


GCAGGAGG CUGAUGAG X CGAA IGAGCUGC 


3923 


3127 


AGCTCCTC C TCCTGCCT 


1415 


AGGCAGGA CUGAUGAG X CGAA IAGGAGCU 


3924 


3128 


GCTCCTCC T CCTGCCTC 


1415 


GAGGCAGG CUGAUGAG X CGAA IGAGGAGC 


3925 


3130 


TCCTCCTC C TGCCTCCA 


1417 


UGGAGGCA CUGAUGAG X CGAA IAGGAGGA 


2926 

- 


3131 


CCTCCTCC T GCCTCCAC 


1418 


GUGGAGGC CUGAUGAG X CGAA IGAGGAGG 




3134 


CCTCCTGC C TCCACCAA 


1419 


UUGGUGGA CUGAUGAG X CGAA ICAGGAGG 


3928 


3135 


CTCCTGCC T CCACCAAT 


1420 


AUUGGUGG CUGAUGAG X CGAA IGCAGGAG 


3929 


3137 


CCTGCCTC C ACCAATCG 


1421 


CGAUUGGU CUGAUGAG X CGAA IAGGCAGG 


930 


3138 


CTGCCTCC A CCAATCGG 


1422 


CCGAUUGG CUGAUGAG X CGAA IGAGGCAG 




3140 


GCCTCCAC C AATCGGCA 


1423 


UGCCGAUU CUGAUGAG X CGAA IUGGAGGC 


3932 


3141 


CCTCCACC A ATCGGCAG 


1424 


CUGCCGAU CUGAUGAG X CGAA I GUGGAGG 


933 


3148 


CAATCGGC A GTCAGGAA 


1425 


UUCCUGAC CUGAUGAG X CGAA ICCGAUUG 


934 


3152 


CGGCAGTC A GGAAGGCA 


1426 


UGCCUUCC CUGAUGAG X CGAA IACUGCCG 


3935 


3160 


AGGAAGGC A GCCTACTC 


1427 


GAGUAGGC CUGAUGAG X CGAA ICCUUCCU 


3936 


3163 


AAGGCAGC C TACTCCCT 


1428 


AGGGAGUA CUGAUGAG X CGAA ICUGCCUU 


3937 


3164 


AGGCAGCC T ACTCCCTT 


1429 


AAGGGAGU CUGAUGAG X CGAA IGCUGCCU 


38 

■ 


31S7 


CAGCCTAC T CCCTTATC 


1430 


GAUAAGGG CUGAUGAG X CGAA IUAGGCUG 




3169 


GCCTACTC C CTTATCTC 


1431 


GAGAUAAG CUGAUGAG X CGAA IAGUAGGC 


940 


3170 


CCTACTCC C TTATCTCC 


1432 


GGAGAUAA CUGAUGAG X CGAA IGAGUAGG 


3941 


3171 


CTACTCCC T TATCTCCA 


1433 


UGGAGAUA CUGAUGAG X CGAA IGGAGUAG 


3942 




3176 


CCCTTATC T CCACCTCT 


1434 


AGAGGUGG CUGAUGAG X CGAA IAUAAGGG 





3178 


CTTATCTC C ACCTCTAA 


1435 


UUAGAGGU CUGAUGAG X CGAA IAGAUAAG 


3944 


3179 


TTATCTCC A CCTCTAAG 


1436 


CUUAGAGG CUGAUGAG X CGAA IGAGAUAA 


^Hl 


3181 


ATCTCCAC C TCTAAGGG 


1437 


CCCUUAGA CUGAUGAG X CGAA IUGGAGAU 





3182 


TCTCCACC T CTAAGGGA 


1438 


UCCCUUAG CUGAUGAG X CGAA I GUGGAGA 




3184 


TCCACCTC T AAGGGACA 


1439 


UGUCCCUU CUGAUGAG X CGAA IAGGUGGA 




3192 


TAAGGGAC A CTCATCCT 




AGGAUGAG CUGAUGAG X CGAA IUCCCUUA 


3949 


3194 


AGGGACAC T CATCCTCA 


1441 


UGAGGAUG CUGAUGAG X CGAA IUGUCCCU 


3950 


3196 


GGACACTC A TCCTCAGG 


1442 


CCUGAGGA CUGAUGAG X CGAA IAGUGUCC 


3951 


3199 


CACTCATC C TCAGGCCA 


1443 


UGGCCUGA CUGAUGAG X CGAA IAUGAGUG 


3952 


3200 


ACTCATCC T CAGGCCAT 




AUGGCCUG CUGAUGAG X CGAA IGAUGAGU 


3953 


3202 


TCATCCTC A GGCCATGC 


1445 


GCAUGGCC CUGAUGAG X CGAA IAGGAUGA 


3954 


3206 


CCTCAGGC C ATGCAGTG 


1446 


CACUGCAU CUGAUGAG X CGAA ICCUGAGG 


3955 


3207 


CTCAGGCC A TGCAGTGG 


1447 


CCACUGCA CUGAUGAG X CGAA IGCCUGAG 


3956 



Input Sequence = AF100308. Cut site = CH/ . 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or ot 
AF100308 (Hepatitis B virus strain 2-18, 3215 bp) 
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1 


GCCACCAG GCcgaaagGCGaGuCaaGGuCu AGGAAAGU : 


AUUCUGAG GCcgaaagGCGaGuCaaGGuCu AGGGCUCA | 


CGAUAUGG GCcgaaagGCGaGuCaaGGuCu agagacag [ 


UCCUGAUG GCcgaaagGCGaGuCaaGGuCu GAUGUUCU 


AACACGAG GCcgaaagGCGaGuCaaGGuCu AGGGGUCC | 


UGGGACUG GCcgaaagGCGaGuCaaGGuCu GAAUUUUG j 


ACAUCCAG GCcgaaagGCGaGuCaaGGuCu GAUAACCA j 


AAACGCCG GCcgaaagGCGaGuCaaGGuCu AGACACAU ] 


GCAGGAUG GCcgaaagGCGaGuCaaGGuCu AGAGGAAG | 


CAUAGCAG GCcgaaagGCGaGuCaaGGuCu AGGAUGCA | 


AGGCAUAG GCcgaaagGCGaGuCaaGGuCu AGCAGGAU | 


AGAUGAGG GCcgaaagGCGaGuCaaGGuCu AUAGCAGC I 


CAAACGGG GCcgaaagGCGaGuCaaGGuCu AACAUACC j 


AGGUUUUG GCcgaaagGCGaGuCaaGGuCu AUGGUCCG 


GAGUUGUG GCcgaaagGCGaGuCaaGGuCu AGGUUUUG | 


UCCUUGAG GCcgaaagGCGaGuCaaGGuCu AGGAGUUG 


UUGUACAG GCcgaaagGCGaGuCaaGGuCu AACAUGAG 1 


UACAGGUG GCcgaaagGCGaGuCaaGGuCu AGUUUCCG j 


ACAAAUGG GCcgaaagGCGaGuCaaGGuCu ACUAGUAA 


AACAGCGG GCcgaaagGCGaGuCaaGGuCu AUAAAGGG ] 


GGUAACAG GCcgaaagGCGaGuCaaGGuCu GGCAUAAA 


UCCUGUGG GCcgaaagGCGaGuCaaGGuCu AAUGUGCC 


AGGGGCGG GCcgaaagGCGaGuCaaGGuCu AAACCCCA 


CCACAUUG GCcgaaagGCGaGuCaaGGuCu GUGAAAGG 


CAUUAAAG GCcgaaagGCGaGuCaaGGuCu AGAAUAUC j 


UAUAAAGG GCcgaaagGCGaGuCaaGGuCu AUUAAAGC j 


UAUGCAUG GCcgaaagGCGaGuCaaGGuCu AUAUAAAG j 


CUUGUAUG GCcgaaagGCGaGuCaaGGuCu AUGCAUAU j 


Seq ID 




1450 




1454 












1461 




1463 






1466 














1476 


1479 










1485 


Substrate j 


ACUUUCCU G CUGGUGGC 1 


UGAGCCCU G CUCAGAAU | 


CUGUCUCU G CCAUAUCG 1 


AGAACAUC G CAUCAGGA 1 


B 
O 

8 
5 

i 

1 


< 


UGGUUAUC G CUGGAUGU 


i 

8 

1 
1 


CUUCCUCU G CAUCCUGC j 


UGCAUCCU G CUGCUAUG 


u 

1 

p 


GCUGCUAU G CCUCAUCU 


GGUAUGUU G CCCGUUUG | 


CGGACCAU G CAAAACCU 


CAAAACCU G CACAACUC [ 


CAACUCCU G CUCAAGGA 


i 

1 


CGGAAACU G CACCUGUA 


UUACUAGU G CCAUUUGU 


CCCUUUAU G CCGCUGUU ; 


UUUAUGCC G CUGUUACC j 


GGCACAUU G CCACAGGA 


UGGGGUUU G CCGCCCCU 1 


i 8 

\l 

j a 

i % 
' § 


GAUAUUCU G CUUUAAUG , 


GCUUUAAU G CCUUUAUA ; 


1 
1 

o 

1 


AUAUGCAU G CAUACAAG 


I 














































1020 




1050 


1058 : 


1068 


1072 
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4196 1 






4199 | 




4201 


4202 | 


4203 | 


4204 1 


4205 1 


4206 j 


i 4207 | 
































| 4223 | 






1 4226 1 


























UAAGUUGG GCcgaaagGCGaGuCaaGGuCu GAGAAAGU 


UUGCCGAG GCcgaaagGCGaGuCaaGGuCu AACGGGGU \ 


ACACUUGG GCcgaaagGCGaGuCaaGGuCu AUAGACCA 1 


UGCGUCAG GCcgaaagGCGaGuCaaGGuCu AAACACUU 1 


! GGGGGUUG GCcgaaagGCGaGuCaaGGuCu GUCAGCAA | 


| CACGCAUG GCcgaaagGCGaGuCaaGGuCu GCUGAUGG 


! GUUCCACG GCcgaaagGCGaGuCaaGGuCu AUGCGCUG [ 


| UGGAUCGG GCcgaaagGCGaGuCaaGGuCu AGAGGAGA | 


GAGUUCCG GCcgaaagGCGaGuCaaGGuCu GGUAUGGA 1 


1 AAAACAAG GCcgaaagGCGaGuCaaGGuCu GGCUAGGA 1 


GCUGCGAG GCcgaaagGCGaGuCaaGGuCu AAAACAAG ! 


ACCUGCUG GCcgaaagGCGaGuCaaGGuCu GAGCAAAA 


CGGGAGAG GCcgaaagGCGaGuCaaGGuCu ACGACAGA 


UAUAUUUG GCcgaaagGCGaGuCaaGGuCu GGGAGAGC 


CAGCCUAG GCcgaaagGCGaGuCaaGGuCu AGCCAUGG 


GUUGGCAG GCcgaaagGCGaGuCaaGGuCu ACAGCCUA ! 


I CCAGUUGG GCcgaaagGCGaGuCaaGGuCu AGCACAGC | 


1 ACGUCCCG GCcgaaagGCGaGuCaaGGuCu GUAGGAUC 


! GGAUUCAG GCcgaaagGCGaGuCaaGGuCu GCCGACGG 


\ GUCGUCCG GCcgaaagGCGaGuCaaGGuCu GGGAUUCA 


GCCCCAAG GCcgaaagGCGaGuCaaGGuCu GGCCCCGG 


1 GAAGCGGG GCcgaaagGCGaGuCaaGGuCu GGUAGAGC 


j CGGAGAAG GCcgaaagGCGaGuCaaGGuCu GGGCGGUA 


1 ACAAUAGG GCcgaaagGCGaGuCaaGGuCu GGAGAAGC 


\ GAGAGGUG GCcgaaagGCGaGuCaaGGuCu GCCCCGUG 


GGAGUCCG GCcgaaagGCGaGuCaaGGuCu GUAAAGAG 


| UGAGAAGG GCcgaaagGCGaGuCaaGGuCu ACAGACGG 


CGGUCCGG GCcgaaagGCGaGuCaaGGuCu AGAUGAGA 


: GCGAAGUG GCcgaaagGCGaGuCaaGGuCu ACACGGUC 


| AGGUGAAG GCcgaaagGCGaGuCaaGGuCu GAAGUGCA 


| GCGACGUG GCcgaaagGCGaGuCaaGGuCu AGAGGUGA 


j UCUCCAUG GCcgaaagGCGaGuCaaGGuCu GACGUGCA 


CCUGUGGG GCcgaaagGCGaGuCaaGGuCu GUUCACGG 


ACCUUGGG GCcgaaagGCGaGuCaaGGuCu AGGUUCCU 
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ACCCCGUU G CUCGGCAA | 
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| UUGCUGAC G CAACCCCC | 


CCAUCAGC G CAUGCGUG 


CAGCGCAU G CGUGGAAC 


UCUCCUCU G CCGAUCCA 


UCCAUACC G CGGAACUC 


UCCUAGCC G CUUGUUUU 
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UUUUGCUC G CAGCAGGU 


UCUGUCGU G CUCUCCCG 
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1 
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CCAUGGCU G CUAGGCUG 


UAGGCUGU G CUGCCAAC 


GCUGUGCU G CCAACUGG 


GAUCCUAC G CGGGACGU 


CCGUCGGC G CUGAAUCC 


| UGAAUCCC G CGGACGAC 


CCGGGGCC G CUUGGGGC 


GCUCUACC G CCCGCUUC 


UACCGCCC G CUUCUCCG 


1 GCUUCUCC G CCUAUUGU 


CACGGGGC G CACCUCUC 


CUCUUUAC G CGGACUCC 
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| UCUCAUCU G CCGGACCG 


1 GACCGUGU G CACUUCGC 
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1 UGCACGUC G CAUGGAGA 


| CCGUGAAC G CCCACAGG 


| AGGAACCU G CCCAAGGU 
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4303 


4304 




4306 1 




4308 | 


4309 1 






4312 | 








4316 | 


4317 | 






4320 
4321 


4323 
4324 


4325 | 








4329 | 


4330 | 


UGGCAGCA GCcgaaagGCGaGuCaaGGuCu AGCCUAGC ; 
GACGUAAA GCcgaaagGCGaGuCaaGGuCu AAAGGACG 1 


GGUCGGUA GCcgaaagGCGaGuCaaGGuCu AAUAGGCG ] 


AGAAGGCA GCcgaaagGCGaGuCaaGGuCu AGACGGGG 


GAAGUGCA GCcgaaagGCGaGuCaaGGuCu ACGGUCCG | 


GUCGUUGA GCcgaaagGCGaGuCaaGGuCu AUUGCUGA | 


UAAACACA GCcgaaagGCGaGuCaaGGuCu AGUCUUUG 


AUUAAACA GCcgaaagGCGaGuCaaGGuCu ACAGUCUU 


UCAUUAAA GCcgaaagGCGaGuCaaGGuCu ACACAGUC j 


UCCUAGUA GCcgaaagGCGaGuCaaGGuCu AAAGACCU 1 


UAUGCCUA GCcgaaagGCGaGuCaaGGuCu AGCCUCCU I 


GGUGAACA GCcgaaagGCGaGuCaaGGuCu ACCAAUUU j 


CUGGUGAA GCcgaaagGCGaGuCaaGGuCu ACACCAAU i 


GACAUGAA GCcgaaagGCGaGuCaaGGuCu AUGAGAUG | 


CAGUAGGA GCcgaaagGCGaGuCaaGGuCu AUGAACAU j 


GGCUUGAA GCcgaaagGCGaGuCaaGGuCu AGUAGGAC I 


CCAAGGCA GCcgaaagGCGaGuCaaGGuCu AGCUUGGA j 


UAACUCCA GCcgaaagGCGaGuCaaGGuCu AGAAGCUC j 


CCCCGAUA GCcgaaagGCGaGuCaaGGuCu AGAGCAGA j 


GAGGUGAA GCcgaaagGCGaGuCaaGGuCu AAUGUUCC 


CCCCAACA GCcgaaagGCGaGuCaaGGuCu AGAAUAGC 


CACCCCAA GCcgaaagGCGaGuCaaGGuCu ACAGAAUA j 


AACGUUGA GCcgaaagGCGaGuCaaGGuCu AUAGCUGA j 


UGAAACCA GCcgaaagGCGaGuCaaGGuCu AAUAGUUG 
AAGUAAGA GCcgaaagGCGaGuCaaGGuCu AGGAAAUG 
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UCUAACAA GCcgaaagGCGaGuCaaGGuCu AGUAGUUU | 


UCGUCUAA GCcgaaagGCGaGuCaaGGuCu AACAGUAG j 


AAUACUAA GCcgaaagGCGaGuCaaGGuCu AUUGAGAU 


UCUAUCAA GCcgaaagGCGaGuCaaGGuCu AAUGUCCU 


AUUGCUUA GCcgaaagGCGaGuCaaGGuCu AUCUAUCA | 


GGGCCCCA GCcgaaagGCGaGuCaaGGuCu AAAUUGCU 
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1640 
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GCUAGGCU G UGCUGCCA 1 
CGUCCUUU G UUUACGUC 
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U 
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CGGACCGU G UGCACUUC I 


UCAGCAAU G UCAACGAC | 


CAAAGACU G UGUGUUUA 


AAGACUGU G UGUUUAAU 


GACUGUGU G UUUAAUGA | 


AGGUCUUU G UACUAGGA 1 


AGGAGGCU G UAGGCAUA 


AAAUUGGU G UGUUCACC 


AUUGGUGU G UUCACCAG 


CAUCUCAU G UUCAUGUC j 


AUGUUCAU G UCCUACUG 


GUCCUACU G UUCAAGCC 


UCCAAGCU G UGCCUUGG 


GAGCUUCU G UGGAGUUA 
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GGAACAUU G UUCACCUC 


GCUAUUCU G UGUUGGGG | 


UAUUCUGU G UUGGGGUG 


UCAGCUAU G UCAACGUU 


CAACUAUU G UGGUUUCA 
CAUUUCCU G UCUUACUU 


UAUUUGGU G UCUUUUGG 
UUUGGAGU G UGGAUUCG 


AAACUACU G UUGUUAGA . 
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AUCUCAAU G UUAGUAUU [ 


AGGACAUU G UUGAUAGA 


UGAUAGAU G UAAGCAAU 
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2344 


2347 , 


2450 
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AUUAACUA GCcgaaagGCGaGuCaaGGuCu AUACUCUG | 


AUUGGGGA GCcgaaagGCGaGuCaaGGuCu AGAAAGAU j 


CACCCCAA GCcgaaagGCGaGuCaaGGuCu AGUCCCCC 


UGCUGGCA GCcgaaagGCGaGuCaaGGuCu AGUUGUGA j 


GGCCCUGA GCcgaaagGCGaGuCaaGGuCu UCUGGGAU j 


GUACAGGG GCcgaaagGCGaGuCaaGGuCu CCUGACUC j 


UGGAGCCA GCcgaaagGCGaGuCaaGGuCu CAGCAGGA 


AACUGGAG GCcgaaagGCGaGuCaaGGuCu CACCAGCA 1 


UUCCUGAA GCcgaaagGCGaGuCaaGGuCu UGGAGCCA j 


AGGGCUCA GCcgaaagGCGaGuCaaGGuCu UGUUCCUG | 


GAGCAGGG GCcgaaagGCGaGuCaaGGuCu UCACUGUU \ 


AAGAUUGA GCcgaaagGCGaGuCaaGGuCu GAUAUGGC 1 


CUGUAACA GCcgaaagGCGaGuCaaGGuCu GAGCAGGG ] 


AAACCCCG GCcgaaagGCGaGuCaaGGuCu CUGUAACA | 


AAGAAAAA GCcgaaagGCGaGuCaaGGuCu CCCGCCUG 


AGUCUAGA GCcgaaagGCGaGuCaaGGuCu UCUGUGGU 1 


GUCCACCA GCcgaaagGCGaGuCaaGGuCu GAGUCUAG 1 


GAAGUCCA GCcgaaagGCGaGuCaaGGuCu CACGAGUC | 


AAGACACA GCcgaaagGCGaGuCaaGGuCu GGGUGUUC ] 
AAUUUUGG GCcgaaagGCGaGuCaaGGuCu CAAGACAC 


AUUUGGGA GCcgaaagGCGaGuCaaGGuCu UGCGAAUU 
GUGAGUGA GCcgaaagGCGaGuCaaGGuCu UGGAGAUU 


AGCGAUAA GCcgaaagGCGaGuCaaGGuCu CAGGACAA | 

AUAAAACG GCcgaaagGCGaGuCaaGGuCu CGCAGACA 


UGAUAAAA GCcgaaagGCGaGuCaaGGuCu GCCGCAGA 
CAGAAGAA GCcgaaagGCGaGuCaaGGuCu CAACAAGA 


GCAACAUA GCcgaaagGCGaGuCaaGGuCu CUUGAUAG 


AGGACAAA GCcgaaagGCGaGuCaaGGuCu GGGCAACA | 


GUCCGGUG GCcgaaagGCGaGuCaaGGuCu UGGUUGUU | 


UGCGAAAG GCcgaaagGCGaGuCaaGGuCu CCAAGAUG J 


GAGGCCCA GCcgaaagGCGaGuCaaGGuCu UCCCAUAG 


GACUGAGG GCcgaaagGCGaGuCaaGGuCu CCACUCCC | 


GAAACGGA GCcgaaagGCGaGuCaaGGuCu UGAGGCCC 


























































































































AUCCCAAU G UUACUAAA 1 
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AUCUUUCU G UCCCCAAU 1 


GGGGGACU G UUGGGGUG I 


UCACAACU G UGCCAGCA | 


AUCCCAGA G UCAGGGCC | 


GAGUCAGG G CCCUGUAC | 


UCCUGCUG G UGGCUCCA 
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UGGCUCCA G UUCAGGAA 


CAGGAACA G UGAGCCCU 
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! GCCAUAUC G UCAAUCUU 


CCCUGCUC G UGUUACAG 


O 

i 


CAGGCGGG G UUUUUCUU 


ACCACAGA G UCUAGACU 


CUAGACUC G UGGUGGAC 


GACUCGUG G UGGACUUC S 


GAACACCC G UGUGUCUU j 
GUGUCUUG G CCAAAAUU 


AAUUCGCA G UCCCAAAU 
AAUCUCCA G UCACUCAC 


| UUGUCCUG G UUAUCGCU 1 
UGUCUGCG G CGUUUUAU j 


1 UCUGCGGC G UUUUAUCA 
UCUUGUUG G UUCUUCUG 


u 
1 
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UGUUGCCC G UUUGUCCU I 
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CUAUGGGA G UGGGCCUC 


GGGAGUGG G CCUCAGUC 


| GGGCCUCA G UCCGUUUC 1 
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4438 1 


4439 1 
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4453 










4461 








4465 | 




CUUGGAGG GCcgaaagGCGaGuCaaGGuCu UUGAACAG ( 


AGGCACAG GCcgaaagGCGaGuCaaGGuCu UUGGAGGC j 


CAAAGCCA GCcgaaagGCGaGuCaaGGuCu CCAAGGCA | 


CCCCAAAG GCcgaaagGCGaGuCaaGGuCu CACCCAAG 


UGUCCAUG GCcgaaagGCGaGuCaaGGuCu CCCAAAGC 


UCUUUAUA GCcgaaagGCGaGuCaaGGuCu GGGUCAAU | 


CACAGAAG GCcgaaagGCGaGuCaaGGuCu UCCAAAUU 1 


GAGAGUAA GCcgaaagGCGaGuCaaGGuCu UCCACAGA 


CUCUAAGG GCcgaaagGCGaGuCaaGGuCu CCCCCGAU 1 


UCCGGAGA GCcgaaagGCGaGuCaaGGuCu UCUAAGGC 


CCUGAGUG GCcgaaagGCGaGuCaaGGuCu CGUAUGGU 


AUAGCUUG GCcgaaagGCGaGuCaaGGuCu CUGAGUGC 


CAGAAUAG GCcgaaagGCGaGuCaaGGuCu UUGCCUGA 


UCAACUCA GCcgaaagGCGaGuCaaGGuCu CCCAACAC 


UUCAUCAA GCcgaaagGCGaGuCaaGGuCu UCACCCCA 


CCAGGUGG GCcgaaagGCGaGuCaaGGuCu UAGAUUCA 


ACUUCCCA GCcgaaagGCGaGuCaaGGuCu CCAGGUGG 


CCAAAUUA GCcgaaagGCGaGuCaaGGuCu UUCCCACC 


CCUGGAUG GCcgaaagGCGaGuCaaGGuCu UGGAUCUU 


GCUGACUA GCcgaaagGCGaGuCaaGGuCu UAAUUCCC 


AUAGCUGA GCcgaaagGCGaGuCaaGGuCu UACUAAUU 


UGACAUAG GCcgaaagGCGaGuCaaGGuCu UGACUACU 


CAUAUUAA GCcgaaagGCGaGuCaaGGuCu GUUGACAU 


AUGUGAAA GCcgaaagGCGaGuCaaGGuCu CACAAUAG 


AAAAGACA GCcgaaagGCGaGuCaaGGuCu CAAAUAUU 


GGGACCUG GCcgaaagGCGaGuCaaGGuCu CUCUUCGU 


CUAGGGGA GCcgaaagGCGaGuCaaGGuCu CUGCCUCU 


GAUUGAGA GCcgaaagGCGaGuCaaGGuCu CUUCGUCU 


UUCUGCGA GCcgaaagGCGaGuCaaGGuCu GCGGCGAU 


AAGGAAUA GCcgaaagGCGaGuCaaGGuCu UAACAUUG 


GUUUCCCA GCcgaaagGCGaGuCaaGGuCu CUUAUGUG 
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CUGUUCAA G CCUCCAAG 


GCCUCCAA G CUGUGCCU 


UGCCUUGG G UGGCUUUG 


CUUGGGUG G CUUUGGGG 


GCUUUGGG G CAUGGACA 


AUUGACCC G UAUAAAGA 


AAUUUGGA G CUUCUGUG 


UCUGUGGA G UUACUCUC 
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GCCUUAGA G UCUCCGGA 


ACCAUACG G CACUCAGG 


GCACUCAG G CAAGCUAU 


UCAGGCAA G CUAUUCUG 
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UGGGGUGA G UUGAUGAA 


UGAAUCUA G CCACCUGG 


CCACCUGG G UGGGAAGU 


GGUGGGAA G UAAUUUGG 


AAGAUCCA G CAUCCAGG 


GGGAAUUA G UAGUCAGC 


AAUUAGUA G UCAGCUAU 


AGUAGUCA G CUAUGUCA 


AUGUCAAC G UUAAUAUG 
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ACGAAGAG G CAGGUCCC 


AGAGGCAG G UCCCCUAG 


AGACGAAG G UCUCAAUC 


AUCGCCGC G UCGCAGAA 


CAAUGUUA G UAUUCCUU 


CACAUAAG G UGGGAAAC 
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Table 41 

Table 41: Human HBV DNAzyme and Substrate Sequence 



631 
687 
795 
798 



1068 
1072 
1103 
1155 
1177 
1188 
1194 



1262 
1275 
1290 
1299 
1303 
1349 
1357 



CAACCAGC A CCGGACCA 
GAACGCCC A CAGGAACC 
CAACCCGC A CAAGGACA 
ACUUUCCU G CUGGUGGC 
UGAGCCCU G CUCAGAAU 
CUGUCUCU G CCAUAUCG 
AGAACAUC G CAUCAGGA 
GGACCCCU G CUCGUGUU 
CAAAAUUC G CAGUCCCA 
UGGUUAUC G CUGGAUGU 
AUGUGUCU G CGGCGUUU 
CUUCCUCU G CAUCCUGC 
UGCAUCCU G CUGCUAUG 
AUCCUGCU G CUAUGCCU 
GCUGCUAU G CCUCAUCU 
GGUAUGUU G CCCGUUUG 
CGGACCAU G CAAAACCU 
CAAAACCU G CACAACUC 
CAACUCCU G CUCAAGGA 
CUCAUGUU G CUGUACAA 
CGGAAACU G CACCUGUA 
GGGCUUUC G CAAAAUAC 
UUACUAGU G CCAUUUGU 
CCCUUUAU G CCGCUGUU 
UUUAUGCC G CUGUUACC 
GGCACAUU G CCACAGGA 
UGGGGUUU G CCGCCCCU 
GGUUUGCC G CCCCUUUC 
CCUUUCAC G CAAUGUGG 
GAUAUUCU G CUUUAAUG 
GCUUUAAU G CCUUUAUA 
CUUUAUAU G CAUGCAUA 
AUAUGCAU G CAUACAAG 
ACUUUCUC G CCAACUUA 
ACCCCGUU G CUCGGCAA 
UGGUCUAU G CCAAGUGU 
AAGUGUUU G CUGACGCA 
UUGCUGAC G CAACCCCC 
CCAUCAGC G CAUGCGUG 
CAGCGCAU G CGUGGAAC 
UCUCCUCU G CCGAUCCA 
UCCAUACC G CGGAACUC 
UCCUAGCC G CUUGUUUU 
CUUGUUUU G CUCGCAGC 
UUUUGCUC G CAGCAGGU 
UCUGUCGU G CUCUCCCG 
GCUCUCCC G CAAAUAUA 



TGGTCCGG GGCTAGCTACAACGA GCTGGTTG 
GGTTCCTG GGCTAGCTACAACGA GGGCGTTC 
TGTCCTTG GGCTAGCTACAACGA GCGGGTTG 
GCCACCAG GGCTAGCTACAACGA AGGAAAGT 
ATTCTGAG GGCTAGCTACAACGA AGGGCTCA 
CGATATGG GGCTAGCTACAACGA AGAGACAG 
TCCTGATG GGCTAGCTACAACGA GATGTTCT 
AACACGAG GGCTAGCTACAACGA AGGGGTCC 
TGGGACTG GGCTAGCTACAACGA GAATTTTG 
ACATCCAG GGCTAGCTACAACGA GATAACCA 
AAACGCCG GGCTAGCTACAACGA AGACACAT 
GCAGGATG GGCTAGCTACAACGA AGAGGAAG 
CATAGCAG GGCTAGCTACAACGA AGGATGCA 
AGGCATAG GGCTAGCTACAACGA AGCAGGAT 
AGATGAGG GGCTAGCTACAACGA ATAGCAGC 
CAAACGGG GGCTAGCTACAACGA AACATACC 
AGGTTTTG GGCTAGCTACAACGA ATGGTCCG 
GAGTTGTG GGCTAGCTACAACGA AGGTTTTG 
TCCTTGAG GGCTAGCTACAACGA AGGAGTTG 
TTGTACAG GGCTAGCTACAACGA AACATGAG 
TACAGGTG GGCTAGCTACAACGA AGTTTCCG 
GTATTTTG GGCTAGCTACAACGA GAAAGCCC 
ACAAATGG GGCTAGCTACAACGA ACTAGTAA 
AACAGCGG GGCTAGCTACAACGA ATAAAGGG 
GGTAACAG GGCTAGCTACAACGA GGCATAAA 
TCCTGTGG GGCTAGCTACAACGA AATGTGCC 
AGGGGCGG GGCTAGCTACAACGA AAACCCCA 
GAAAGGGG GGCTAGCTACAACGA GGCAAACC 
CCACATTG GGCTAGCTACAACGA GTGAAAGG 
CATTAAAG GGCTAGCTACAACGA AGAATATC 
TATAAAGG GGCTAGCTACAACGA ATTAAAGC 
TATGCATG GGCTAGCTACAACGA ATATAAAG 
CTTGTATG GGCTAGCTACAACGA ATGCATAT 
TAAGTTGG GGCTAGCTACAACGA GAGAAAGT 
TTGCCGAG GGCTAGCTACAACGA AACGGGGT 
ACACTTGG GGCTAGCTACAACGA ATAGACCA 
TGCGTCAG GGCTAGCTACAACGA AAACACTT 
GGGGGTTG GGCTAGCTACAACGA GTCAGCAA 
CACGCATG GGCTAGCTACAACGA GCTGATGG 
GTTCCACG GGCTAGCTACAACGA ATGCGCTG 
TGGATCGG GGCTAGCTACAACGA AGAGGAGA 
GAGTTCCG GGCTAGCTACAACGA GGTATGGA 
AAAACAAG GGCTAGCTACAACGA GGCTAGGA 
GCTGCGAG GGCTAGCTACAACGA AAAACAAG 
ACCTGCTG GGCTAGCTACAACGA GAGCAAAA 
CGGGAGAG GGCTAGCTACAACGA ACGACAGA 
TATATTTG GGCTAGCTACAACGA GGGAGAGC 



WO 01/16312 



PCT/US00/23998 



CCAUGGCU G CUAGGCUG 



CAGCCTAG GGCTAGC TACAACGA AGCCATGG 



UAGGCUGU G CUGCCAAC 



GTTGGCAG GGCTAGCTACAACGA ACAGCCTA 



GCUGUGCU G CCAACUGG 



CCAGTTGG GGCTAGCTACAACGA AGCACAGC 



GAUCCUAC G CGGGACGU 



ACGTCCCG GGCTAGCTACAACGA GTAGGATC 



CCGUCGGC G CUGAAUCC 



GGATTCAG GGCTAGCTACAACGA GCCGACGG 



UGAAUCCC G CGGACGAC 



GTCGTCCG GGCTAGCTACAACGA GGGATTCA 



CCGGGGCC G CUUGGGGC 



GCCCCAAG GGCTAGCTACAACGA GGCCCCGG 



GCUCUACC G CCCGCUUC 



GAAGCGGG GGCTAGCTACAACGA GGTAGAGC 



UACCGCCC G CUUCUCCG 



CGGAGAAG GGCTAGCTACAACGA GGGCGGTA 



GCUUCUCC G CCUAUUGU 



ACAATAGG GGCTAGCTACAACGA GGAGAAGC 



CACGGGGC G CACCUCUC 



GAGAGGTG GGCTAGCTACAACGA GCCCCGTG 



CUCUUUAC G CGGACUCC 



GGAGTCCG GGCTAGCTACAACGA GTAAAGAG 



CCGUCUGU G CCUUCUCA 



TGAGAAGG GGCTAGCTACAACGA ACAGACGG 



UCUCAUCU G CCGGACCG 



CGGTCCGG GGCTAGCTACAACGA AGATGAGA 



GACCGUGU G CACUUCGC 



GCGAAGTG GGCTAGCTACAACGA ACACGGTC 



UGCACUUC G CUUCACCU 



AGGTGAAG GGCTAGCTACAACGA GAAGTGCA 



UCACCUCU G CACGUCGC 



GCGACGTG GGCTAGCTACAACGA AGAGGTGA 



UGCACGUC G CAUGGAGA 



TCTCCATG GGCTAGCTACAACGA GACGTGCA 



CCGUGAAC G CCCACAGG 



CCTGTGGG GGCTAGCTACAACGA GTTCACGG 



AGGAACCU G CCCAAGGU 



ACCTTGGG GGCTAGCTACAACGA AGGTTCCT 



AAGGUCUU G CAUAAGAG 



CTCTTATG GGCTAGCTACAACGA AAGACCTT 



AGCACCAU G CAACUUUU 



AAAAGTTG GGCTAGCTACAACGA ATGGTGCT 



UCACCUCU G CCUAAUCA 



TGATTAGG GGCTAGCTACAACGA AGAGGTGA 



CAAGCUGU G CCUUGGGU 



ACCCAAGG GGCTAGCTACAACGA ACAGCTTG 



UCUUUUUU G CCUUCUGA 



TCAGAAGG GGCTAGCTACAACGA AAAAAAGA 



UCGACACC G CCUCUGCU 



AGCAGAGG GGCTAGCTACAACGA GGTGTCGA 



CCGCCUCU G CUCUGUAU 



ATACAGAG GGCTAGCTACAACGA AGAGGCGG 



GUGGAUUC G CACUCCUC 



GAGGAGTG GGCTAGCTACAACGA GAATCCAC 



4579 
4580 
4581 



CUCCUCCU G CAUAUAGA 



TCTATATG GGCTAGCTACAACGA AGGAGGAG 



CACCAAAU G CCCCUAUC 



GATAGGGG GGCTAGCTACAACGA ATTTGGTG 



ACUCCCUC G CCUCGCAG 



CTGCGAGG GGCTAGCTACAACGA GAGGGAGT 



CUCGCCUC G CAGACGAA 



TTCGTCTG GGCTAGCTACAACGA GAGGCGAG 



UCUCAAUC G CCGCGUCG 



CGACGCGG GGCTAGCTACAACGA GATTGAGA 



CAAUCGCC G CGUCGCAG 



CTGCGACG GGCTAGCTACAACGA GGCGATTG 



4585 
4586 



GCCGCGUC G CAGAAGAU 



ATCTTCTG GGCTAGCTACAACGA GACGCGGC 



GGUACCUU G CUUUAAUC 



GATTAAAG GGCTAGCTACAACGA AAGGTACC 



AUUCAUUU G CAGGAGGA 



TCCTCCTG GGCTAGCTACAACGA AAATGAAT 



4588 
4589 
4590 
4591 



264 



UUAACUAU G CCUGCUAG 



CTAGCAGG GGCTAGCTACAACGA ATAGTTAA 



CUAUGCCU G CUAGGUUU 



AAACCTAG GGCTAGCTACAACGA AGGCATAG 



AAAUAUUU G CCCUUAGA 



TCTAAGGG GGCTAGCTACAACGA AAATATTT 



UUCCAGAC G CGACAUUA 



TAATGTCG GGCTAGCTACAACGA GTCTGGAA 



CACGUAGC G CCUCAUUU 



AAATGAGG GGCTAGCTACAACGA GCTACGTG 



CUCAUUUU G CGGGUCAC 



GTGACCCG GGCTAGCTACAACGA AAAATGAG 



UGGACCCU G CAUUCAAA 



TTTGAATG GGCTAGCTACAACGA AGGGTCCA 



CUCAACCC G CACAAGGA 



TCCTTGTG GGCTAGCTACAACGA GGGTTGAG 



GGCCGGAC G CCAACAAG 



CTTGTTGG GGCTAGCTACAACGA GTCCGGCC 



GCCCUCAC G CUCAGGGC 



GCCCTGAG GGCTAGCTACAACGA GTGAGGGC 



ACAACUGU G CCAGCAGC 



GCTGCTGG GGCTAGCTACAACGA ACAGTTGT 



CUCCUCCU G CCUCCACC 



GGTGGAGG GGCTAGCTACAACGA AGGAGGAG 



51 



AGGGCCCU G UACUUUCC 



GGAAAGTA GGCTAGCTACAACGA AGGGCCCT 



AGAAUACU G UCUCUGCC 



GGCAGAGA GGCTAGCTACAACGA AGTATTCT 
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GGGACCCU G UACCGAAC 



GTTCGGTA GGCTAGCTACAACGA AGGGTCCC 



CUGCUCGU G UUACAGGC 



GCCTGTAA GGCTAGCTACAACGA ACGAGCAG 



UUUUUCUU G UUGACAAA 



TTTGTCAA GGCTAGCTACAACGA AAGAAAAA 



ACACCCGU G UGUCUUGG 



CCAAGACA GGCTAGCTACAACGA ACGGGTGT 



ACCCGUGU G UCUUGGCC 



GGCCAAGA GGCTAGCTACAACGA ACACGGGT 



ACCAACCU G UUGUCCUC 



GAGGACAA GGCTAGCTACAACGA AGGTTGGT 



AACCUGUU G UCCUCCAA 



TTGGAGGA GGCTAGCTACAACGA AACAGGTT 



UCCAAUUU G UCCUGGUU 



AACCAGGA GGCTAGCTACAACGA AAATTGGA 



CGCUGGAU G UGUCUGCG 



CGCAGACA GGCTAGCTACAACGA ATCCAGCG 



CUGGAUGU G UCUGCGGC 



GCCGCAGA GGCTAGCTACAACGA ACATCCAG 



AUCUUCUU G UUGGUUCU 



AGAACCAA GGCTAGCTACAACGA AAGAAGAT 



CAAGGUAU G UUGCCCGU 



ACGGGCAA GGCTAGCTACAACGA ATACCTTG 



GCCCGUUU G UCCUCUAA 



TTAGAGGA GGCTAGCTACAACGA AAACGGGC 



ACCUCUAU G UUUCCCUC 



GAGGGAAA GGCTAGCTACAACGA ATAGAGGT 



UCCCUCAU G UUGCUGUA 



TACAGCAA GGCTAGCTACAACGA ATGAGGGA 



AUGUUGCU G UACAAAAC 



GTTTTGTA GGCTAGCTACAACGA AGCAACAT 



CUGCACCU G UAUUCCCA 



TGGGAATA GGCTAGCTACAACGA AGGTGCAG 



UGCCAUUU G UUCAGUGG 



CCACTGAA GGCTAGCTACAACGA AAATGGCA 



CCCCCACU G UCUGGCUU 



AAGCCAGA GGCTAGCTACAACGA AGTGGGGG 



UGGAUGAU G UGGUUUUG 



CAAAACCA GGCTAGCTACAACGA ATCATCCA 



CCAAGUCU G UACAACAU 



ATGTTGTA GGCTAGCTACAACGA AGACTTGG 



AUGCCGCU G UUACCAAU 



ATTGGTAA GGCTAGCTACAACGA AGCGGCAT 



UUUCUUUU G UCUUUGGG 



CCCAAAGA GGCTAGCTACAACGA AAAAGAAA 



UGGGAUAU G UAAUUGGG 



CCCAATTA GGCTAGCTACAACGA ATATCCCA 



AACAUAUU G UACAAAAA 



TTTTTGTA GGCTAGCTACAACGA AATATGTT 



AUCAAAAU G UGUUUUAG 



CTAAAACA GGCTAGCTACAACGA ATTTTGAT 



CAAAAUGU G UUUUAGGA 



TCCTAAAA GGCTAGCTACAACGA ACATTTTG 



AACUUCCU G UAAACAGG 



CCTGTTTA GGCTAGCTACAACGA AGGAAGTT 



GAAAGUAU G UCAACGAA 



TTCGTTGA GGCTAGCTACAACGA ATACTTTC 



AACGAAUU G UGGGUCUU 



AAGACCCA GGCTAGCTACAACGA AATTCGTT 



CACGCAAU G UGGAUAUU 



AATATCCA GGCTAGCTACAACGA ATTGCGTG 



! UGAACCUU 



AAGGTTCA GGCTAGCTACAACGA ATACTGTT 



UGCCAAGU G UUUGCUGA 



TCAGCAAA GGCTAGCTACAACGA ACTTGGCA 



GAACCUUU G UGUCUCCU 



AGGAGACA GGCTAGCTACAACGA AAAGGTTC 



ACCUUUGU G UCUCCUCU 



AGAGGAGA GGCTAGCTACAACGA ACAAAGGT 



AGCCGCUU G UUUUGCUC 



GAGCAAAA GGCTAGCTACAACGA AAGCGGCT 



ACAAUUCU G UCGUGCUC 



GAGCACGA GGCTAGCTACAACGA AGAATTGT 



TGGCAGCA GGCTAGCTACAACGA AGCCTAGC 



CGUCCUUU G UUUACGUC 



GACGTAAA GGCTAGCTACAACGA AAAGGACG 



CGCCUAUU G UACCGACC 



GGTCGGTA GGCTAGCTACAACGA AATAGGCG 



CCCCGUCU G UGCCUUCU 



AGAAGGCA GGCTAGCTACAACGA AGACGGGG 



CGGACCGU G UGCACUUC 



GAAGTGCA GGCTAGCTACAACGA ACGGTCCG 



UCAGCAAU G UCAACGAC 



GTCGTTGA GGCTAGCTACAACGA ATTGCTGA 



CAAAGACU G UGUGUUUA 



TAAACACA GGCTAGCTACAACGA AGTCTTTG 



AAGACUGU G UGUUUAAU 



ATTAAACA GGCTAGCTACAACGA ACAGTCTT 



GACUGUGU G UUUAAUGA 



TCATTAAA GGCTAGCTACAACGA ACACAGTC 



AGGUCUUU G UACUAGGA 



TCCTAGTA GGCTAGCTACAACGA AAAGACCT 



AGGAGGCU G UAGGCAUA 



TATGCCTA GGCTAGCTACAACGA AGCCTCCT 



AAAUUGGU G UGUUCACC 



GGTGAACA GGCTAGCTACAACGA ACCAATTT 



AUUGGUGU G UUCACCAG 



CTGGTGAA GGCTAGCTACAACGA ACACCAAT 



CAUCUCAU G UUCAUGUC 



GACATGAA GGCTAGCTACAACGA ATGAGATG 
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AUGUUCAU G UCCUACUG 



CAGTAGGA GGCT AGC T ACAACGA ATGAACAT 



GUCCUACU G UUCAAGCC 



GGCTTGAA GGCT AGC TACAACGA AGTAGGAC 



UCCAAGCU G UGCCUUGG 



CCAAGGCA GGCTAGC TACAACGA AGCTTGGA 



GAGCUUCU G UGGAGUUA 



TAACTCCA GGCTAGC TACAACGA AGAAGCTC 



UCUGCUCU G UAUCGGGG 



CCCCGATA GGCTAGCTACAACGA AGAGCAGA 



GGAACAUU G UUCACCUC 



GAGGTGAA GGCTAGCTACAACGA AATGTTCC 



GCUAUUCU G UGUUGGGG 



CCCCAACA GGCTAGCTACAACGA AGAATAGC 



UAUUCUGU G UUGGGGUG 



CACCCCAA GGCTAGCTACAACGA ACAGAATA 



UCAGCUAU G UCAACGUU 



AACGTTGA GGCTAGCTACAACGA ATAGCTGA 



CAACUAUU G UGGUUUCA 



TGAAACCA GGCTAGCTACAACGA AATAGTTG 



CAUUUCCU G UCUUACUU 



AAGTAAGA GGCTAGCTACAACGA AGGAAATG 



GAGAAACU G UUCUUGAA 



TTCAAGAA GGCTAGCTACAACGA AGTTTCTC 



UAUUUGGU G UCUUUUGG 



CCAAAAGA GGCTAGCTACAACGA ACCAAATA 



UUUGGAGU G UGGAUUCG 



CGAATCCA GGCTAGCTACAACGA ACTCCAAA 



AAACUACU G UUGUUAGA 



TCTAACAA GGCTAGCTACAACGA AGTAGTTT 



CUACUGUU G UUAGACGA 



TCGTCTAA GGCTAGCTACAACGA AACAGTAG 



AUCUCAAU G UUAGUAUU 



AATACTAA GGCTAGCTACAACGA ATTGAGAT 



AGGACAUU G UUGAUAGA 



TCTATCAA GGCTAGCTACAACGA AATGTCCT 



UGAUAGAU G UAAGCAAU 



ATTGCTTA GGCTAGCTACAACGA ATCTATCA 



AGCAAUUU G UGGGGCCC 



GGGCCCCA GGCTAGCTACAACGA AAATTGCT 



AUCCCAAU G UUACUAAA 



TTTAGTAA GGCTAGCTACAACGA ATTGGGAT 



CAGAGUAU G UAGUUAAU 



ATTAACTA GGCTAGCTACAACGA ATACTCTG 



AUCUUUCU G UCCCCAAU 



ATTGGGGA GGCTAGCTACAACGA AGAAAGAT 



GGGGGACU G UUGGGGUG 



CACCCCAA GGCTAGCTACAACGA AGTCCCCC 



UCACAACU G UGCCAGCA 



TGCTGGCA GGCTAGCTACAACGA AGTTGTGA 



AUCCCAGA G UCAGGGCC 



GGCCCTGA GGCTAGCTACAACGA TCTGGGAT 



GAGUCAGG G CCCUGUAC 



GTACAGGG GGCTAGCTACAACGA CCTGACTC 



UCCUGCUG G UGGCUCCA 



TGGAGCCA GGCTAGCTACAACGA CAGCAGGA 



UGCUGGUG G CUCCAGUU 



AACTGGAG GGCTAGCTACAACGA CACCAGCA 



UGGCUCCA G UUCAGGAA 



TTCCTGAA GGCTAGCTACAACGA TGGAGCCA 



CAGGAACA G UGAGCCCU 



AGGGCTCA GGCTAGCTACAACGA TGTTCCTG 



AACAGUGA G CCCUGCUC 



GAGCAGGG GGCTAGCTACAACGA TCACTGTT 



GCCAUAUC G UCAAUCUU 



AAGATTGA GGCTAGCTACAACGA GATATGGC 



CCCUGCUC G UGUUACAG 



CTGTAACA GGCTAGCTACAACGA GAGCAGGG 



UGUUACAG G CGGGGUUU 



AAACCCCG GGCTAGCTACAACGA CTGTAACA 



CAGGCGGG G UUUUUCUU 



AAGAAAAA GGCTAGCTACAACGA CCCGCCTG 



ACCACAGA G UCUAGACU 



AGTCTAGA GGCTAGCTACAACGA TCTGTGGT 



CUAGACUC G UGGUGGAC 



GTCCACCA GGCTAGCTACAACGA GAGTCTAG 



GACUCGUG G UGGACUUC 



GAAGTCCA GGCTAGCTACAACGA CACGAGTC 



GAACACCC G UGUGUCUU 



AAGACACA GGCTAGCTACAACGA GGGTGTTC 



GUGUCUUG G CCAAAAUU 



AATTTTGG GGCTAGCTACAACGA CAAGACAC 



AAUUCGCA G UCCCAAAU 



ATTTGGGA GGCTAGCTACAACGA TGCGAATT 



AAUCUCCA G UCACUCAC 



GTGAGTGA GGCTAGCTACAACGA TGGAGATT 



UUGUCCUG G UUAUCGCU 



AGCGATAA GGCTAGCTACAACGA CAGGACAA 



UGUCUGCG G CGUUUUAU 



ATAAAACG GGCTAGCTACAACGA CGCAGACA 



UCUGCGGC G UUUUAUCA 



TGATAAAA GGCTAGCTACAACGA GCCGCAGA 



UCUUGUUG G UUCUUCUG 



CAGAAGAA GGCTAGCTACAACGA CAACAAGA 



CUAUCAAG G UAUGUUGC 



GCAACATA GGCTAGCTACAACGA CTTGATAG 



UGUUGCCC G UUUGUCCU 



AGGACAAA GGCTAGCTACAACGA GGGCAACA 



AACAACCA G CACCGGAC 



GTCCGGTG GGCTAGCTACAACGA TGGTTGTT 



CAUCUUGG G CUUUCGCA 



TGCGAAAG GGCTAGCTACAACGA CCAAGATG 
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CUAUGGGA G UGGGCCUC 



GAGGCCCA GGCTAGCTACAACGA TCCCATAG 



GGGAGUGG G CCUCAGUC 



GACTGAGG GGCTAGCTACAACGA CCACTCCC 



GGGCCUCA G UCCGOUUC 



GAAACGGA GGCTAGCTACAACGA TGAGGCCC 



CUCAGUCC G UUUCUCUU 



AAGAGAAA GGCTAGCTACAACGA GGACTGAG 



UUCUCUUG G CUCAGUUU 



AAACTGAG GGCTAGCTACAACGA CAAGAGAA 



UUGGCUCA G UUUACUAG 



CTAGTAAA GGCTAGCTACAACGA TGAGCCAA 



GUUUACUA G UGCCAUUU 



AAATGGCA GGCTAGCTACAACGA TAGTAAAC 



UUUGUUCA G UGGUUCGU 



ACGAACCA GGCTAGCTACAACGA TGAACAAA 



GUUCAGUG G UUCGUAGG 



CCTACGAA GGCTAGCTACAACGA CACTGAAC 



AGUGGUUC G UAGGGCUU 



AAGCCCTA GGCTAGCTACAACGA GAACCACT 



UUCGUAGG G CUUUCCCC 



GGGGAAAG GGCTAGCTACAACGA CCTACGAA 



ACUGUCUG G CUUUCAGU 



ACTGAAAG GGCTAGCTACAACGA CAGACAGT 



GGCUUUCA G UUAUAUGG 



CCATATAA GGCTAGCTACAACGA TGAAAGCC 



AUGAUGUG G UUUUGGGG 



CCCCAAAA GGCTAGCTACAACGA CACATCAT 



UUUUGGGG G CCAAGUCU 



AGACTTGG GGCTAGCTACAACGA CCCCAAAA 



GGGGCCAA G UCUGUACA 



TGTACAGA GGCTAGCTACAACGA TTGGCCCC 



CAUCUUGA G UCCCUUUA 



TAAAGGGA GGCTAGCTACAACGA TCAAGATG 



GUCUUUGG G UAUACAUU 



AATGTATA GGCTAGCTACAACGA CCAAAGAC 



AAUUGGGA G UUGGGGCA 



TGCCCCAA GGCTAGCTACAACGA TCCCAATT 



GAGUUGGG G CACAUUGC 



GCAATGTG GGCTAGCTACAACGA CCCAACTC 



GUAAACAG G CCUAUUGA 



TCAATAGG GGCTAGCTACAACGA CTGTTTAC 



AUUGGAAA G UAUGUCAA 



TTGACATA GGCTAGCTACAACGA TTTCCAAT 



AAUUGUGG G UCUUUUGG 



CCAAAAGA GGCTAGCTACAACGA CCACAATT 



CUUUUGGG G UUUGCCGC 



GCGGCAAA GGCTAGCTACAACGA CCCAAAAG 



GCAUACAA G CAAAACAG 



CTGTTTTG GGCTAGCTACAACGA TTGTATGC 



CAAAACAG G CUUUUACU 



AGTAAAAG GGCTAGCTACAACGA CTGTTTTG 



CUUACAAG G CCUUUCUA 



TAGAAAGG GGCTAGCTACAACGA CTTGTAAG 



CUUUCUAA G UAAACAGU 



ACTGTTTA GGCTAGCTACAACGA TTAGAAAG 



AGUAAACA G UAUGUGAA 



TTCACATA GGCTAGCTACAACGA TGTTTACT 



UUUACCCC G UUGCUCGG 



CCGAGCAA GGCTAGCTACAACGA GGGGTAAA 



GGCCGTTG GGCTAGCTACAACGA CGAGCAAC 



CGGCAACG G CCUGGUCU 



AGACCAGG GGCTAGCTACAACGA CGTTGCCG 



ACGGCCUG G UCUAUGCC 



GGCATAGA GGCTAGCTACAACGA CAGGCCGT 



UAUGCCAA G UGUUUGCU 



AGCAAACA GGCTAGCTACAACGA TTGGCATA 



CCCCACUG G UUGGGGCU 



AGCCCCAA GGCTAGCTACAACGA CAGTGGGG 



; CUUGGCCA 



TGGCCAAG GGCTAGCTACAACGA CCCAACCA 



GGGGCUUG G CCAUAGGC 



GCCTATGG GGCTAGCTACAACGA CAAGCCCC 



GGCCAUAG G CCAUCAGC 



GCTGATGG GGCTAGCTACAACGA CTATGGCC 



GGCCAUCA G CGCAUGCG 



CGCATGCG GGCTAGCTACAACGA TGATGGCC 



GCGCAUGC G UGGAACCU 



AGGTTCCA GGCTAGCTACAACGA GCATGCGC 



AACUCCUA G CCGCUUGU 



ACAAGCGG GGCTAGCTACAACGA TAGGAGTT 



UGCUCGCA G CAGGUCUG 



CAGACCTG GGCTAGCTACAACGA TGCGAGCA 



CGCAGCAG G UCUGGGGC 



GCCCCAGA GGCTAGCTACAACGA CTGCTGCG 



GGUCUGGG G CAAAACUC 



GAGTTTTG GGCTAGCTACAACGA CCCAGACC 



AUUCUGUC G UGCUCUCC 



GGAGAGCA GGCTAGCTACAACGA GACAGAAT 



UUUCCAUG G CUGCUAGG 



CCTAGCAG GGCTAGCTACAACGA CATGGAAA 



GCUGCUAG G CUGUGCUG 



CAGCACAG GGCTAGCTACAACGA CTAGCAGC 



CGCGGGAC G UCCUUUGU 



ACAAAGGA GGCTAGCTACAACGA GTCCCGCG 



UUGUUUAC G UCCCGUCG 



CGACGGGA GGCTAGCTACAACGA GTAAACAA 



UACGUCCC G UCGGCGCU 



AGCGCCGA GGCTAGCTACAACGA GGGACGTA 



UCCCGUCG G CGCUGAAU 



ATTCAGCG GGCTAGCTACAACGA CGACGGGA 
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2162 
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CUCCCGGG G CCGCUUGG 
CGCUUGGG G CUCUACCG 
UACCGACC G UCCACGGG 
UCCACGGG G CGCACCUC 
GACUCCCC G UCUGUGCC 
GCCGGACC G UGUGCACU 
CUCUGCAC G UCGCAUGG 
AGACCACC G UGAACGCC 
UGCCCAAG G UCUUGCAU 
GACUUUCA G CAAUGUCA 
ACCUUGAG G CAUACUUC 
UUUAAUGA G UGGGAGGA 
UGGGAGGA G UUGGGGGA 
GGGAGGAG G UUAGGUUA 
GAGGUUAG G UUAAAGGU 
GGUUAAAG G UCUUUGUA 
ACUAGGAG G CUGUAGGC 
GGCUGUAG G CAUAAAUU 
AUAAAUUG G UGUGUUCA 
GUUCACCA G CACCAUGC 
CUGUUCAA G CCUCCAAG 
GCCUCCAA G CUGUGCCU 
UGCCUUGG G UGGCUUUG 
CUUGGGUG G CUUUGGGG 
GCUUUGGG G CAUGGACA 
AUUGACCC G UAUAAAGA 



5 CUUCUGUG 
UCUGUGGA G UUACUCUC 
AUCGGGGG G CCUUAGAG 
GCCUUAGA G UCUCCGGA 
ACCAUACG G CACUCAGG 



GUGUUGGG G UGAGUUGA 
UGGGGUGA G UUGAUGAA 
UGAAUCUA G CCACCUGG 
CCACCUGG G UGGGAAGU 
GGUGGGAA G UAAUUUGG 
AAGAUCCA G CAUCCAGG 
GGGAAUUA G UAGUCAGC 
AAUUAGUA G UCAGCUAU 
AGUAGUCA G CUAUGUCA 
AUGUCAAC G UUAAUAUG 
UAAUAUGG G CCUAAAAA 
CUAUUGUG G UUUCACAU 
ACUUUUGG G CGAGAAAC 
AAUAUUUG G UGUCUUUU 
CUUUUGGA G UGUGGAUU 
ACGAAGAG G CAGGUCCC 
AGAGGCAG G UCCCCUAG 
AGACGAAG G UCUCAAUC 



CCAAGCGG GGCTAGCTACAACGA CCCGGGAG 
CGGTAGAG GGC TAGCT ACAACGA CCCAAGCG 
CCCGTGGA GGCTAGCTACAACGA GGTCGGTA 
GAGGTGCG GGCTAGCTACAACGA CCCGTGGA 
GGCACAGA GGCTAGCTACAACGA GGGGAGTC 
AGTGCACA GGCTAGCTACAACGA GGTCCGGC 
CCATGCGA GGCTAGCTACAACGA GTGCAGAG 
GGCGTTCA GGCTAGCTACAACGA GGTGGTCT 
ATGCAAGA GGCTAGCTACAACGA CTTGGGCA 
TGACATTG GGCTAGCTACAACGA TGAAAGTC 
GAAGTATG GGCTAGCTACAACGA CTCAAGGT 
TCCTCCCA GGCTAGCTACAACGA TCATTAAA 
TCCCCCAA GGCTAGCTACAACGA TCCTCCCA 
TAACCTAA GGCTAGCTACAACGA CTCCTCCC 
ACCTTTAA GGCTAGCTACAACGA CTAACCTC 
TACAAAGA GGCTAGCTACAACGA CTTTAACC 
GCCTACAG GGCTAGCTACAACGA CTCCTAGT 
AATTTATG GGCTAGCTACAACGA CTACAGCC 
TGAACACA GGCTAGCTACAACGA CAATTTAT 
GCATGGTG GGCTAGCTACAACGA TGGTGAAC 
CTTGGAGG GGCTAGCTACAACGA TTGAACAG 
AGGCACAG GGCTAGCTACAACGA TTGGAGGC 
CAAAGCCA GGCTAGCTACAACGA CCAAGGCA 
CCCCAAAG GGCTAGCTACAACGA CACCCAAG 
TGTCCATG GGCTAGCTACAACGA CCCAAAGC 
TCTTTATA GGCTAGCTACAACGA GGGTCAAT 
CACAGAAG GGCTAGCTACAACGA TCCAAATT 
GAGAGTAA GGCTAGCTACAACGA TCCACAGA 
CTCTAAGG GGCTAGCTACAACGA CCCCCGAT 
TCCGGAGA GGCTAGCTACAACGA TCTAAGGC 
CCTGAGTG GGCTAGCTACAACGA CGTATGGT 
ATAGCTTG GGCTAGCTACAACGA CTGAGTGC 
CAGAATAG GGCTAGCTACAACGA TTGCCTGA 
TCAACTCA GGCTAGCTACAACGA CCCAACAC 
TTCATCAA GGCTAGCTACAACGA TCACCCCA 
CCAGGTGG GGCTAGCTACAACGA TAGATTCA 
ACTTCCCA GGCTAGCTACAACGA CCAGGTGG 
CCAAATTA GGCTAGCTACAACGA TTCCCACC 
CCTGGATG GGCTAGCTACAACGA TGGATCTT 
GCTGACTA GGCTAGCTACAACGA T AATT C CC 
ATAGCTGA GGCTAGCTACAACGA TACTAATT 
TGACATAG GGCTAGCTACAACGA TGACTACT 
CATATTAA GGCTAGCTACAACGA GTTGACAT 
TTTTTAGG GGCTAGCTACAACGA CCATATTA 
ATGTGAAA GGCTAGCTACAACGA CACAATAG 
GTTTCTCG GGCTAGCTACAACGA CCAAAAGT 
AAAAGACA GGCTAGCTACAACGA CAAATATT 
AATCCACA GGCTAGCTACAACGA TCCAAAAG 
GGGACCTG GGCTAGCTACAACGA CTCTTCGT 
CTAGGGGA GGCTAGCTACAACGA CTGCCTCT 
GATTGAGA GGCTAGCTACAACGA CTTCGTCT 
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AUCGCCGC G UCGCAGAA 
CAAUGUUA G UAUUCCUU 
CACAUAAG G UGGGAAAC 
UUUACGGG G CUUUAUUC 
CUUCUACG G UACCUUGC 
CCUAAAUG G CAAACUCC 
AGAUGUAA G CAAUUUGU 
UUUGUGGG G CCCCUUAC 
CCCUUACA G UAAAUGAA 
CCUGCUAG G UUUUAUCC 
AUCAAACC G UAUUAUCC 
UAUCCAGA G UAUGUAGU 
AGUAUGUA G UUAAUCAU 
UUUGGAAG G CGGGGAUC 
AAAAGAGA G UCCACACG 
GUCCACAC G UAGCGCCU 
CACACGUA G CGCCUCAU 
UUUUGCGG G UCACCAUA 
GAUCUACA G CAUGGGAG 
CAUGGGAG G UUGGUCUU 
GGAGGUUG G UCUUCCAA 
UCGAAAAG G CAUGGGGA 
GAUCAUCA G UUGGACCC~ 
CAUUCAAA G CCAACUCA 
CCAACUCA G UAAAUCCA 
GACAACUG G CCGGACGC 
CCAACAAG G UGGGAGUG 
AGGUGGGA G UGGGAGCA 
GAGUGGGA G CAUUCGGG 
GCAUUCGG G CCAGGGUU 
GGGCCAGG G UUCACCCC 
CUGUUGGG G UGGAGCCC 
GGGGUGGA G CCCUCACG 
CGCUCAGG G CCUACUCA 
CUGUGCCA G CAGCUCCU 
UGCCAGCA G CUCCUCCU 
ACCAAUCG G CAGUCAGG 
AAUCGGCA G UCAGGAAG 
UCAGGAAG G CAGCCUAC 
GGAAGGCA G CCUACUCC 
AUCCUCAG G CCAUGCAG 
ACUCCACC A CUUUCCAC 
CACUUUCC A CCAAACUC 
UCCACCAA A CUCUUCAA 
UCUUCAAG A UCCCAGAG 
GGCCCUGU A CUUUCCUG 
GUUCAGGA A CAGUGASC 
UGCUCAGA A UACUGUCU 
CUCAGAAU A CUGUCUCU 
UCUCUGCC A UAUCGUCA 
UCUGCCAU A UCGUCAAU 



TTCTGCGA GGCTAGCTACAACGA GCGGCGAT 
AAGGAATA GGCTAGCTACAACGA TAACATTG 
GTTTCCCA GGCTAGCTACAACGA CTTATGTG 
GAATAAAG GGCTAGCTACAACGA CCCGTAAA 
GCAAGGTA GGCTAGCTACAACGA CGTAGAAG 
GGAGTTTG GGCTAGCTACAACGA CATTTAGG 
ACAAATTG GGCTAGCTACAACGA TTACATCT 
GTAAGGGG GGCTAGCTACAACGA CCCACAAA 
TTCATTTA GGCTAGCTACAACGA TGTAAGGG 
GGATAAAA GGCTAGCTACAACGA CTAGCAGG 
GGATAATA GGCTAGCTACAACGA GGTTTGAT 
ACTACATA GGCTAGCTACAACGA TCTGGATA 
ATGATTAA GGCTAGCTACAACGA TACATACT 
GATCCCCG GGCTAGCTACAACGA CTTCCAAA 
CGTGTGGA GGCTAGCTACAACGA TCTCTTTT 
AGGCGCTA GGCTAGCTACAACGA GTGTGGAC 
ATGAGGCG GGCTAGCTACAACGA TACGTGTG 
TATGGTGA GGCTAGCTACAACGA CCGCAAAA 
CTCCCATG GGCTAGCTACAACGA TGTAGATC 
AAGACCAA GGCTAGCTACAACGA CTCCCATG 
TTGGAAGA GGCTAGCTACAACGA CAACCTCC 
TCCCCATG GGCTAGCTACAACGA CTTTTCGA 
GGGTCCAA GGCTAGCTACAACGA TGATGATC 
TGAGTTGG GGCTAGCTACAACGA TTTGAATG 
TGGATTTA GGCTAGCTACAACGA TGAGTTGG 
GCGTCCGG GGCTAGCTACAACGA CAGTTGTC 
CACTCCCA GGCTAGCTACAACGA CTTGTTGG 
TGCTCCCA GGCTAGCTACAACGA TCCCACCT 
CCCGAATG GGCTAGCTACAACGA TCCCACTC 
AACCCTGG GGCTAGCTACAACGA CCGAATGC 
GGGGTGAA GGCTAGCTACAACGA CCTGGCCC 
~~ GGGCTCCA GGCTAGCTACAACGA CCCAACAG 
CGTGAGGG GGCTAGCTACAACGA TCCACCCC 
TGAGTAGG GGCTAGCTACAACGA CCTGAGCG 
AGGAGCTG GGCTAGCTACAACGA TGGCACAG 
AGGAGGAG GGCTAGCTACAACGA TGCTGGCA 
CCTGACTG GGCTAGCTACAACGA CGATTGGT 
CTTCCTGA GGCTAGCTACAACGA TGCCGATT 
GTAGGCTG GGCTAGCTACAACGA CTTCCTGA 
GGAGTAGG GGCTAGCTACAACGA TGCCTTCC 
CTGCATGG GGCTAGCTACAACGA CTGAGGAT 
GTGGAAAG GGCTAGCTACAACGA GGTGGAGT 
GAGTTTGG GGCTAGCTACAACGA GGAAAGTG 
TTGAAGAG GGCTAGCTACAACGA TTGGTGGA 
CTCTGGGA GGCTAGCTACAACGA CTTGAAGA 
CAGGAAAG GGCTAGCTACAACGA ACAGGGCC 
GCTCACTG GGCTAGCTACAACGA TCCTGAAC 
AGACAGTA GGCTAGCTACAACGA TCTGAGCA 
AGAGACAG GGCTAGCTACAACGA ATTCTGAG 
TGACGATA GGCTAGCTACAACGA GGCAGAGA 
ATTGACGA GGCTAGCTACAACGA ATGGCAGA 
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UAUCGUCA A UCUUAUCG 
UCAAUCUU A UCGAAGAC 
UAUCGAAG A CUGGGGAC 
GACUGGGG A CCCUGUAC 
GACCCUGU A CCGAACAU 
UGUACCGA A CAUGGAGA 
UACCGAAC A UGGAGAAC 
CAUGGAGA A CAUCGCAU 
UGGAGAAC A UCGCAUCA 
AACAUCGC A UCAGGACU 
GCAUCAGG A CUCCUAGG 
CUCCUAGG A CCCCUGCU 
CUCGUGUU A CAGGCGGG 
UCUUGUUG A CAAAAAUC 
UGACAAAA A UCCUCACA 
AAAUCCUC A CAAUACCA 
UCCUCACA A UACCACAG 
CUCACAAU A CCACAGAG 
ACAAUACC A CAGAGUCU 
GAGUCUAG A CUCGUGGU 
CGUGGUGG A CUUCUCUC 
UUCUCUCA A UUUUCUAG 
UAGGGGGA A CACCCGUG 
GGGGGAAC A CCCGUGUG 
UGGCCAAA A UUCGCAGU 
AGUCCCAA A UCUCCAGU 
CUCCAGUC A CUCACCAA 
AGUCACUC A CCAACCUG 
ACUCACCA A CCUGUUGU 
GUCCUCCA A UUUGUCCU 



UCCUGGUU A 
AUCGCUGG A UGUGUCUG 
GGCGUUUU A UCAUCUUC 
GUUUUAUC A UCUUCCUC 
UCCUCUGC A UCCUGCUG 
CUGCUGCU A UGCCUCAU 
UAUGCCUC A UCUUCUUG 
UCUUCUGG A CUAUCAAG 



AUCAAGGU A UGUUGCCC 
GUCCUCUA A UUCCAGGA 
AUUCCAGG A UCAUCAAC 
CCAGGAUC A UCAACAAC 
GAUCAUCA A CAACCAGC 
CAUCAACA A CCAGCACC 
AGCACCGG A CCAUGCAA 
ACCGGACC A UGCAAAAC 
CAUGCAAA A CCUGCACA 
AAACCUGC A CAACUCCU 
CCUGCACA A CUCCUGCU 
CUCAAGGA A CCUCUAUG 



CGATAAGA GGCTAGCTACAACGA TGACGATA 
GTCTTCGA GGCTAGCTACAACGA AAGATTGA 
GTCCCCAG GGCTAGCTACAACGA CTTCGATA 
GTACAGGG GGCTAGCTACAACGA CCCCAGTC 
ATGTTCGG GGCTAGCTACAACGA ACAGGGTC 
TCTCCATG GGCTAGCTACAACGA TCGGTACA 
GTTCTCCA GGCTAGCTACAACGA GTTCGGTA 
ATGCGATG GGCTAGCTACAACGA TCTCCATG 
TGATGCGA GGCTAGCTACAACGA GTTCTCCA 
AGTCCTGA GGCTAGCTACAACGA GCGATGTT 
CCTAGGAG GGCTAGCTACAACGA CCTGATGC 
AGCAGGGG GGCTAGCTACAACGA CCTAGGAG 
CCCGCCTG GGCTAGCTACAACGA AACACGAG 
GATTTTTG GGCTAGCTACAACGA CAACAAGA 
TGTGAGGA GGCTAGCTACAACGA TTTTGTCA 
TGGTATTG GGCTAGCTACAACGA GAGGATTT 
CTGTGGTA GGCTAGCTACAACGA TGTGAGGA 
CTCTGTGG GGCTAGCTACAACGA ATTGTGAG 
AGACTCTG GGCTAGCTACAACGA GGTATTGT 
ACCACGAG GGCTAGCTACAACGA CTAGACTC 
GAGAGAAG GGCTAGCTACAACGA CCACCACG 
CTAGAAAA GGCTAGCTACAACGA TGAGAGAA 
CACGGGTG GGCTAGCTACAACGA TCCCCCTA 
CACACGGG GGCTAGCTACAACGA GTTCCCCC 
ACTGCGAA GGCTAGCTACAACGA TTTGGCCA 
ACTGGAGA GGCTAGCTACAACGA TTGGGACT 
TTGGTGAG GGCTAGCTACAACGA GACTGGAG 
CAGGTTGG GGCTAGCTACAACGA GAGTGACT 
ACAACAGG GGCTAGCTACAACGA TGGTGAGT 
AGGACAAA GGCTAGCTACAACGA TGGAGGAC 
TCCAGCGA GGCTAGCTACAACGA AACCAGGA 
CAGACACA GGCTAGCTACAACGA CCAGCGAT 
GAAGATGA GGCTAGCTACAACGA AAAACGCC 
GAGGAAGA GGCTAGCTACAACGA GATAAAAC 
CAGCAGGA GGCTAGCTACAACGA GCAGAGGA 
ATGAGGCA GGCTAGCTACAACGA AGCAGCAG 
CAAGAAGA GGCTAGCTACAACGA GAGGCATA 
CTTGATAG GGCTAGCTACAACGA CCAGAAGA 
TACCTTGA GGCTAGCTACAACGA AGTCCAGA 
GGGCAACA GGCTAGCTACAACGA ACCTTGAT 
TCCTGGAA GGCTAGCTACAACGA TAGAGGAC 
GTTGATGA GGCTAGCTACAACGA CCTGGAAT 
GTTGTTGA GGCTAGCTACAACGA GATCCTGG 
GCTGGTTG GGCTAGCTACAACGA TGATGATC 
GGTGCTGG GGCTAGCTACAACGA TGTTGATG 
TTGCATGG GGCTAGCTACAACGA CCGGTGCT 
GTTTTGCA GGCTAGCTACAACGA GGTCCGGT 
TGTGCAGG GGCTAGCTACAACGA TTTGCATG 
AGGAGTTG GGCTAGCTACAACGA GCAGGTTT 
AGCAGGAG GGCTAGCTACAACGA TGTGCAGG 
CATAGAGG GGCTAGCTACAACGA TCCTTGAG 
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GAACCUCU A UGUUUCCC 



GGGAAACA GGCTAGCTACAACGA AGAGGTTC 



UUUCCCUC A UGUUGCUG 



CAGCAACA GGCTAGCTACAACGA GAGGGAAA 



GUUGCUGU A CAAAACCU 



AGGTTTTG GGCTAGCTACAACGA ACAGCAAC 



UGUACAAA A CCUACGGA 



TCCGTAGG GGCTAGCTACAACGA TTTGTACA 



CAAAACCU A CGGACGGA 



TCCGTCCG GGCTAGCTACAACGA AGGTTTTG 



ACCUACGG A CGGAAACU 



AGTTTCCG GGCTAGCTACAACGA CCGTAGGT 



GGACGGAA A CUGCACCU 



AGGTGCAG GGCTAGCTACAACGA TTCCGTCC 



GAAACUGC A CCUGUAUU 



AATACAGG GGCTAGCTACAACGA GCAGTTTC 



GCACCUGU A UUCCCAUC 



GATGGGAA GGCTAGCTACAACGA ACAGGTGC 



GUAUUCCC A UCCCAUCA 



TGATGGGA GGCTAGCTACAACGA GGGAATAC 



CCCAUCCC A UCAUCUUG 



CAAGATGA GGCTAGCTACAACGA GGGATGGG 



AUCCCAUC A UCUUGGGC 



GCCCAAGA GGCTAGCTACAACGA GATGGGAT 



UUCGCAAA A UACCUAUG 



CATAGGTA GGCTAGCTACAACGA TTTGCGAA 



CGCAAAAU A CCUAUGGG 



CCCATAGG GGCTAGCTACAACGA ATTTTGCG 



AAAUACCU A UGGGAGUG 



CACTCCCA GGCTAGCTACAACGA AGGTATTT 



CUCAGUUU A CUAGUGCC 



GGCACTAG GGCTAGCTACAACGA AAACTGAG 



CUAGUGCC A UUUGUUCA 



TGAACAAA GGCTAGCTACAACGA GGCACTAG 



CCAGACAG GGCTAGCTACAACGA GGGGGAAA 



CATCCATA GGCTAGCTACAACGA AACTGAAA 



UCAGUUAU A UGGAUGAU 



ATCATCCA GGCTAGCTACAACGA ATAACTGA 



UUAUAUGG A UGAUGUGG 



CCACATCA GGCTAGCTACAACGA CCATATAA 



UAUGGAUG A UGUGGUUU 



AAACCACA GGCTAGCTACAACGA CATCCATA 



AAGUCUGU A CAACAUCU 



AGATGTTG GGCTAGCTACAACGA ACAGACTT 



UCUGUACA A CAUCUUGA 



TCAAGATG GGCTAGCTACAACGA TGTACAGA 



UGUACAAC A UCUUGAGU 



ACTCAAGA GGCTAGCTACAACGA GTTGTACA 



GUCCCUUU A UGCCGCUG 



CAGCGGCA GGCTAGCTACAACGA AAAGGGAC 



CCGCUGUU A CCAAUUUU 



AAAATTGG GGCTAGCTACAACGA AACAGCGG 



UGUUACCA A UUUUCUUU 



AAAGAAAA GGCTAGCTACAACGA TGGTAACA 



CUUUGGGU A UACAUUUA 



TAAATGTA GGCTAGCTACAACGA ACCCAAAG 



UUGGGUAU A CAUUUAAA 



TTTAAATG GGCTAGCTACAACGA ATACCCAA 



GGGUAUAC A UUUAAACC 



GGTTTAAA GGCTAGCTACAACGA GTATACCC 



ACAUUUAA A CCCUCACA 



TGTGAGGG GGCTAGCTACAACGA TTAAATGT 



AAACCCUC A CAAAACAA 



TTGTTTTG GGCTAGCTACAACGA GAGGGTTT 



CUCACAAA A CAAAAAGA 



TCTTTTTG GGCTAGCTACAACGA TTTGTGAG 



ACAAAAAG A UGGGGAUA 



TATCCCCA GGCTAGCTACAACGA CTTTTTGT 



AGAUGGGG A UAUUCCCU 



AGGGAATA GGCTAGCTACAACGA CCCCATCT 



AUGGGGAU A UUCCCUUA 



TAAGGGAA GGCTAGCTACAACGA ATCCCCAT 



UUCCCUUA A CUUCAUGG 



CCATGAAG GGCTAGCTACAACGA TAAGGGAA 



UUAACUUC A UGGGAUAU 



ATATCCCA GGCTAGCTACAACGA GAAGTTAA 



UUCAUGGG A UAUGUAAU 



ATTACATA GGCTAGCTACAACGA CCCATGAA 



CAUGGGAU A UGUAAUUG 



CAATTACA GGCTAGCTACAACGA ATCCCATG 



GAUAUGUA A UUGGGAGU 



ACTCCCAA GGCTAGCTACAACGA TACATATC 



GUUGGGGC A CAUUGCCA 



TGGCAATG GGCTAGCTACAACGA GCCCCAAC 



UGGGGCAC A UUGCCACA 



TGTGGCAA GGCTAGCTACAACGA GTGCCCCA 



ACAUUGCC A CAGGAACA 



TGTTCCTG GGCTAGCTACAACGA GGCAATGT 



CCACAGGA A CAUAUUGU 



ACAATATG GGCTAGCTACAACGA TCCTGTGG 



ACAGGAAC A UAUUGUAC 



GTACAATA GGCTAGCTACAACGA GTTCCTGT 



AGGAACAU A UUGUACAA 



TTGTACAA GGCTAGCTACAACGA ATGTTCCT 



CAUAUUGU A CAAAAAAU 



ATTTTTTG GGCTAGCTACAACGA ACAATATG 



UACAAAAA A UCAAAAUG 



CATTTTGA GGCTAGCTACAACGA TTTTTGTA 



AAAUCAAA A UGUGUUUU 



AAAACACA GGCTAGCTACAACGA TTTGATTT 
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UUUAGGAA A CUUCCUGU 



ACAGGAAG GGCTAGCTACAACGA TTCCTAAA 



UCCUGUAA A CAGGCCUA 



TAGGCCTG GGCTAGCTACAACGA TTACAGGA 



ACAGGCCU A UUGAUUGG 



CCAATCAA GGCTAGCTACAACGA AGGCCTGT 



GCCUAUUG A UUGGAAAG 



CTTTCCAA GGCTAGCTACAACGA CAATAGGC 



UGGAAAGU A UGUCAACG 



CGTTGACA GGCTAGCTACAACGA ACTTTCCA 



GUAUGUCA A CGAAUUGU 



ACAATTCG GGCTAGCTACAACGA TGACATAC 



999 



GUCAACGA A UUGUGGGU 



ACCCACAA GGCTAGCTACAACGA TCGTTGAC 



CCCCUUUC A CGCAAUGU 



ACATTGCG GGCTAGCTACAACGA GAAAGGGG 



UUCACGCA A UGUGGAUA 



TATCCACA GGCTAGCTACAACGA TGCGTGAA 



CAAUGUGG A UAUUCUGC 



GCAGAATA GGCTAGCTACAACGA CCACATTG 



AUGUGGAU A UUCUGCUU 



AAGCAGAA GGCTAGCTACAACGA ATCCACAT 



CUGCUUUA A UGCCUUUA 



TAAAGGCA GGCTAGCTACAACGA TAAAGCAG 



AUGCCUUU A UAUGCAUG 



CATGCATA GGCTAGCTACAACGA AAAGGCAT 



GCCUUUAU A UGCAUGCA 



TGCATGCA GGCTAGCTACAACGA ATAAAGGC 



UUAUAUGC A UGCAUACA 



TGTATGCA GGCTAGCTACAACGA GCATATAA 



AUGCAUGC A UACAAGCA 



TGCTTGTA GGCTAGCTACAACGA GCATGCAT 



GCAUGCAU A CAAGCAAA 



TTTGCTTG GGCTAGCTACAACGA ATGCATGC 



CAAGCAAA A CAGGCUUU 



AAAGCCTG GGCTAGCTACAACGA TTTGCTTG 



AGGCUUUU A CUUUCUCG 



CGAGAAAG GGCTAGCTACAACGA AAAAGCCT 



UCUCGCCA A CUUACAAG 



CTTGTAAG GGCTAGCTACAACGA TGGCGAGA 



GCCAACUU A CAAGGCCU 



AGGCCTTG GGCTAGCTACAACGA AAGTTGGC 



CUAAGUAA A CAGUAUGU 



ACATACTG GGCTAGCTACAACGA TTACTTAG 



UAAACAGU A UGUGAACC 



GGTTCACA GGCTAGCTACAACGA ACTGTTTA 



GUAUGUGA A CCUUUACC 



GGTAAAGG GGCTAGCTACAACGA TCACATAC 



GAACCUUU A CCCCGUUG 



CAACGGGG GGCTAGCTACAACGA AAAGGTTC 



GCUCGGCA A CGGCCUGG 



CCAGGCCG GGCTAGCTACAACGA TGCCGAGC 



CCUGGUCU A UGCCAAGU 



ACTTGGCA GGCTAGCTACAACGA AGACCAGG 



1192 



GUUUGCUG A CGCAACCC 



GGGTTGCG GGCTAGCTACAACGA CAGCAAAC 



CUGACGCA A CCCCCACU 



AGTGGGGG GGCTAGCTACAACGA TGCGTCAG 



CAACCCCC A CUGGUUGG 



CCAACCAG GGCTAGCTACAACGA GGGGGTTG 



GCUUGGCC A UAGGCCAU 



ATGGCCTA GGCTAGCTACAACGA GGCCAAGC 



CAUAGGCC A UCAGCGCA 



TGCGCTGA GGCTAGCTACAACGA GGCCTATG 



AUCAGCGC A UGCGUGGA 



TCCACGCA GGCTAGCTACAACGA GCGCTGAT 



UGCGUGGA A CCUUUGUG 



CACAAAGG GGCTAGCTACAACGA TCCACGCA 



CUCUGCCG A UCCAUACC 



GGTATGGA GGCTAGCTACAACGA CGGCAGAG 



GCCGAUCC A UACCGCGG 



CCGCGGTA GGCTAGCTACAACGA GGATCGGC 



CGAUCCAU A CCGCGGAA 



TTCCGCGG GGCTAGCTACAACGA ATGGATCG 



ACCGCGGA A CUCCUAGC 



GCTAGGAG GGCTAGCTACAACGA TCCGCGGT 



GGGGCAAA A CUCAUCGG 



CCGATGAG GGCTAGCTACAACGA TTTGCCCC 



CAAAACUC A UCGGGACU 



AGTCCCGA GGCTAGCTACAACGA GAGTTTTG 



UCAUCGGG A CUGACAAU 



ATTGTCAG GGCTAGCTACAACGA CCCGATGA 



CGGGACUG A CAAUUCUG 



CAGAATTG GGCTAGCTACAACGA CAGTCCCG 



GACUGACA A UUCUGUCG 



CGACAGAA GGCTAGCTACAACGA TGTCAGTC 



UCCCGCAA A UAUACAUC 



GATGTATA GGCTAGCTACAACGA TTGCGGGA 



CCGCAAAU A UACAUCAU 



ATGATGTA GGCTAGCTACAACGA ATTTGCGG 



GCAAAUAU A CAUCAUUU 



AAATGATG GGCTAGCTACAACGA ATATTTGC 



AAAUAUAC A UCAUUUCC 



GGAAATGA GGCTAGCTACAACGA GTATATTT 



UAUACAUC A UUUCCAUG 



CATGGAAA GGCTAGCTACAACGA GATGTATA 



UCAUUUCC A UGGCUGCU 



AGCAGCCA GGCTAGCTACAACGA GGAAATGA 



UGCUGCCA A CUGGAUCC 



GGATCCAG GGCTAGCTACAACGA TGGCAGCA 



CCAACUGG A UCCUACGC 



GCGTAGGA GGCTAGCTACAACGA CCAGTTGG 
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UGGAUCCU A CGCGGGAC 



GTCCCGCG GGCTAGCTACAACGA AGGATCCA 



UACGCGGG A CGUCCUUU 



AAAGGACG GGCTAGCTACAACGA CCCGCGTA 



CUUUGUUU A CGUCCCGU 



ACGGGACG GGCTAGCTACAACGA AAACAAAG 



GGCGCUGA A UCCCGCGG 



CCGCGGGA GGCTAGCTACAACGA TCAGCGCC 



UCCCGCGG A CGACCCCU 



AGGGGTCG GGCTAGCTACAACGA CCGCGGGA 



CGCGGACG A CCCCUCCC 



GGGAGGGG GGCTAGCTACAACGA CGTCCGCG 



GGGGCUCU A CCGCCCGC 



GCGGGCGG GGCTAGCTACAACGA AGAGCCCC 



CUCCGCCU A UUGUACCG 



CGGTACAA GGCTAGCTACAACGA AGGCGGAG 



CCUAUUGU A CCGACCGU 



ACGGTCGG GGCTAGCTACAACGA ACAATAGG 



UUGUACCG A CCGUCCAC 



GTGGACGG GGCTAGCTACAACGA CGGTACAA 



GACCGUCC A CGGGGCGC 



GCGCCCCG GGCTAGCTACAACGA GGACGGTC 



CGGGGCGC A CCUCUCUU 



AAGAGAGG GGCTAGCTACAACGA GCGCCCCG 



CUCUCUUU A CGCGGACU 



AGTCCGCG GGCTAGCTACAACGA AAAGAGAG 



UUACGCGG A CUCCCCGU 



ACGGGGAG GGCTAGCTACAACGA CCGCGTAA 



GCCUUCUC A UCUGCCGG 



CCGGCAGA GGCTAGCTACAACGA GAGAAGGC 



UCUGCCGG A CCGUGUGC 



GCACACGG GGCTAGCTACAACGA CCGGCAGA 



CCGUGUGC A CUUCGCUU 



AAGCGAAG GGCTAGCTACAACGA GCACACGG 



UUCGCUUC A CCUCUGCA 



TGCAGAGG GGCTAGCTACAACGA GAAGCGAA 



ACCUCUGC A CGUCGCAU 



ATGCGACG GGCTAGCTACAACGA GCAGAGGT 



CACGUCGC A UGGAGACC 



GGTCTCCA GGCTAGCTACAACGA GCGACGTG 



GCAUGGAG A CCACCGUG 



CACGGTGG GGCTAGCTACAACGA CTCCATGC 



GTTCACGG GGCTAGCTACAACGA GGTCTCCA 



CACCGUGA A CGCCCACA 



TGTGGGCG GGCTAGCTACAACGA TCACGGTG 



CCACAGGA A CCUGCCCA 



TGGGCAGG GGCTAGCTACAACGA TCCTGTGG 



GGUCUUGC A UAAGAGGA 



TCCTCTTA GGCTAGCTACAACGA GCAAGACC 



AUAAGAGG A CUCUUGGA 



TCCAAGAG GGCTAGCTACAACGA CCTCTTAT 



ACUCUUGG A CUUUCAGC 



GCTGAAAG GGCTAGCTACAACGA CCAAGAGT 



UUUCAGCA A UGUCAACG 



CGTTGACA GGCTAGCTACAACGA TGCTGAAA 



CAAUGUCA A CGACCGAC 



GTCGGTCG GGCTAGCTACAACGA TGACATTG 



UGUCAACG A CCGACCUU 



AAGGTCGG GGCTAGCTACAACGA CGTTGACA 



AACGACCG A CCUUGAGG 



CCTCAAGG GGCTAGCTACAACGA CGGTCGTT 



CUUGAGGC A UACUUCAA 



TTGAAGTA GGCTAGCTACAACGA GCCTCAAG 



UGAGGCAU A CUUCAAAG 



CTTTGAAG GGCTAGCTACAACGA ATGCCTCA 



CUUCAAAG A CUGUGUGU 



ACACACAG GGCTAGCTACAACGA CTTTGAAG 



UGUGUUUA A UGAGUGGG 



CCCACTCA GGCTAGCTACAACGA TAAACACA 



GUCUUUGU A CUAGGAGG 



CCTCCTAG GGCTAGCTACAACGA ACAAAGAC 



CUGUAGGC A UAAAUUGG 



CCAATTTA GGCTAGCTACAACGA GCCTACAG 



AGGCAUAA A UUGGUGUG 



CACACCAA GGCTAGCTACAACGA TTATGCCT 



GUGUGUUC A CCAGCACC 



GGTGCTGG GGCTAGCTACAACGA GAACACAC 



UCACCAGC A CCAUGCAA 



TTGCATGG GGCTAGCTACAACGA GCTGGTGA 



CCAGCACC A UGCAACUU 



AAGTTGCA GGCTAGCTACAACGA GGTGCTGG 



ACCAUGCA A CUUUUUCA 



TGAAAAAG GGCTAGCTACAACGA TGCATGGT 



ACUUUUUC A CCUCUGCC 



GGCAGAGG GGCTAGCTACAACGA GAAAAAGT 



UCUGCCUA A UCAUCUCA 



TGAGATGA GGCTAGCTACAACGA TAGGCAGA 



GCCUAAUC A UCUCAUGU 



ACATGAGA GGCTAGCTACAACGA GATTAGGC 



AUCAUCUC A UGUUCAUG 



CATGAACA GGCTAGCTACAACGA GAGATGAT 



UCAUGUUC A UGUCCUAC 



GTAGGACA GGCTAGCTACAACGA GAACATGA 



CAUGUCCU A CUGUUCAA 



TTGAACAG GGCTAGCTACAACGA AGGACATG 



UUUGGGGC A UGGACAUU 



AATGTCCA GGCTAGCTACAACGA GCCCCAAA 



GGGCAUGG A CAUUGACC 



GGTCAATG GGCTAGCTACAACGA CCATGCCC 



GCAUGGAC A UUGACCCG 



CGGGTCAA GGCTAGCTACAACGA GTCCATGC 
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GGACAUUG A CCCGUAUA 
UGACCCGU A UAAAGAAU 
UAUAAAGA A UUUGGAGC 
GUGGAGUU A CUCUCUUU 
GCCUUCUG A CUUCUUUC 
UUCCUUCU A UUCGAGAU 
UAUUCGAG A UCUCCUCG~ 
UCUCCUCG A CACCGCCU 
UCCUCGAC A CCGCCUCU 
UGCUCUGU A UCGGGGGG 
UCUCCGGA A CAUUGUUC 
UCCGGAAC A UUGUUCAC 
CAUUGUUC A CCUCACCA 
UUCACCUC A CCAUACGG~ 
ACCUCACC A UACGGCAC 
CUCACCAU A CGGCACUC 
CAUACGGC A CUCAGGCA 
GGCAAGCU A UUCUGUGU 



TATACGGG GGCTAGCTACAACGA CAATGTCC 
ATTCTTTA GGCTAGCTACAACGA ACGGGTCA 
GCTCCAAA GGCTAGCTACAACGA TCTTTATA 
AAAGAGAG GGCTAGCTACAACGA AACTCCAC 
GAAAGAAG GGCTAGCTACAACGA CAGAAGGC 
ATCTCGAA GGCTAGCTACAACGA AGAAGGAA 
CGAGGAGA GGCTAGCTACAACGA CTCGAATA 
AGGCGGTG GGCTAGCTACAACGA CGAGGAGA 
AGAGGCGG GGCTAGCTACAACGA GTCGAGGA 
CCCCCCGA GGCTAGCTACAACGA ACAGAGCA 
GAACAATG GGCTAGCTACAACGA TCCGGAGA 
GTGAACAA GGCTAGCTACAACGA GTTCCGGA 
TGGTGAGG GGCTAGCTACAACGA GAACAATG 
CCGTATGG GGCTAGCTACAACGA GAGGTGAA 
GTGCCGTA GGCTAGCTACAACGA GGTGAGGT 
GAGTGCCG GGCTAGCTACAACGA ATGGTGAG 
TGCCTGAG GGCTAGCTACAACGA GCCGTATG 
GGCTAGCTACAACGA AGCTTGCC 



GUGAGUUG A UGAAUCUA 



TAGATTCA GGCTAGCTACAACGA CAACTCAC 



GUUGAUGA A UCUAGCCA 



TGGCTAGA GGCTAGCTACAACGA TCATCAAC 



AUCUAGCC A CCUGGGUG 



CACCCAGG GGCTAGCTACAACGA GGCTAGAT 



GGGAAGUA A UUUGGAAG 



CTTCCAAA GGCTAGCTACAACGA TACTTCCC 



UUUGGAAG A UCCAGCAU 



ATGCTGGA GGCTAGCTACAACGA CTTCCAAA 



GAUCCAGC A UCCAGGGA 



TCCCTGGA GGCTAGCTACAACGA GCTGGATC 



UCCAGGGA A UUAGUAGU 



ACTACTAA GGCTAGCTACAACGA TCCCTGGA 



AGUCAGCU A UGUCAACG 



CGTTGACA GGCTAGCTACAACGA AGCTGACT 



CUAUGUCA A CGUUAAUA 



TATTAACG GGCTAGCTACAACGA TGACATAG 



CAACGUUA A UAUGGGCC 



GGCCCATA GGCTAGCTACAACGA TAACGTTG 



ACGUUAAU A UGGGCCUA 



TAGGCCCA GGCTAGCTACAACGA ATTAACGT 



GCCUAAAA A UCAGACAA 



TTGTCTGA GGCTAGCTACAACGA TTTTAGGC 



AAAAUCAG A 
AUCAGACA A CUAUUGUG 
AGACAACU A UUGUGGUU 
GUGGUUUC A CAUUUCCU 
GGUUUCAC A UUUCCUGU 
CCUGUCUU A CUUUUGGG 
GGCGAGAA A CUGUUCUU 
GUUCUUGA A UAUUUGGU 
UCUUGAAU A UUUGGUGU 
GAGUGUGG A UUCGCACU 
GGAUUCGC A CUCCUCCU 
CCUCCUGC A UAUAGACC 
UCCUGCAU A UAGACCAC 
GCAUAUAG A CCACCAAA 
UAUAGACC A CCAAAUGC 
ACCACCAA A UGCCCCUA 
AUGCCCCU A UCUUAUCA 
CCUAUCUU A UCAACACU 
UCUUAUCA A CACUUCCG 
UUAUCAAC A CUUCCGGA 
UUCCGGAA A CUACUGUU 



AATAGTTG GGCTAGCTACAACGA CTGATTTT 
CACAATAG GGCTAGCTACAACGA TGTCTGAT 
AACCACAA GGCTAGCTACAACGA AGTTGTCT 
AGGAAATG GGCTAGCTACAACGA GAAACCAC 
ACAGGAAA GGCTAGCTACAACGA GTGAAACC 
CCCAAAAG GGCTAGCTACAACGA AAGACAGG 
AAGAACAG GGCTAGCTACAACGA TTCTCGCC 
ACCAAATA GGCTAGCTACAACGA TCAAGAAC 
ACACCAAA GGCTAGCTACAACGA ATTCAAGA 
AGTGCGAA GGCTAGCTACAACGA CCACACTC 
AGGAGGAG GGCTAGCTACAACGA GCGAATCC 
GGTCTATA GGCTAGCTACAACGA GCAGGAGG 
GTGGTCTA GGCTAGCTACAACGA ATGCAGGA 
TTTGGTGG GGCTAGCTACAACGA CTATATGC 
GCATTTGG GGCTAGCTACAACGA GGTCTATA 
TAGGGGCA GGCTAGCTACAACGA TTGGTGGT 
TGATAAGA GGCTAGCTACAACGA AGGGGCAT 
AGTGTTGA GGCTAGCTACAACGA AAGATAGG 
CGGAAGTG GGCTAGCTACAACGA TGATA AGA 
TCCGGAAG GGCTAGCTACAACGA GTTGATAA 
AACAGTAG GGCTAGCTACAACGA TTCCGGAA 
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234: 

2352 

2380 

2397 

2409 

2427 

2433 

2442 

2448 

2456 

2465 

2467 

2469 

2481 

2486 

2496 

2504 

2509 ~ 

2520 

2527 



CGGAAACU A CUGUUGUU 
GUUGUUAG A CGAAGAGG 
GAAGAAGA A CUCCCUCG 
CCUCGCAG A CGAAGGUC 
AGGUCUCA A UCGCCGCG 
CGCAGAAG A UCUCAAUC 
AGAUCUCA A UCUCGGGA 
UCUCGGGA A UCUCAAUG 
GAAUCUCA A UGUUAGUA 
AUGUUAGU A UUCCUUGG 
UUCCUUGG A CACAUAAG 
CCUUGGAC A CAUAAGGU 
UUGGACAC A UAAGGUGG 
GGUGGGAA A CUUUACGG 
GAAACUUU A CGGGGCUU 
GGGGCUUU A UUCUUCUA 
AUUCUUCU A CGGUACCU 
UCUACGGU A 



AACAACAG GGCTAGCTACAACGA AGTTTCCG 
CCTCTTCG GGCTAGCTACAACGA CTAACAAC 
CGAGGGAG GGCTAGCTACAACGA TCTTCTTC 
GACCTTCG GGCTAGCTACAACGA CTGCGAGG 
CGCGGCGA GGCTAGCTACAACGA TGAGACCT 
GATTGAGA GGCTAGCTACAACGA CTTCTGCG 
TCCCGAGA GGCTAGCTACAACGA TGAGATCT 
CATT3AGA GGCTAGCTACAACGA TCCCGAGA 
TACTAACA GGCTAGCTACAACGA TGAGATTC 
CCAAGGAA GGCTAGCTACAACGA ACTAACAT 
CTTATGTG GGCTAGCTACAACGA CCAAGGAA 
ACCTTATG GGCTAGCTACAACGA GTCCAAGG 
CCACCTTA GGCTAGCTACAACGA GTGTCCAA 
CCGTAAAG GGCTAGCTACAACGA TTCCCACC 
AAGCCCCG GGCTAGCTACAACGA AAAGTTTC 
TAGAAGAA GGCTAGCTACAACGA AAAGCCCC 
AGGTACCG GGCTAGCTACAACGA AGAAGAAT 
AAGCAAGG GGCTAGCTACAACGA ACCGTAGA 



UUGCUUUA A UCCUAAAU 



ATTTAGGA GGCTAGCTACAACGA TAAAGCAA 



AAUCCUAA A UGGCAAAC 



GTTTGCCA GGCTAGCTACAACGA TTAGGATT 



2534 



AAUGGCAA A CUCCUUCU 



AGAAGGAG GGCTAGCTACAACGA TTGCCATT 



2550 
2552 
2556 



UUUUCCUG A CAUUCAUU 



AATGAATG GGCTAGCTACAACGA CAGGAAAA 



UUCCUGAC A UUCAUUUG 



CAAATGAA GGCTAGCTACAACGA GTCAGGAA 



UGACAUUC A UUUGCAGG 



CCTGCAAA GGCTAGCTACAACGA GAATGTCA 



GCAGGAGG A CAUUGUUG 



CAACAATG GGCTAGCTACAACGA CCTCCTGC 



2570 
2577 
2581 
2590 
2606 
2613 
2619 
2626 
2632 
2635 



AGGAGGAC A UUGUUGAU 



ATCAACAA GGCTAGCTACAACGA GTCCTCCT 



CAUUGUUG A UAGAUGUA 



TACATCTA GGCTAGCTACAACGA CAACAATG 



GUUGAUAG A UGUAAGCA 



TGCTTACA GGCTAGCTACAACGA CTATCAAC 



UGUAAGCA A UUUGUGGG 



CCCACAAA GGCTAGCTACAACGA TGCTTACA 



GGCCCCUU A CAGUAAAU 



ATTTACTG GGCTAGCTACAACGA AAGGGGCC 



UACAGUAA A UGAAAACA 



TGTTTTCA GGCTAGCTACAACGA TTACTGTA 



AAAUGAAA A CAGGAGAC 



GTCTCCTG GGCTAGCTACAACGA TTTCATTT 



AACAGGAG A CUUAAAUU 



AATTTAAG GGCTAGCTACAACGA CTCCTGTT 



AGACUUAA A UUAACUAU 



ATAGTTAA GGCTAGCTACAACGA TTAAGTCT 



UUAAAUUA A CUAUGCCU 



AGGCATAG GGCTAGCTACAACGA TAATTTAA 



AAUUAACU A UGCCUGCU 



AGCAGGCA GGCTAGCTACAACGA AGTTAATT 



2655 
2661 
2666 
2671 
2673 
2685 
2693 
2698 
2703 
2706 
2715 
2724 
2727 
2730 
2738 



UAGGUUUU A UCCCAAUG 



CATTGGGA GGCTAGCTACAACGA AAAACCTA 



UUAUCCCA A UGUUACUA 
CCAAUGUU A CUAAAUAU 
GUUACUAA A UAUUUGCC 
UACUAAAU A UUUGCCCU 
GCCCUUAG A UAAAGGGA~ 
AUAAAGGG A UCAAACCG 
GGGAUCAA A CCGUAUUA 
CAAACCGU A UUAUCCAG 
ACCGUAUU A UCCAGAGU 
UCCAGAGU A UGUAGUUA 
UGUAGUUA A UCAUUACU 
AGUUAAUC A UUACUUCC 
UAAUCAUU A CUUCCAGA 
ACUUCCAG A CGCGACAU 



TAGTAACA GGCTAGCTACAACGA 
AT ATT TAG GGCTAGCTACAACGA AACATTGG 
GGCAAATA GGCTAGCTACAACGA TTAGTAAC 
AGGGCAAA GGCTAGCTACAACGA ATTTAGTA 
TCCCTTTA GGCTAGCTACAACGA CTAAGGGC 
CGGTTTGA GGCTAGCTACAACGA CCCTTTAT 
TAATACGG GGCTAGCTACAACGA TTGATCCC 
CTGGATAA GGCTAGCTACAACGA ACGGTTTG 
ACTCTGGA GGCTAGCTACAACGA AATACGGT 
TAACTACA GGCTAGCTACAACGA ACTCTGGA 
AGTAATGA GGCTAGCTACAACGA TAACTACA 
GGAAGTAA GGCTAGCTAC AACGA GATTAACT 
TCTGGAAG GGCTAGCTACAACGA AATGATTA 
ATGTCGCG GGCTAGCTACAACGA CTGGAAGT 
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CAGACGCG A CAUUAUUU 



AAATAATG GGCTAGCTACAACGA CGCGTCTG 



GACGCGAC A UUAUUUAC 



GTAAATAA GGCTAGCTACAACGA GTCGCGTC 



GCGACAUU A UUUACACA 



TGTGTAAA GGCTAGCTACAACGA AATGTCGC 



CAUUAUUU A CACACUCU 



AGAGTGTG GGCTAGCTACAACGA AAATAATG 



UUAUUUAC A CACUCUUU 



AAAGAGTG GGCTAGCTACAACGA GTAAATAA 



AUUUACAC A CUCUUUGG 



CCAAAGAG GGCTAGCTACAACGA GTGTAAAT 



277* 



AGGCGGGG A UCUUAUAU 



ATATAAGA GGCTAGCTACAACGA CCCCGCCT 



GGGAUCUU A UAUAAAAG 



CTTTTATA GGCTAGCTACAACGA AAGATCCC 



GAUCUUAU A UAAAAGAG 



CTCTTTTA GGCTAGCTACAACGA ATAAGATC 



GAGAGUCC A CACGUAGC 



GCTACGTG GGCTAGCTACAACGA GGACTCTC 



GAGUCCAC A CGUAGCGC 



GCGCTACG GGCTAGCTACAACGA GTGGACTC 



AGCGCCUC A UUUUGCGG 



CCGCAAAA GGCTAGCTACAACGA GAGGCGCT 



UGCGGGUC A CCAUAUUC 



GAATATGG GGCTAGCTACAACGA GACCCGCA 



GGGUCACC A UAUUCUUG 



CAAGAATA GGCTAGCTACAACGA GGTGACCC 



GUCACCAU A UUCUUGGG 



CCCAAGAA GGCTAGCTACAACGA ATGGTGAC 



UCUUGGGA A CAAGAUCU 



AGATCTTG GGCTAGCTACAACGA TCCCAAGA 



GGAACAAG A UCUACAGC 



GCTGTAGA GGCTAGCTACAACGA CTTGTTCC 



CAAGAUCU A CAGCAUGG 



CCATGCTG GGCTAGCTACAACGA AGATCTTG 



UCUACAGC A UGGGAGGU 



ACCTCCCA GGCTAGCTACAACGA GCTGTAGA 



287C 



UCUUCCAA A CCUCGAAA 



TTTCGAGG GGCTAGCTACAACGA TTGGAAGA 



GAAAAGGC A UGGGGACA 



TGTCCCCA GGCTAGCTACAACGA GCCTTTTC 



GCAUGGGG A CAAAUCUU 



AAGATTTG GGCTAGCTACAACGA CCCCATGC 



GGGGACAA A UCUUUCUG 



CAGAAAGA GGCTAGCTACAACGA TTGTCCCC 



UGUCCCCA A UCCCCUGG 



CCAGGGGA GGCTAGCTACAACGA TGGGGACA 



CCCCUGGG A UUCUUCCC 



GGGAAGAA GGCTAGCTACAACGA CCCAGGGG 



CUUCCCCG A UCAUCAGU 



ACTGATGA GGCTAGCTACAACGA CGGGGAAG 



CCCCGAUC A UCAGUUGG 



CCAACTGA GGCTAGCTACAACGA GATCGGGG 



UCAGUUGG A CCCUGCAU 



ATGCAGGG GGCTAGCTACAACGA CCAACTGA 



GACCCUGC A UUCAAAGC 



GCTTTGAA GGCTAGCTACAACGA GCAGGGTC 



CAAAGCCA A CUCAGUAA 



TTACTGAG GGCTAGCTACAACGA TGGCTTTG 



CUCAGUAA A UCCAGAUU 



AATCTGGA GGCTAGCTACAACGA TTACTGAG 



AAAUCCAG A UUGGGACC 



GGTCCCAA GGCTAGCTACAACGA CTGGATTT 



AGAUUGGG A CCUCAACC 



GGTTGAGG GGCTAGCTACAACGA CCCAATCT 



GGACCUCA A CCCGCACA 



TGTGCGGG GGCTAGCTACAACGA TGAGGTCC 



GCACAAGG A CAACUGGC 



GCCAGTTG GGCTAGCTACAACGA CCTTGTGC 



CAAGGACA A CUGGCCGG 



CCGGCCAG GGCTAGCTACAACGA TGTCCTTG 



CUGGCCGG A CGCCAACA 



TGTTGGCG GGCTAGCTACAACGA CCGGCCAG 



GGACGCCA A CAAGGUGG 



CCACCTTG GGCTAGCTACAACGA TGGCGTCC 



GUGGGAGC A UUCGGGCC 



GGCCCGAA GGCTAGCTACAACGA GCTCCCAC 



CAGGGUUC A CCCCUCCC 



GGGAGGGG GGCTAGCTACAACGA GAACCCTG 



CCCUCCCC A UGGGGGAC 



GTCCCCCA GGCTAGCTACAACGA GGGGAGGG 



CAUGGGGG A CUGUUGGG 



CCCAACAG GGCTAGCTACAACGA CCCCCATG 



GAGCCCUC A CGCUCAGG 



CCTGAGCG GGCTAGCTACAACGA GAGGGCTC 



CAGGGCCU A CUCACAAC 



GTTGTGAG GGCTAGCTACAACGA AGGCCCTG 



GCCUACUC A CAACUGUG 



CACAGTTG GGCTAGCTACAACGA GAGTAGGC 



UACUCACA A CUGUGCCA 



TGGCACAG GGCTAGCTACAACGA TGTGAGTA 



CUGCCUCC A CCAAUCGG 



CCGATTGG GGCTAGCTACAACGA GGAGGCAG 



CUCCACCA A UCGGCAGU 



ACTGCCGA GGCTAGCTACAACGA TGGTGGAG 



GGCAGCCU A CUCCCUUA 



TAAGGGAG GGCTAGCTACAACGA AGGCTGCC 



ACUCCCUU A UCUCCACC 



GGTGGAGA GGCTAGCTACAACGA AAGGGAGT 



UUAUCUCC A CCUCUAAG 



CTTAGAGG GGCTAGCTACAACGA GGAGATAA 
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3190 I UCUAAGGG A CACUCAUC 2192 I GATGAGTG GGCTAGCTACAACGA CCCTTAGA I 5215 

3192 UAAGGGAC A CUCAUCCU 2193 AGGATGAG GGCTAGCTACAACGA GTCCCTTA 5216 

~3196 GGACACUC A UCCUCAGG 2194 CCTGAGGA GGCTAGCTACAACGA GAGTGTCC 5217 

3207 CUCAGGCC A UGCAGUGG 2195 CCACTGCA GGCTAGCTACAACGA GGCCTGAG 5218 



Input Sequence = AF100308. Cut Site = YG/M or UG/U. 
Stem Length = 8 . Core Sequence = GGCTAGCTACAACGA 
AF100308 (Hepatitis B virus strain 2-18, 3215 bp) 
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Table 44 

Table 44: Group Designation and Dosage levels for HBV transgenic mouse study 



Group 


Compound 


Dose 


Number of Mice 


Duration of 
Treatment 


1 


RPI. 18341 

(site 273) 


100 mg/kg/day* 


10F 


14 days 


2 


RPI.18371 

(site 1833) 


100 mg/kg/day* 


10F 


14 days 


3 


RPI.18418 

(site 1873) 


100 mg/kg/day* 


10F 


14 days 


4 


RPI.18372 
(site 1874) 


100 mg/kg/day* 


10F 


14 days 


5 


Saline control 


100 mg/kg/day* 


10F 


14 days 


6 


Untreated 




10F 


0 days 



*administered via sc infusion using Alzet mini-osmotic pumps 
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Table 45 

TABLE 45. NUCLEOSIDES USED FOR CHEMICAL SYNTHESIS OF MODIFIED NUCLEOTIDE 
TRIPHOSPHATES 





NUCLEOSIDES 


Abbreviation 


CHEMICAL STRUCTURE 


1 


2'-0-methyl-2,6- 
diaminopurine riboside 


2'-0-Me-DAP 


NH 2 

HO OCH 3 


2 


2'-deoxy-2'amino-2,6- 
diaminopurine riboside 


2'-NH 2 -DAP 


NH 2 

HO NH 2 


3 


2'-(N-alanyl)amino-2'- 
deoxy-uridine 


ala-2'- NHj U 


1 1 

H0 V. ^N^O 
HO HN v ^c" CH 3 

Y "nh 2 
o 


4 


2'-(N- 

phenylalanyl)arninp-2'- 
deoxy-uridine 


phe-2'- NH 2 -U 


HO. ^•N'^O 

to, 

HO HN^ J3-CH 2 Ph 
0 


5 


2'-(N-p-alanyl) amino- 
2'-deoxy uridine 


2'-p-Ala-NHi-U 


o 

ll 

HO HN^ 
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6 


2'-Deoxy-2'-(lysiyl) 
amino uridine 


2'-L-lys-NH2-U 


0 

HO HN^ 

NH 2 


7 


2'-C-allyl uridine 


2'-C-allyl-U 


HO ^N-^O 
HO 


8 


2'-0-amino-uridine 


2'-0-NH 2 -U 


0 

HO 0^ 

NH 2 


9 


2'-0-methy ! th i omethy 1 
adenosine 


2'-0-MTM-A 


NH 2 

HO O 

S 
\ 


10 


cytidine 


2'-0-MTM-C 


HO ^ 
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11 


2'-0-methylthiomethyl 
guanosine 


2'-0-MTM-G 


0 

HO i 

s> 


12 


2'-0-methylthiomethyl- 
uridine 


2'-0-MTM-U 


HO O^ 

\ 


13 


2'-(N-histidyl) amino 
uridine 


2'-his-NH 2 -U 


0 

HO HN^ 

NH 2 ^ 


14 


2'-Deoxy-2'-amino-5- 
methyl cytidine 


5-Me-2'-NH2-C 


NH 2 
HO NH 2 
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15 


2'-(N-p-carboxamid i ne- 

P-alanyl)amino-2'- 

deoxy-uridine 


p-ala-CA-NH2-U 


O 

\ f H-> H KI1J 

HO HN , C y ^ 

V H 2 NH 
O 


16 


2'-(N-p-alanyl) 
guanosine 


p-Ala-NH 2 -G 


0 

HO 

o 


17 


2'-0-Amino-Uridine 


2'-0-NH 2 -U 


w 

HO 0. 

NH 2 


18 


2'-(N-lysyl)amino -2'- 
deoxy-cytidine 


2'- NH 2 -lys-C 
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19 


2'-Deoxy -2'-(L- 
histidine) amino 
Cytidine 


2'- Nhh-his-C 


ljlH2 

HO HN^/ 


20 


5-lmidazoleacetic acid 
2'-deoxy uridine 


5-IAA-U 


HO 


21 


5-{3-(N-4- 

imidazoleacetyl)amino 
propynyl]-2'-0-methyl 
uridine 


5-IAA- 
propynylamino- 
2'-OMe U 


f? 

HO OCH 3 


22 


5-(3-aminopropynyl)- 
2 -O-methyl uridine 


5-aminopropynyl- 
2'-OMe U 


HO OCH 3 


23 


5-( 3-a m i nop ropy 1 )-2 '- 
O-methyl uridine 


5-aminopropyl- 
2'-OMe U 


HO OCH3 


24 


5-[3-(N-4- 

imidazoleacetyl)amino 

propyl]-2'-0-methyl 

Uridine 


5-IAA- 
propylamino-2 1 - 
OMeU 


^W^ s y- {CH2)3NHCOCh ' 2 ]T\ 

Ha ysV V 

HO 0CH 3 



WO 01/16312 



603 



PCT/US00/23998 



Table 45 



25 


5-(3-aminopropyl)-2'- 
deoxy-2-fluoro uridine 


5-aminopropyl- 
2'-F dU 


Av^-(CH 2 ) 3 NH 2 

„rf 


26 


2'-Deoxy-2'-(p-alanyl- 

L-histidyl)amino 

Uridine 


2'-amino-p-ALA- 
HlSdU 


o 

I .H 
HO HN^ 

^ P 


27 


2'-deoxy-2'-p- 
alaninamido-uridine 


2'-p-ALA dU 




28 


3-(2'-deoxy-2'-fluoro-|3- 
D- 

ri bof uranosy 1) pi perazi n 
o[2,3-D]pyrimidine-2- 
one 


2'-F piperazino- 
pyrimidinone 


HO F 


29 


5-[3-(N-4- 

imidazoleacetyDamino 
propyl]-2'-deoxy-2'- 
fluoro Uridine 


5-IAA- 
propylamino-2'-F 
dU 


^Xjp-(CH 2 )3NHCOCH 2 
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Table 45 



30 


5-[3-(N-4- 

imidazoleacetyDamino 
p ropyny l]-2 '-deoxy-2'- 
fluoro uridine 


5-IAA- 

propynylamino- 
2'-FdU 




31 


5-E-[2-carboxyvinyl-2'- 
deoxy-2'-fluoro uridine 


5-carboxyvinyl- 
2'-F dU 


HOP 


32 


5-[3-(IM-4- 

aspartyDaminopropyny 
l-2'-fluoro uridine 


5-ASP- 
aminopropyl-2'-F- 
dU 


H |T^ >S ^^ NHCO CHNH 2 CH 2 COO H 


33 


5-(3-aminopropyl)-2'- 
deoxy-2-fluoro cytidine 


5-aminopropyl- 
2'-F dC 


H 


34 


5-[3-(N-4- 

succynyl)aminopropv-l- 
2'-d eoxy-2-fluoro 
cytidine 


5-succynylamino- 
propyl-2'-F dC 
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Table 46: PHOSPHORYLATION OF URIDINE IN THE PRESENCE OF DMAP 



0 equiv. DMAP 


0.2 equiv. DMAP 


0.5 equiv. DMAP 


1.0 equiv. DMAP 


Time 
(min) 


Product 

% 


Time 
(min) 


Product 

% 


Time 
(min) 


Product 

% 


Time 
(min) 


Product 

% 


0 


1 


0 


0 


0 


0 


0 


0 


40 












30 




80 


10 


50 


24 


60 


46 


70 


77 


120 


12 


90 


33 


100 


57 


110 


84 


160 


14 


130 


39 


140 


63 


150 


83 


200 


17 


170 


43 


180 


63 


190 


84 


240 


19 


210 


47 


220 


64 


230 


77 


320 


20 


250 


48 


260 


68 


270 


79 


1130 


48 


290 


49 


300 


64 


310 


77 


1200 


46 


1140 


68 


1150 


76 


1160 


72 






1210 


69 


1220 


76 


1230 


74 
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p 
~a. 
E 
a> 
H 


LO LO LO LO LO LO 
CM. CM CM CM CM CM 




i-^ i^. r^. r^. 

CO CO CO CO CO CO 


O — 
LU o* 
Q. — 


1 Tt Tf 




1 Tj- Tf Tf 


LiCI 
(mM) 








METHANOL 

(%) 


O O , O , O 


t 
: 


o o , o , o 


Triton 
X-100 (%) 


0.01 
0.01 

0.002 
0.002 
0.002 
0.002 




0.01 
0.01 
0.002 
0.002 
0.002 
0.002 


Spermidine 
(mM) 




s 


lO IT) X- T~ T- T- 


DTT 

(mM) 




i 

'- 
-- 


^ m m m m 


of 
f£ 


O O CM CM CM CM 
CM CM i- t- t- -r- 




O O CM CM CM CM 
CM CM i- t- t- t- 


TRIS-HCL 
(mM) 


q q -<- t- ■<- 

00 00 00 00 CO CO 

1 X X I I I 
aa a. a. a. a. 

o o o o o o 
t t t ^ t 




40 (pH 8.0) 
40 (pH 8.0) 
40 (pH 8.1) 
40 (pH 8.1) 
40 (pH 8.1) 
40 (pH 8.1) 


Condition 
No. 


t- CM CO LO CD 




r- oo ex. o £ £ 
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8 




o 


































































































aminopropyl UTP j 


2'-F-C5-propylamine CTP | 


a. 
o 

0) 

S 

6 

CM 


2'-0-Me UTP i 


2'-0-Me 5-3-aminopropyl 
UTP 


2'-0-Me 5-3-aminopropyl 
UTP 
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Table 49: INCORPORATION OF MODIFIED NUCLEOTIDE TRIPHOSPHATES 
USING WILD TYPE BACTERIOPHAGE T7 POLYMERASE 



Modification 


label 


% ribo control 


2'-NH2-GTP 


ATP 


4% 


2'-dGTP 


ATP 


3% 


2'-0-Me-GTP 


ATP 


3% 


2'-F-GTP 


ATP 


4% 


2'-0-MTM-CTP 


ATP 


3% 


2'-NH2-UTP 


ATP 


39% 


2'-dTTP 


ATP 


5% 


2'-0-Me-UTP 


ATP 


3% 


ala-2'-NH 2 -UTP 


ATP 


2% 


phe-2'-NH 2 - UTP 


ATP 


1% 


2'-p-ala-NH 2 -UTP 


ATP 


3% 


2'-C-allyl-UTP 


ATP 


2% 


2'-0-NH 2 -UTP 


ATP 


1% 


2'-0-MTM-UTP 


ATP 


64% 


2'-NH 2 -ATP 


GTP 


1% 


2'-0-MTM-ATP 


GTP 


1% 


2'-NH 2 -CTP 


GTP 


59% 


2'-dCTP 


GTP 


40% 


2'-F-CTP 


GTP 


1 00% 


2'-F-UTP 


GTP 


100% 


2'-F-TTP 


GTP 


0% 


2'-F-C5-carboxyvinyl UTP 


GTP 


100% 


2'-F-C5-aspartyl-aminopropyl 
UTP 


GTP 


100% 


2'-F-C5-propylamine CTP 


GTP 


100% 


2'-0-Me CTP 


GTP 


0% 


2' O-Me UTP 


GTP 


0% 


2' O-Me 5-3-aminopropyl 
UTP 


GTP 


0% 


2'-0-Me 5-3-aminopropyl 
UTP 


GTP 


0% 
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Table 50 

Table 50 a: Incorporation of 2'-his-UTP and Modified CTP's 



modification 


2 , -his-UTP 


rUTP 


CTP 


16.1 


100 


2'-amino-CTP 


9.5* 


232.7 


2'-deoxy-CTP 


9.6* 


130.1 


2'-OMe-CTP 


1.9 


6.2 


2--MTM-CTP 


5.9 


5.1 








control 


1.2 





Table 50 b: Incorporation of 2'-his-UTP, 2-amino CTP, and Modified ATP's 



modification 


2'-his-UTP and 
2'-amino-CTP 


rUTP and rCTP 


ATP 


15.7 


100 


2-amino-ATP 


2.4 


28.9 


2'-deoxy-ATP 


2.3 


146.3 


2'-OMe-ATP 


2.7 


15 


2'-F-ATP 


4 


222.6 


2--MTM-ATP 


4.7 


15.3 


2'-OMe-DAP 


1.9 


5.7 


2 , -amino-DAP 


8.9* 


9.6 



Numbers shown are a percentage of incorporation compared to the all-RNA 
control 

* -Bold number indicates best observed rate of modified nucleotide triphosphate 
incorporation 
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Table 51: INCORPORATION OF 2'-his-UTP, 2'-NH 2 -CTP, 2'-NH 2 -DAP, and rGTP 
USING VARIOUS REACTION CONDITIONS 



Conditions 


compared to all 
rNTP 


7 


8.7* 


8 


7* 


9 


2.3 


10 


2.7 


11 


1.6 


12 


2.5 



Numbers shown are a percentage of incorporation compared to the all-RNA 
control 

* Two highest levels of incorporation contained both methanol and LiCI 
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Table 52 



Table 52: Selection of Oligonucleotides with Ribozyme Activity 



pool 


Generation 


time 


substrate 
rem3ining(%) 


time 


reiri3inincj (%) 


N60 


o 


4 hr 


100.00 


24 hr 


100 98 


N60 


14 


4 hr 


99 67 


24 hr 


97 51 


N60 


1 5 


4 hr 


98.76 


24 hr 


96 76 


N60 


16 


4 hr 


97 09 


24 hr 


96 60 


N60 


1 7 


4 hr 


79 50 


24 hr 


64 01 


N40 


o 


4 hr 


99 89 


24 hr 


99 78 


N40 


1 0 


4 hr 


99 74 


24 hr 


99 42 


N40 


1 1 


4 hr 


97 18 


24 hr 


90 38 


N40 












N40 


— ri — 


— 4~rlr — 


54 28 


Ulir 


36 46 


N20 


o 


4 hr 


99 18 


24 hr 


1 00 00 


N20 


1 1 


4 hr 


100.00 


24 hr 


100.00 


N20 


12 


4 hr 


99.51 


24 hr 


100.00 


N20 


13 


4hr 


90.63 


24 hr 


84.89 


N20 


14 


4 hr 


91.16 


24 hr 


85.92 


N60B 


0 


4 hr 


100.00 


24 hr 


100.00 


N60B 


1 


4hr 


100.00 


24 hr 


100.00 


N60B 


2 


4 hr 


100.00 


24 hr 


100.00 


N60B 


3 


4 hr 


100.00 


24 hr 


100.00 


N60B 


4 


4 hr 


99.24 


24 hr 


100.00 


N60B 


5 


4 hr 


97.81 


24 hr 


96.65 


N60B 


6 


4 hr 


89.95 


24 hr 


77.14 I 
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Table 53 



Table 53: Kinetic Activity of Combinatorial Libraries 



Pool 


Generation 


K 0 bs (mm ; 


N60 


17 


0.0372 




18 


0.0953 




19 


0.0827 




N40 


12 


0.0474 




13 


0.037 




14 


0.065 




15 


0.0254 


N20 


13 


0.0359 




14 


0.0597 




15 


0.0549 




16 


0.0477 


N60B 


6 


0.0209 




7 


0.0715 




8 


0.0379 
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Table 54 



Table 54: Kinetic Activity of Clones within N60 and N40 Combinatorial Libraries 



1 G18 


... 


3ctivity(min ) 


■Vel ' 




— 'nTo^ — 




1 00 ' 




N60 


u.uooa 


17 21 




r|i? 


u.uuuouy 


0 27 ' 




Riln 


U.UUUOf o 


[rfi 






U.UU1 U4 




0-8 


mSS 




u.uuu/ oy 


rPv^ 

T^cj 


0-1 1 








0-1 2 


MKn 


\J.V\J£.Zh 


TTqq 


0-1 3 


Rd! — 




— TTlij — 


I 0-14 


N60 


0.000878 


0.39 


j 0-15 


N60 


0.0000686 


0.03 


0-21 


N60 


0.0109 


4.82 


0-22 


N60 


0.000835 


0.37 


i 0-24 


N60 


0.000658 


0.29 


0-28 


N40 


0.000741 


0.33 


! 0-35 


N40 


0.00658 


2.91 


3-1 


N40 


0.0264 


11.68 


3-3 


N40 


0.000451 


0.20 


3-7 


N40 


0.000854 


0.38 


j 3-15 


N40 


0.000832 


0.37 
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Table 55 

Table 55: Effect of Magnesium Concentration of the Cleavage Rate of N20 





kobsfmin" 1 ) 


25 


0.0259 


20 


0.0223 


15 


0.0182 


I 10 


0.0208 


5 


0.0121 


I 2 


0.00319 


2 


0.00226 



WO 01/16312 



616 



PCT7US00/23998 



)8Si 



III 



88 



8 8 E 



| 
B 



3 B 6 B B 



8BBB 



JBBB 



? 5 1 S 1 



j B B B B B e 



B B B B B B 



BBBBBBBBBBBBB 



BBBBBBB 



BB 



BB 



BB 
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Table 57 

Table 57. Additional Class II enzymatic nucleic acid Motifs 



Class II 
Motif ID 


Sequence 


Seq ID No. 


Kinetic 
Rate 


A2 


GGGAGGAGGAAGUGCCUGGUCAGUCACACCGAGACUGGCAGACGCUGAAACC 
GCCGCGCUCGCUCCCAGUCC 


77 


UNK 


A12 


GGGAGGAGGAAGUGCCUGGUAGUAAUAUAAUCGUUACUACGAGUGCAAGGUC 
GCCGCGCUCGCUCCCAGUCC 






All 


GGGAGGAGGAAGUGCCUGGUAGUUGCCCGAACUGUGACUACGAGUGAGGUC 

:c 


79 


UNK 


B14 


GGGAGGAGGAAGUGCCUGGCGAUCAGAUGAGAUGAUGGCAC-ACGCAGAGACC 
GCCGCGCUCGCUCCCAGUCC 


80 


UNK 


BIO 


GGGAGGAGGAAGUGCCUGGCGACUGAUACGAAAAGUCGCAGUUUCGAAACC 
GCCGCGCUCGCUCCCAGUCC 


81 


UNK 


B21 


GGGAGGAGGAAGUGCCUGGCGACUGAUACGAAAAGUCGCAC-GUUUCGAAACC 


82 


UNK 


B7 


GGGAGGAGGAAGUGCCUUGGCUCAGCAUAAGUGAGCAGAUUGCGACACC 
GCCGCGCUCGCUCCCAGUCC 


83 


UNK 


C8 


GGGAGGAGGAAGUGCCUUGGUCAUUAGGAUGACAAACGUAUACUGAACACU 
GCCGCGCUCGCUCCCAGUCC 


84 


0.01 
MIN" 1 
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aaaaaaaaa? 


iSHaaaaaaaaHsassasss 


isiliiiii 


iiiiiiiiimimtiiii 


erbB2-1199 Zin.Rz-6 amino stabl 


erbB2-12 92 Zin.Rz-6 amino stabl 
erbB2-1292 zin.Rz-7 amino stabl 
erbB2-l313 Zin.Rz-7 amino stabl 

erbB2-1536 Zin.Rz-6 amino stabl 

erbB2-1757 Zin.Rz-6 amino stabl 
erbB2-1757 Zin.Rz-7 amino stabl 

erbB2-1784 Zin.Rz-6 amino stabl 
erbB2-i784 Zin.Rz-7 amino stabl 
erbB2-2063 Zin,Rz-6 amino stabl 


s s s ^ 




I!!! 


isilff 




T| | 3 55 5 ? gF| g ? | 5 5 5 ? 5 I I 


11111111111111 nil nil iti mill 
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!!!!! 



!! 



Hi 



!!'!! 



I!!!! 
si si 

mi; 



iiiii 



i iliiiiiiiiiiiiiiili 
liiilSjlSiilijiijjl 



mi 



ml 



1 3 



I 3 



8 1 1 EE 1 5 1 1 i 1 1 i 



1 1 1 i I I 5 E 1 1 1 1 1 i I 5 1 



sill 
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a as s s s i s s ss s 


1 1 1 1|| |i i|i | 


i i II 1 1 1 1 i 1 1 s 


s iiiiiiiiiiiii 
* i i i iiiii 

lllllllllllli 


iliililiiil! 

iJiiililiiil 

iiilliimil 






i iilliiil 


1 | 1 1 1 | | | j | 1 1 J 

Iii[SiiliSS| HI 


1 1 1 1 i 3 1 1 1 1 1 1 1 


| ; ; |U | 1 1 1 I S ||| 


II 111 
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Table 59 

Table 59: Human HER2 Class II (zinzyme) Ribozyme and Target Sequece 



Pos 


Substrate 


Seq ID 


Ribozyme 


RzSeq 
ID 


46 


GGGCAGCC G CGCGCCCC 


310 


GGGGCGCG GCCGAAAGGCGAGUCAAGGUCU GGCUGCCC 


895 


48 


GCAGCCGC G CGCCCCUU 


311 


AAGGGGCG GCCGAAAGGCGAGUCAAGGUCU GCGGCUGC 


896 


50 


AGCCGCGC G CCCCUUCC 


312 


GGAAGGGG GCCGAAAGGCGAGUCAAGGUCU GCGCGGCU 


897 


75 


CCUUUACU G CGCCGCGC 


313 


GCGCGGCG GCCGAAAGGCGAGUCAAGGUCU AGUAAAGG 


898 


77 


UUUACUGC G CCGCGCGC 


314 


GCGCGCGG GCCGAAAGGCGAGUCAAGGUCU GCAGUAAA 


899 


80 


ACUGCGCC G CGCGCCCG 


315 


CGGGCGCG GCCGAAAGGCGAGUCAAGGUCU GGCGCAGU 


900 


82 


UGCGCCGC G CGCCCGGC 


316 


GCCGGGCG GCCGAAAGGCGAGUCAAGGUCU GCGGCGCA 


901 


84 


CGCCGCGC G CCCGGCCC 


317 


GGGCCGGG GCCGAAAGGCGAGUCAAGGUCU GCGCGGCG 


902 


102 


CACCCCUC G CAGCACCC 


318 


GGGUGCUG GCCGAAAGGCGAGUCAAGGUCU GAGGGGUG 


903 


112 


AGCACCCC G CGCCCCGC 


319 


GCGGGGCG GCCGAAAGGCGAGUCAAGGUCU GGGGUGCU 


904 


114 


CACCCCGC G CCCCGCGC 


320 


GCGCGGGG GCCGAAAGGCGAGUCAAGGUCU GCGGGGUG 


905 


119 


CGCGCCCC G CGCCCUCC 


321 


GGAGGGCG GCCGAAAGGCGAGUCAAGGUCU GGGGCGCG 


906 


121 


CGCCCCGC G CCCUCCCA 


322 


UGGGAGGG GCCGAAAGGCGAGUCAAGGUCU GCGGGGCG 


907 


163 


CCGGAGCC G CAGUGAGC 


323 


GCUCACUG GCCGAAAGGCGAGUCAAGGUCU GGCUCCGG 


908 


194 


GGCCUUGU G CCGCUGGG 


324 


CCCAGCGG GCCGAAAGGCGAGUCAAGGUCU ACAAGGCC 


909 


197 


CUUGOGCC G CUGGGGGC 


325 


GCCCCCAG GCCGAAAGGCGAGUCAAGGUCU GGCACAAG 


910 


214 


UCCUCCUC G CCCUCUUG 


326 


CAAGAGGG GCCGAAAGGCGAGUCAAGGUCU GAGGAGGA 


911 


222 


GCCCUCUU G CCCCCCGG 


327 


CCGGGGGG GCCGAAAGGCGAGUCAAGGUCU AAGAGGGC 


912 


235 


CCGGAGCC G CGAGCACC 


328 


GGUGCUCG GCCGAAAGGCGAGUCAAGGUCU GGCUCCGG 


913 


251 


CCAAGUGU G CACCGGCA 


329 


UGCCGGUG GCCGAAAGGCGAGUCAAGGUCU ACACUUGG 


914 


273 


AUGAAGCU G CGGCUCCC 


330 


GGGAGCCG GCCGAAAGGCGAGUCAAGGUCU AGCUUCAU 


915 


283 


GGCUCCCU G CCAGUCCC 


331 


GGGACUGG GCCGAAAGGCGAGUCAAGGUCU AGGGAGCC 


916 


309 


CUGGACAU G CUCCGCCA 


332 


UGGCGGAG GCCGAAAGGCGAGUCAAGGUCU AUGUCCAG 


917 


314 


CAUGCUCC G CCACCUCU 


333 


AGAGGUGG GCCGAAAGGCGAGUCAAGGUCU GGAGCAUG 


918 


332 


CCAGGGCU G CCAGGUGG 


334 


CCACCUGG GCCGAAAGGCGAGUCAAGGUCU AGCCCUGG 


919 


342 


CAGGUGGU G CAGGGAAA 


335 


UUUCCCUG GCCGAAAGGCGAGUCAAGGUCU ACCACCUG 


920 


369 


ACCUACCU G CCCACCAA 


336 




921 


379 


CCACCAAU G CCAGCCUG 


337 


CAGGCUGG GCCGAAAGGCGAGUCAAGGUCU AUUGGUGG 


922 


396 


UCCUUCCU G CAGGAUAU 


338 


AUAUCCUG GCCGAAAGGCGAGUCAAGGUCU AGGAAGGA 


923 


414 


CAGGAGGU G CAGGGCUA 


339 


UAGCCCUG GCCGAAAGGCGAGUCAAGGUCU ACCUCCUG 


924 


426 


GGCUACGU G CUCAUCGC 


340 


GCGAUGAG GCCGAAAGGCGAGUCAAGGUCU ACGUAGCC 


925 


433 


UGCUCAUC G CUCACAAC 


341 


GUUGUGAG GCCGAAAGGCGAGUCAAGGUCU GAUGAGCA 


926 


462 


GUCCCACU G CAGAGGCU 


342 


AGCCUCUG GCCGAAAGGCGAGUCAAGGUCU AGUGGGAC 




471 


CAGAGGCU G CGGAUUGU 


343 


ACAAUCCG GCCGAAAGGCGAGUCAAGGUCU AGCCUCUG 


928 




CGGAUUGU G CGAGGCAC 


344 


GUGCCUCG GCCGAAAGGCGAGUCAAGGUCU ACAAUCCG 


929 


511 


ACAACUAU G CCCUGGCC 


345 


GGCCAGGG GCCGAAAGGCGAGUCAAGGUCU AUAGUUGU 


930 


522 


CUGGCCGU G CUAGACAA 


346 


UUGUCUAG GCCGAAAGGCGAGUCAAGGUCU ACGGCCAG 


931 


54 0 


GGAGACCC G CUGAACAA 


347 


UUGUUCAG GCCGAAAGGCGAGUCAAGGUCU GGGUCUCC 


932 


585 


GGAGGCCU G CGGGAGCU 


348 


AGCUCCCG GCCGAAAGGCGAGUCAAGGUCU AGGCCUCC 


933 


594 


CGGGAGCU G CAGCUUCG 


349 


CGAAGCUG GCCGAAAGGCGAGUCAAGGUCU AGCUCCCG 


934 


659 


CCAGCUCU G CUACCAGG 


350 


CCUGGUAG GCCGAAAGGCGAGUCAAGGUCU AGAGCUGG 


935 


73 7 


CACCAACC G CUCUCGGG 


351 


CCCGAGAG GCCGAAAGGCGAGUCAAGGUCU GGUUGGUG 


936 


74 9 


UCGGGCCU G CCACCCCU 


352 


AGGGGUGG GCCGAAAGGCGAGUCAAGGUCU AGGCCCGA 
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1274 
1305 



1482 
1492 
1500 
1509 
1539 
1541 
1598 
1605 



GGGCUCCC G CUGCUGGG 

CUCCCGCU G CUGGGGAG 

AGCCUGAC G CGCACUGU 

CCUGACGC G CACUGUCU 

CUGUCUGU G CCGGUGGC 

GUGGCUGU G CCCGCUGC 

CUGUGCCC G CUGCAAGG 

UGCCCGCU G CAAGGGGC 

GGGCCACU G CCCACUGA 

CACUGACU G CUGCCAUG 

UGACUGCU G CCAUGAGC 

AGCAGUGU G CUGCCGGC 

AGUGUGCU G CCGGCUGC 

UGCCGGCU G~CACC-GGCC 

CUCUGACU G CCUGGCCU 

CCUGGCCU G CCUCCACU 

UGUGAGCU G CACUGCCC 

GCUGCACU G CCCAGCCC 

GAGUCCAU G CCCAAUCC 

CAUUCGGC G CCAGCUGU 

GUGUGACU G CCUGUCCC 

GGGAUCCU G CACCCUCG 

CCUCGUCU G CCCCCUGC 

UGCCCCCU G CACAACCA 

UGAGAAGU G CAGCAAGC 

AGCCCUGU G CCCGAGUG 

CCGAGUGU G CUAUGGUC 

GAGCACUU G CGAGAGGU 

UUACCAGU G CCAAUAUC 

AGGAGUUU G CUGGCUGC 

UGCUGGCU G CAAGAAGA 

GCAUUUCU G CCGGAGAG 

CCAACACU G CCCCGCUC 

ACUGCCCC G CUCCAGCC 

GACAGCCU G CCUGACCU 

CAGAACCU G CAAGUAAU 

CGAAUUCU G CACAAUGG 



CCCAGCAG GCCGAAAGGCGAGUCAAGGUCU GGGAGCCC 
CUCCCCAG GCCGAAAGGCGAGUCAAGGUCU AGCGGGAG 
ACAGUGCG GCCGAAAGGCGAGUCAAGGUCU GUCAGGCU 
AGACAGUG GCCGAAAGGCGAGUCAAGGUCU GCGUCAGG 
GCCACCGG GCCGAAAGGCGAGUCAAGGUCU ACAGACAG 
GCAGCGGG GCCGAAAGGCGAGUCAAGGUCU ACAGCCAC 
CCUUGCAG GCCGAAAGGCGAGUCAAGGUCU GGGCACAG 
GCCCCUUG GCCGAAAGGCGAGUCAAGGUCU AGCGGGCA 
UCAGUGGG GCCGAAAGGCGAGUCAAGGUCU AGUGGCCC 
CAUGGCAG GCCGAAAGGCGAGUCAAGGUCU AGUCAGUG 
GCUCAUGG GCCGAAAGGCGAGUCAAGGUCU AGCAGUCA 
GCCGGCAG GCCGAAAGGCGAGUCAAGGUCU ACACUGCU 
GCAGCCGG GCCGAAAGGCGAGUCAAGGUCU AGCACACU 
GGCCCGUG GCCGAAAGGCGAGUCAAGGUCU AGCCGGCA 
AGGCCAGG GCCGAAAGGCGAGUCAAGGUCU AGUCAGAG 
AGUGGAGG GCCGAAAGGCGAGUCAAGGUCU AGGCCAGG 
GGGCAGUG GCCGAAAGGCGAGUCAAGGUCU AGCUCACA 
GGGCUGGG GCCGAAAGGCGAGUCAAGGUCU AGUGCAGC 
GGAUUGGG GCCGAAAGGCGAGUCAAGGUCU AUGGACUC 
ACAGCUGG GCCGAAAGGCGAGUCAAGGUCU GCCGAAUG 
GGGACAGG GCCGAAAGGCGAGUCAAGGUCU AGUCACAC 
CGAGGGUG GCCGAAAGGCGAGUCAAGGUCU AGGAUCCC 
GCAGGGGG GCCGAAAGGCGAGUCAAGGUCU AGACGAGG 
UGGUUGUG GCCGAAAGGCGAGUCAAGGUCU AGGGGGCA 
GCUUGCUG GCCGAAAGGCGAGUCAAGGUCU ACUUCUCA 
CACUCGGG GCCGAAAGGCGAGUCAAGGUCU ACAGGGCU 
GACCAUAG GCCGAAAGGCGAGUCAAGGUCU ACACUCGG 
ACCUCUCG GCCGAAAGGCGAGUCAAGGUCU AAGU3CUC 
GAUAUUGG GCCGAAAGGCGAGUCAAGGUCU ACUG3UAA 
GCAGCCAG GCCGAAAGGCGAGUCAAGGUCU AAACUCCU 
UCUUCUUG GCCGAAAGGCGAGUCAAGGUCU AGCCAGCA 
CUCUCCGG GCCGAAAGGCGAGUCAAGGUCU AGAAAUGC 
GAGCGGGG GCCGAAAGGCGAGUCAAGGUCU AGUGUUGG 
GGCUGGAG GCCGAAAGGCGAGUCAAGGUCU GGGGCAGU 
AGGUCAGG GCCGAAAGGCGAGUCAAGGUCU AGGCUGUC 
AUUACUUG GCCGAAAGGCGAGUCAAGGUCU AGGUUCUG 
GCCGAAAGGCGAGUCAAGGUCU AGAAUUCG 



ACAAUGGC G CCUACUCG 



CGAGUAGG GCCGAAAGGCGAGUCAAGGUCU GCCAUUGU 



GCCUACUC G 



AGGGUCAG GCCGAAAGGCGAGUCAAGGUCU GAGUAGCC 



CUGACCCU G CAAGGGCU 



AGCCCUUG GCCGAAAGGCGAGUCAAGGUCU AGGGUCAG 



CUGGGGCU C 



AGUGAGCG GCCGAAAGGCGAGUCAAGGUCU AGCCCCAG 



GGGGCUGC G CUCACUGA 



UCAGUGAG GCCGAAAGGCGAGUCAAGGUCU GCAGCCCC 



CCACCUCU G CUUCGUGC 



GCACGAAG GCCGAAAGGCGAGUCAAGGUCU AGAGGUGG 



UGCUUCGU G CACACGGU 



ACCGUGUG GCCGAAAGGCGAGUCAAGGUCU ACGAAGCA 



1S41 
1653 



CACACGGU G CCCUGGGA 
CGGAACCC G CACCAAGC 



CAAGCUCU G CUCCACAC 



GCCGAAAGGCGAGUCAAGGUCU ACCGUGUG 
GCUUGGUG GCCGAAAGGCGAGUCAAGGUCU GGGUUCCG 
GUGUGGAG GCCGAAAGGCGAGUCAAGGUCU AGAGCUUG 
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1862 
1936 
1961 
1970 
1 973 
2007 



UCCACACU G CCAACC3G 
CCUGGCCU G CCACCAGC 
CCAGCUGU G CGCCCGAG 
AGCUGUGC G CCCGAGGG 



J G CAGCCAGU 



GGAGGAAU G CCGAGUAC 
CGAGUACU G CAGGGGCU 
AUGUGAAU G CCAGGCAC 
CACUGUUU G CCGUGCCA 
UUUGCCGU G CCACCCUG 
UGGCCUGU G CCCACUAU 
UCCCUUCU G CGUGGCCC 
CGUGGCCC G CUGCCCCA 



GGCCCGCU G 



CCGGUUGG GCCGAAAGGCGAGUCAAGGUCU AGUGUGGA 
GCUGGUGG GCCGAAAGGCGAGUCAAGGUCU AGGCCAGG 
CUCGGGCG GCCGAAAGGCGAGUCAAGGUCU ACAGCUGG ~~ 
CCCUCGGG GCCGAAAGGCGAGUCAAGGUCU GCACAGCU 
GACCCCAG GCCGAAAGGCGAGUCAAGGUCU AGUGCCCU 
ACUGGCUG GCCGAAAGGCGAGUCAAGGUCU AGUUGACA 
CCUCCACG GCCGAAAGGCGAGUCAAGGUCU ACUCCUGG 
GUACUCGG GCCGAAAGGCGAGUCAAGGUCU AUUCCUCC 
AGCCCCUG GCCGAAAGGCGAGUCAAGGUCU AGUACUCG 
GUGCCUGG GCCGAAAGGCGAGUCAAGGUCU AUUCACAU 
UGGCACGG GCCGAAAGGCGAGUCAAGGUCU AAACAGUG 
CAGGGUGG GCCGAAAGGCGAGUCAAGGUCU ACGGCAAA 
AUAGUGGG GCCGAAAGGCGAGUCAAGGUCU ACAGGCCA 
GGGCCACG GCCGAAAGGCGAGUCAAGGUCU AGAAGGGA 
UGGGGCAG GCCGAAAGGCGAGUCAAGGUCU GGGCCACG 
CGCUGGGG GCCGAAAGGCGAGUCAAGGUCU AGCGGGCC 



UCCUACAU G CCCAUCUG 



CAGAUGGG GCCGAAAGGCGAGUCAAGGUCU AUGUAGGA 



AGGAGGGC G CAUGCCAG 



CUGGCAUG GCCGAAAGGCGAGUCAAGGUCU GCCCUCCU 



GGGCGCAU G CCAGCCUU 



AAGGCUGG GCCGAAAGGCGAGUCAAGGUCU AUGCGCCC 



2051 
2063 
2099 



CCAGCCUU G CCCCAUCA 



UGAUGGGG GCCGAAAGGCGAGUCAAGGUCU AAGGCUGG 



CAUCAACU G CACCCACU 



AGUGGGUG GCCGAAAGGCGAGUCAAGGUCU AGUUGAUG 



CAAGGGCU G CCCCGCCG 



CGGCGGGG GCCGAAAGGCGAGUCAAGGUCU AGCCCUUG 



GCUGCCCC G CCGAGCAG 



CUGCUCGG GCCGAAAGGCGAGUCAAGGUCU GGGGCAGC 



UCAUCUCU G CGGUGGUU 



AACCACCG GCCGAAAGGCGAGUCAAGGUCU AGAGAUGA 



2160 
2235 



GGCAUUCU G CUGGUCGU 



ACGACCAG GCCGAAAGGCGAGUCAAGGUCU AGAAUGCC 



UACACGAU G CGGAGACU 



AGUCUCCG GCCGAAAGGCGAGUCAAGGUCU AUCGUGUA 



CGGAGACU G CUGCAGGA 



UCCUGCAG GCCGAAAGGCGAGUCAAGGUCU AGUCUCCG 



AGACUGCU G CAGGAAAC 



GUUUCCUG GCCGAAAGGCGAGUCAAGGUCU AGCAGUCU 



2271 
2292 
2304 
2310 
2349 
2362 



GUGGAGCC G CUGACACC 



GGUGUCAG GCCGAAAGGCGAGUCAAGGUCU GGCUCCAC 



GGAGCGAU G CCCAACCA 



UGGUUGGG GCCGAAAGGCGAGUCAAGGUCU AUCGCUCC 



AACCAGGC G CAGAUGCG 



CGCAUCUG GCCGAAAGGCGAGUCAAGGUCU GCCUGGUU 



GCGCAGAU G CGGAUCCU 



AGGAUCCG GCCGAAAGGCGAGUCAAGGUCU AUCUGCGC 



GUGAAGGU G CUUGGAUC 



GAUCCAAG GCCGAAAGGCGAGUCAAGGUCU ACCUUCAC 



GAUCUGGC G CUUUUGGC 



GCCAAAAG GCCGAAAGGCGAGUCAAGGUCU 



UGUCUCCC G 



CCAGAAGG GCCGAAAGGCGAGUCAAGGUCU GGGAGACA 



GGGCAUCU G CCUGACAU 



AUGUCAGG GCCGAAAGGCGAGUCAAGGUCU 



2556 
2577 



UCCACGGU G 



ACCAGCUG GCCGAAAGGCGAGUCAAGGUCU 



CAGCUUAU G CCCUAUGG 



CCAUAGGG GCCGAAAGGCGAGUCAAGGUCU AUAAGCUG 



CUAUGGCU G CCUCUUAG 



CUAAGAGG GCCGAAAGGCGAGUCAAGGUCU AGCCAUAG 



GGAAAACC G CGGACGCC 



GGCGUCCG GCCGAAAGGCGAGUCAAGGUCU GGUUUUCC 



CCGCGGAC G CCUGGGCU 



AGCCCAGG GCCGAAAGGCGAGUCAAGGUCU GUCCGCGG 



2716 
2727 



CAGGACCU G CUGAACUG 
UGGUGUAU G CAGAUUGC 
UGCAGAUU G CCAAGGGG 
GAGGAUGU G CGGCUCGU 
ACUUGGCC G CUCGGAAC 
CGGAACGU G CUGGUCAA 



GCCGAAAGGCGAGUCAAGGUCU AGGUCCUG 
GCAAUCUG GCCGAAAGGCGAGUCAAGGUCU AUACACCA 
CCCCUUGG GCCGAAAGGCGAGUCAAGGUCU AAUCUGCA ~ 
ACGAGCCG GCCGAAAGGCGAGUCAAGGUCU ACAUCCUC 
GUUCCGAG GCCGAAAGGCGAGUCAAGGUCU GGCCAAGU 
UUGACCAG GCCGAAAGGCGAGUCAAGGUCU ACGUUCCG 
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2781 
2809 



2976 
2997 



3107 
3128 



3232 
3280 
3289 
3317 
3468 
3534 
3559 
3572 
3627 
3645 



GCUCGGCU G CUGGACAU 
AGUACCAU G CAGAUGGG 
GGCAAGGU G CCCAUCAA 
UGGAUGGC G CUGGAGUC 
CAUUCUCC G CCGGCGGU 
CCUGACCU G CUGGAAAA 
GAGCGGCU G CCCCAGCC 
CCCCAUCU G CACCAUUG 
AUUCUCCC G CAUGGCCA 
CCCCCAGC G CUUUGUGG 
CUUCUACC G CUCACUGC 
CGCUCACU G CUGGAGGA 
UGGUGGAU G CUGAGGAG 
CAGACCCU G CCCCGGGC 
CCCCGGGC G CUGGGGGC 
CAGGCACC G CAGCUCAU 
AAGGGGCU G CAAAGCCU 
GUACCCCU G CCCUCUGA 
GCUACGUU G CCCCCCUG 
CCUGACCU G CAGCCCCC 



AUGUCCAG GCCGAAAGGCGAGUCAAGGUCU AGCCGAGC 
CCCAUCUG GCCGAAAGGCGAGUCAAGGUCU AUGGUACU 
UUGAUGGG GCCGAAAGGCGAGUCAAGGUCU ACCUUGCC 
GACUCCAG GCCGAAAGGCGAGUCAAGGUCU GCCAUCCA 
ACCGCCGG GCCGAAAGGCGAGUCAAGGUCU GGAGAAUG 
UUUUCCAG GCCGAAAGGCGAGUCAAGGUCU AGGUCAGG 
GGCUGGGG GCCGAAAGGCGAGUCAAGGUCU AGCCGCUC 
CAAUGGUG GCCGAAAGGCGAGUCAAGGUCU AGAUGGGG 
UGGCCAUG GCCGAAAGGCGAGUCAAGGUCU GGGAGAAU 
CCACAAAG GCCGAAAGGCGAGUCAAGGUCU GCUGGGGG 
GCAGUGAG GCCGAAAGGCGAGUCAAGGUCU GGUAGAAG 
UCCUCCAG GCCGAAAGGCGAGUCAAGGUCU AGUGAGCG 
CUCCUCAG GCCGAAAGGCGAGUCAAGGUCU AUCCACCA 
GCCCGGGG GCCGAAAGGCGAGUCAAGGUCU AGGGUCUG 
GCCCCCAG GCCGAAAGGCGAGUCAAGGUCU GCCCGGGG 
AUGAGCUG GCCGAAAGGCGAGUCAAGGUCU GGUGCCUG 
AGGCUUUG GCCGAAAGGCGAGUCAAGGUCU AGCCCCUU 
UCAGAGGG GCCGAAAGGCGAGUCAAGGUCU AGGGGUAC 
CAGGGGGG GCCGAAAGGCGAGUCAAGGUCU AACGUAGC 
GccgAAAGGCGAGUCAAGGUCU agguCAGG 



CCCCCUUC G CCCCGAGA 



UCUCGGGG GCCGAAAGGCGAGUCAAGGUCU GAAG3GGG 



GGCCCUCU G CCUGCUGC 



GCAGCAGG GCCGAAAGGCGAGUCAAGGUCU AGAG3GCC 



CUCUGCCU G CUGCCCGA 



UCGGGCAG GCCGAAAGGCGAGUCAAGGUCU AGGCAGAG 



3652 
3661 
3667 
3730 
3742 



UGCCUGCU G CCCGACCU 



AGGUCGGG GCCGAAAGGCGAGUCAAGGUCU AGCAGGCA 



CCCGACCU G CUGGUGCC 



GGCACCAG GCCGAAAGGCGAGUCAAGGUCU AGGUCGGG 



CUGCUGGU G CCACUCUG 



ACCAGCAG 



ACGUUUUU G CCUUUGGG 



CCCAAAGG GCCGAAAGGCGAGUCAAGGUCU AAAAACGU 



UUGGGGGU G CCGUGGAG 



CUCCACGG GCCGAAAGGCGAGUCAAGGUCU ACCCCCAA 



CUGAGGGG GCCGAAAGGCGAGUCAAGGUCU AGCUCCUC 



CUCCUCCU G CCUUCAGC 



GCUGAAGG GCCGAAAGGCGAGUCAAGGUCU AGGAGGAG 



CUGGACGU C 



CACACUGG GCCGAAAGGCGAGUCAAGGUCU ACGUCCi 



CCAAGUCC G CAGAAGCC 



GGCUUCUG GCCGAAAGGCGAGUCAAGGUCU GGACUUGG 



UGACUUCU G CUGGCAUC 



GAUGCCAG GCCGAAAGGCGAGUCAAGGUCU AGAAGUCA 



GGGAACCU G CCAUGCCA 



UGGCAUGG GCCGAAAGGCGAGUCAAGGUCU AGGUUCCC 



CCUGCCAU G CCAGGAAC 



GUUCCUGG GCCGAAAGGCGAGUCAAGGUCU AUGGCAGG 



UCCUUCCU G CUUGAGUU 



AACUCAAG GCCGAAAGGCGAGUCAAGGUCU AGGAAGGA 



G CCCAAUGA 



UCAUUGGG GCCGAAAGGCGAGUCAAGGUCU AGGGCCUC 



CAGUGGAU G CCACAGCC 



GGCUGUGG GCCGAAAGGCGAGUCAAGGUCU AUCCACUG 



CUAGUACU G CCCCCCAU 



GCCGAAAGGCGAGUCAAGGUCU AGUACUAG 



UACAGAGU G CUUUUCUG 



CAGAAAAG GCCGAAAGGCGAGUCAAGGUCU ACUCUGUA 



CAGCGGCA GCCGAAAGGCGAGUCAAGGUCU AAGGCCGC 



ACCCAAGU G UGCACCGG 



CCGGUGCA GCCGAAAGGCGAGUCAAGGUCU ACUUGGGU 



GCCAGCCU C 
UGCGGAUU G UGCGAGGC 
CCACCCCU G UCACAGGG 
ACGAUUUU G UGGAAGGA 
CCACCCCU G UUCUCCGA 



AGGAAGGA GCCGAAAGGCGAGUCAAGGUCU AGGCUGGC 
GCCUCGCA GCCGAAAGGCGAGUCAAGGUCU AAUCCGCA 
CCCUGUGA GCCGAAAGGCGAGUCAAGGUCU AGGGGUGG 
UCCUUCCA GCCGAAAGGCGAGUCAAGGUCU AAAAUCGU 
UCGGAGAA GCCGAAAGGCGAGUCAAGGUCU AGGGGUGG 
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2075 
2077 
2410 
2436 
2503 
2518 
2S02 



UCUCCGAU G UGUAAGGG 
UCCGAUGU G UAAGGGCU 
UGAGGAUU G UCAGAGCC 
CGCGCACU G UCUGUGCC 
CACUGUCU G UGCCGGUG 
CGGUGGCU G UGCCCGCU 
UGAGCAGU G UGCUGCCG 
UGGCAUCU G UGAGCUGC 
CGCCAGCU G UGUGACUG 
CCAGCUGU G UGACUGCC 
GACUGCCU G UCCCUACA 
ACAGCGGU G UGAGAAGU 
CAAGCCCU G UGCCCGAG 
GCCCGAGU G UGCUAUGG 
CUCCAAGU G UUUGAGAC 
GGACGAGU G UGUGGGCG 
ACGAGUGU G UGGGCGAG 
CACCAGCU G UGCGCCCG 
CACCCAGU G UGUCAACU 
CCCAGUGU G UCAACUGC 
GGGAGUAU G UGAAUGCC 
CAGGCACU G UUUGCCGU 
CCCUGAGU G UCAGCCCC 
AGUGACCU G UUUUGGAC 
UGACCAGU G UGUGGCCU 
ACCAGUGU G UGGCCUGU 
UGUGGCCU G UGCCCACU 
CCAGCGGU G UGAAACCU 
CCACUCCU G UGUGGACC 



CCCUUACA GCCGAAAGGCGAGUCAAGGUCU AUCGGAGA 
AGCCCUUA GCCGAAAGGCGAGUCAAGGUCU ACAUCGGA 
GGCUCUGA GCCGAAAGGCGAGUCAAGGUCU AAUCCUCA 
GGCACAGA GCCGAAAGGCGAGUCAAGGUCU AGUGCGCG ~ 
CACCGGCA GCCGAAAGGCGAGUCAAGGUCU AGACAGUG ~ 
AGCGGGCA GCCGAAAGGCGAGUCAAGGUCU AGCCACCG 
CGGCAGCA GCCGAAAGGCGAGUCAAGGUCU ACUGCUCA 
GCAGCUCA GCCGAAAGGCGAGUCAAGGUCU AGAUGCCA 
CAGUCACA GCCGAAAGGCGAGUCAAGGUCU AGCUGGCG 
GGCAGUCA GCCGAAAGGCGAGUCAAGGUCU ACAGCUGG 
UGUAGGGA GCCGAAAGGCGAGUCAAGGUCU AGGCAGUC 
ACUUCUCA GCCGAAAGGCGAGUCAAGGUCU ACCGCUGU 
CUCGGGCA GCCGAAAGGCGAGUCAAGGUCU AGGGCUUG 
CCAUAGCA GCCGAAAGGCGAGUCAAGGUCU ACUCGGGC 
GUCUCAAA GCCGAAAGGCGAGUCAAGGUCU ACUUGGAG 
CGCCCACA GCCGAAAGGCGAGUCAAGGUCU ACUCGUCC 
CUCGCCCA GCCGAAAGGCGAGUCAAGGUCU ACACUCGU 
CGGGCGCA GCCGAAAGGCGAGUCAAGGUCU AGCUGGUG 
AGUUGACA GCCGAAAGGCGAGUCAAGGUCU ACUGGGUG 
GCAGUUGA GCCGAAAGGCGAGUCAAGGUCU ACACUGGG 
GGCAUUCA GCCGAAAGGCGAGUCAAGGUCU AUACUCCC 
ACGGCAAA GCCGAAAGGCGAGUCAAGGUCU AGUGCCUG 
GGGGCUGA GCCGAAAGGCGAGUCAAGGUCU ACUCAGGG 
GUCCAAAA GCCGAAAGGCGAGUCAAGGUCU AGGUCACU 
AGGCCACA GCCGAAAGGCGAGUCAAGGUCU ACUGGUCA 
ACAGGCCA GCCGAAAGGCGAGUCAAGGUCU ACACUGGU 
AGUGGGCA GCCGAAAGGCGAGUCAAGGUCU AGGCCACA 
AGGUUUCA GCCGAAAGGCGAGUCAAGGUCU ACCGCUGG 
5CGAGUCAAGGUCU 



ACUCCUGU G UGGACCUG 



CAGGUCCA GCCGAAAGGCGAGUCAAGGUCU ACAGGAGU 



GGGAGAAU G UGAAAAUU 



AAUUUUCA GCCGAAAGGCGAGUCAAGGUCU AUUCUCCC 



AUCAAAGU G UUGAGGGA 



UCCCUCAA GCCGAAAGGCGAGUCAAGGUCU ACUUUGAU 



UGGCUGGU G UGGGCUCC 



GGAGCCCA GCCGAAAGGCGAGUCAAGGUCU ACCAGCCA 



CCCCAUAU G UCUCCCGC 



GCGGGAGA GCCGAAAGGCGAGUCAAGGUCU AUAUGGGG 



UAGACCAU G UCCGGGAA 



UUCCCGGA GCCGAAAGGCGAGUCAAGGUCU AUGGUCUA 



GAACUGGU G UAUGCAGA 



GCCGAAAGGCGAGUCAAGGUCU ACCAGUUC 



UGGAGGAU G UGCGGCUC 



GAGCCGCA GCCGAAAGGCGAGUCAAGGUCU AUCCUCCA 



AAUUUUGA GCCGAAAGGCGAGUCAAGGUCU AUGGUUGG 



2889 
2902 



GCCGAAAGGCGAGUCAAGGUCU AUCACUCU 



AGUGAUGU G UGGAGUUA 



UAACUCCA GCCGAAAGGCGAGUCAAGGUCU ACAUCACU 



GUUAUGGU C 



CACAGUCA GCCGAAAGGCGAGUCAAGGUCU ACCAUAAC 



GUGUGACU G UGUGGGAG 



CUCCCACA GCCGAAAGGCGAGUCAAGGUCU AGUCACAC 



2910 
3025 



GUGACUGU G UGGGAGCU 



AGCUCCCA GCCGAAAGGCGAGUCAAGGUCU ACAGUCAC 



CCAUUGAU G UCUACAUG 



CAUGUAGA GCCGAAAGGCGAGUCAAGGUCU AUCAAUGG 



3068 
3093 



CUCUGAAU G UCGGCCAA 
GAGUUGGU G UCUGAAUU 



UUGGCCGA GCCGAAAGGCGAGUCAAGGUCU AUUCAGAG 
AAUUCAGA GCCGAAAGGCGAGUCAAGGUCU ACCAACUC 
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3133 
3269 

3592 
3507 
3939 
3 974 
3976 



AGCGCUUU G UGGUCAUC 
CUUCUUCU G UCCAGACC 
GCUCCGAU G UAUUUGAU 
CUGAAUAU G UGAACCAG 
AGCCAGAU G UUCGGCCC 
GUGCCAGU G UGAACCAG 
GCCCUGAU G UGUCCUCA 
CCUGAUGU G UCCUCAGG 
AGGAACCU G UCCUAAGG 
GAGUCUUU G UGGAUUCU 
AAGGGAGU G UCUAAGAA 
CAGAGACU G UCCCUGAA 
GCAAUGGU G UCAGUAUC 
CAGGCUUU G UACAGAGU 
GCUUUUCU G UUUAGUUU 
CUUUUUUU G UUUUGUUU 
UUUGUUUU G UUUUUUUA 
G UAACCCUG 



GAUGACCA GCCGAAAG3CGAGUCAAGGUCU AAAGCGCU 
GGUCUGGA GCCGAAAGGCGAGUCAAGGUCU AGAAGAAG 
AUCAAAUA GCCGAAAGGCGAGUCAAGGUCU AUCGGAGC 
CUGGUUCA GCCGAAAGGCGAGUCAAGGUCU AUAUUCAG 
GGGCCGAA GCCGAAAGGCGAGUCAAGGUCU AUCUGGCU 
CUGGUUCA GCCGAAAGGCGAGUCAAGGUCU ACUGGCAC 
UGAGGACA GCCGAAAGGCGAGUCAAGGUCU AUCAGGGC 
CCUGAGGA GCCGAAAGGCGAGUCAAGGUCU ACAUCAGG 
CCUUAGGA GCCGAAAGGCGAGUCAAGGUCU AGGUUCCU 
AGAAUCCA GCCGAAAGGCGAGUCAAGGUCU AAAGACUC 
UUCUUAGA GCCGAAAGGCGAGUCAAGGUCU ACUCCCUU 
UUCAGGGA GCCGAAAGGCGAGUCAAGGUCU AGUCUCUG 
GAUACUGA GCCGAAAGGCGAGUCAAGGUCU ACCAUUGC 
ACUCUGUA GCCGAAAGGCGAGUCAAGGUCU AAAGCCUG 
AAACUAAA GCCGAAAGGCGAGUCAAGGUCU AGAAAAGC 
AAACAAAA GCCGAAAGGCGAGUCAAGGUCU AAAAAAAG 
UAAAAAAA GCCGAAAGGCGAGUCAAGGUCU AAAACAAA 
GCCGAAAGGCGAGUCAAGGUCU CUCCCCUU 



UAACCCUG G CCCCUUUG 



CAAAGGGG GCCGAAAGGCGAGUCAAGGUCU CAGGGUUA 



CCCCUUUG G UCGGGGCC 



GGCCCCGA GCCGAAAGGCGAGUCAAGGUCU CAAAGGGG 



UGGUCGGG G CCCCGGGC 



GCCCGGGG GCCGAAAGGCGAGUCAAGGUCU CCCGACCA 



GGCCCCGG G CAGCCGCG 



CGCGGCUG GCCGAAAGGCGAGUCAAGGUCU CCGGGGCC 



CCCGGGCA G CCGCGCGC 



GCGCGCGG GCCGAAAGGCGAGUCAAGGUCU UGCCCGGG 



CCCACGGG G CCCUUUAC 



GUAAAGGG GCCGAAAGGCGAGUCAAGGUCU CCCGUGGG 



CGCGCCCG G CCCCCACC 



GGUGGGGG GCCGAAAGGCGAGUCAAGGUCU CGGGCGCG 



CCCUCGCA G CACCCCGC 



GCGGGGUG GCCGAAAGGCGAGUCAAGGUCU 



CCCUCCCA G CCGGGUCC 



GGACCCGG GCCGAAAGGCGAGUCAAGGUCU UGGGAGGG 



CCAGCCGG G UCCAGCCG 



CGGCUGGA GCCGAAAGGCGAGUCAAGGUCU CCGGCUGG 



CGGGUCCA G CCGGAGCC 



GGCUCCGG GCCGAAAGGCGAGUCAAGGUCU UGGACCCG 



CAGCCGGA G CCAUGGGG 



CCCCAUGG GCCGAAAGGCGAGUCAAGGUCU UCCGGCUG 



GCCAUGGG G CCGGAGCC 



GGCUCCGG GCCGAAAGGCGAGUCAAGGUCU CCCAUGGC 



GGGCCGGA G CCGCAGUG 



CACUGCGG GCCGAAAGGCGAGUCAAGGUCU UCCGGCCC 



GAGCCGCA G UGAGCACC 



GGUGCUCA GCCGAAAGGCGAGUCAAGGUCU UGCGGCUC 



CGCAGUGA G CACCAUGG 



GCCGAAAGGCGAGUCAAGGUCU UCACUGCG 



ACCAUGGA G CUGGCGGC 



GCCGCCAG GCCGAAAGGCGAGUCAAGGUCU UCCAUGGU 



UGGAGCUG G 



CAAGGCCG GCCGAAAGGCGAGUCAAGGUCU CAGCUCCA 



AGCUGGCG G 



GCACAAGG GCCGAAAGGCGAGUCAAGGUCU CGCCAGCU 



CGCUGGGG G CUCCUCCU 



AGGAGGAG GCCGAAAGGCGAGUCAAGGUCU CCCCAGCG 



CCCCCGGA G CCGCGAGC 



GCUCGCGG GCCGAAAGGCGAGUCAAGGUCU UCCGGGGG 



AGCCGCGA G CACCCAAG 



CUUGGGUG GCCGAAAGGCGAGUCAAGGUCU UCGCGGCU 



GCACCCAA G UGUGCACC 



GGUGCACA GCCGAAAGGCGAGUCAAGGUCU UUGGGUGC 



GUGCACCG G CACAGACA 



UGUCUGUG GCCGAAAGGCGAGUCAAGGUCU CGGUGCAC 



GACAUGAA G CUGCGGCU 



GCCGAAAGGCGAGUCAAGGUCU UUCAUGUC 



AAGCUGCG G CUCCCUGC 



GCAGGGAG GCCGAAAGGCGAGUCAAGGUCU CGCAGCUU 



G UCCCGAGA 
CUACCAGG G CUGCCAGG 



GCUGCCAG G UGGUGCAG 



UCUCGGGA GCCGAAAGGCGAGUCAAGGUCU UGGCAGGG 
CCUGGCAG GCCGAAAGGCGAGUCAAGGUCU CCUGGUAG 
CUGCACCA GCCGAAAGGCGAGUCAAGGUCU CUGGCAGC 
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GCCAGGUG G UGCAGGGA 
G CCUGUCCU 



UCCCUGCA GCCGAAAGGCGAGUCAAGGUCU CACCUGGC 
AGGACAGG GCCGAAAGGCGAGUCAAGGUCU UGGCAUUG ~ 



UCCAGGAG G UGCAGGGC 



GCCCUGCA GCCGAAAGGCGAGUCAAGGUCU CUCCUGGA 



GGUGCAGG G CUACGUGC 



GCACGUAG GCCGAAAGGCGAGUCAAGGUCU CCUGCACC 



AGGGCUAC G UGCUCAUC 



GAUGAGCA GCCGAAAGGCGAGUCAAGGUCU GUAGCCCU 



ACAACCAA G UGAGGCAG 



CUGCCUCA GCCGAAAGGCGAGUCAAGGUCU UUGGUUGU 



CAAGUGAG G CAGGUCCC 



GGGACCUG GCCGAAAGGCGAGUCAAGGUCU CUCACUUG 



UGAGGCAG G UCCCACUG 



CAGUGGGA GCCGAAAGGCGAGUCAAGGUCU CUGCCUCA 



G CUGCGGAU 



AUCCGCAG GCCGAAAGGCGAGUCAAGGUCU CUCUGCAG 



UGUGCGAG G CACCCAGC 



GCUGGGUG GCCGAAAGGCGAGUCAAGGUCU CUCGCACA 



GGCACCCA G CUCUUUGA 



UCAAAGAG GCCGAAAGGCGAGUCAAGGUCU UGGGUGCC 



AUGCCCUG G CCGUGCUA 



UAGCACGG GCCGAAAGGCGAGUCAAGGUCU CAGGGCAU 



CCCUGGCC G UGCUAGAC 



GUCUAGCA GCCGAAAGGCGAGUCAAGGUCU GGCCAGGG 



UCACAGGG G CCUCCCCA 



UGGGGAGG GCCGAAAGGCGAGUCAAGGUCU CCCUGUGA 



CCCAGGAG G CCUGCGGG 



CUGCGGGA G CUGCAGCU 



AGCUGCAG GCCGAAAGGCGAGUCAAGGUCU UCCCGCAG 



GAGCUGCA G CUUCGAAG 



CUUCGAAG GCCGAAAGGCGAGUCAAGGUCU UGCAGCUC 



GCUUCGAA G CCUCACAG 



CUGUGAGG GCCGAAAGGCGAGUCAAGGUCU UUCGAAGC 



5 UCUUGAUC 



GAUCAAGA GCCGAAAGGCGAGUCAAGGUCU CCCUCCUU 



UUGAUCCA G CGGAACCC 



GGGUUCCG GCCGAAAGGCGAGUCAAGGUCU UGGAUCAA 



AACCCCCA G CUCUGCUA 



UAGCAGAG GCCGAAAGGCGAGUCAAGGUCU UGGGGGUU 



! CUGGCUCU 



AGAGCCAG GCCGAAAGGCGAGUCAAGGUCU UGGUUGUU 



ACCAGCUG G CUCUCACA 



UGUGAGAG GCCGAAAGGCGAGUCAAGGUCU CAGCUGGU 



GCUCUCGG G CCUGCCAC 



GUGGCAGG GCCGAAAGGCGAGUCAAGGUCU CCGAGAGC 



GUGUAAGG C 



AGCGGGAG GCCGAAAGGCGAGUCAAGGUCU CCUUACAC 



GGGAGAGA G UUCUGAGG 



CCUCAGAA GCCGAAAGGCGAGUCAAGGUCU UCUCUCCC 



UUGUCAGA G CCUGACGC 



GCGUCAGG GCCGAAAGGCGAGUCAAGGUCU UCUGACAA 



OJGUGCCG G UGGCUGUG 



CACAGCCA GCCGAAAGGCGAGUCAAGGUCU CGGCACAG 



UGCCGGUG G CUGUGCCC 



GGGCACAG GCCGAAAGGCGAGUCAAGGUCU CACCGGCA 



UGCAAGGG G CCACUGCC 



GGCAGUGG GCCGAAAGGCGAGUCAAGGUCU CCCUUGCA 



UGCCAUGA G CAGUGUGC 



GCACACUG GCCGAAAGGCGAGUCAAGGUCU UCAUGGCA 



CAUGAGCA G UGUGCUGC 



GCAGCACA GCCGAAAGGCGAGUCAAGGUCU UGCUCAUG 



UGCUGCCG G 



CCGUGCAG GCCGAAAGGCGAGUCAAGGUCU 



CUGCACGG G CCCCAAGC 



GCUUGGGG GCCGAAAGGCGAGUCAAGGUCU CCGUGCAG 



1008 
1014 
1034 
1038 
1049 
1055 



GGCCCCAA G CACUCUGA 
ACUGCCUG G CCUGCCUC 
CAACCACA G UGGCAUCU 
CCACAGUG G CAUCUGUG 
AUCUGUGA G CUGCACUG 
ACUGCCCA G CCCUGGUC 
CAGCCCUG G UCACCUAC 
ACAGACAC G UUUGAGUC 
ACGUUUGA G UCCAUGCC 
UCCCGAGG G CCGGUAUA 
GAGGGCCG G UAUACAUU 
UACAUUCG G CGCCAGCU 
CGGCGCCA G CUGUGUGA 



GCCGAAAGGCGAGUCAAGGUCU UUGGGGCC 
GAGGCAGG GCCGAAAGGCGAGUCAAGGUCU CAGGCAGU 
AGAUGCCA GCCGAAAGGCGAGUCAAGGUCU UGUGGUUG 
CACAGAUG GCCGAAAGGCGAGUCAAGGUCU CACUGUGG 
CAGUGCAG GCCGAAAGGCGAGUCAAGGUCU UCACAGAU 
GACCAGGG GCCGAAAGGCGAGUCAAGGUCU UGGGCAGU 
GUAGGUGA GCCGAAAGGCGAGUCAAGGUCU CAGGGCUG 
GACUCAAA GCCGAAAGGCGAGUCAAGGUCU GUGUCUGU 
GGCAUGGA GCCGAAAGGCGAGUCAAGGUCU UCAAACGU 
UAUACCGG GCCGAAAGGCGAGUCAAGGUCU CCUCGGGA 
AAUGUAUA GCCGAAAGGCGAGUCAAGGUCU CGGCCCUC 
AGCUGGCG GCCGAAAGGCGAGUCAAGGUCU CGAAUGUA 
UCACACAG GCCGAAAGGCGAGUCAAGGUCU UGGCGCCG 
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1164 
1173 



1211 
1218 



CUACGGAC G UGGGAUCC 
GCACCCUC G UCUGCCCC 
ACCAAGAG G UGACAGCA' 
AGGUGACA G CAGAC-GAU 
GGAACACA G CGGUGUGA 
ACACAGCG G UGUGAGAA 
UGUGAGAA G UGCAGCAA ' 
GAAGUGCA G CAAGCCCU 
UGCAGCAA G CCCUGUGC 
GUGCCCGA G UGUGCUAU 
GUGCUAUG G UCUGGGCA 
UGGUCUGG G CAUGGAGC 
GGCAUGGA G CACUUGCG 
UGCGAGAG G UGAGGGCA 
AGGUGAGG G 



GGAUCCCA GCCGAAAGGCGAGUCAAGGUCU GUCCGUAG 

GGGGCAGA GCCGAAAGGCGAGUCAAGGUCU GAGGGUGC 

UGCUGUCA GCCGAAAGGCGAGUCAAGGUCU CUCUUGGU 

AUCCUCUG GCCGAAAGGCGAGUCAAGGUCU UGUCACCU 

UCACACCG GCCGAAAGGCGAGUCAAGGUCU UGUGUUCC 

UUCUCACA GCCGAAAGGCGAGUCAAGGUCU CGCUGUGU 

UUGCUGCA GCCGAAAGGCGAGUCAAGGUCU UUCUCACA 

AGGGCUUG GCCGAAAGGCGAGUCAAGGUCU UGCACUUC 

GCACAGGG GCCGAAAGGCGAGUCAAGGUCU UUGCUGCA 

AUAGCACA GCCGAAAGGCGAGUCAAGGUCU UCGGGCAC 

UGCCCAGA GCCGAAAGGCGAGUCAAGGUCU CAUAGCAC 

GCUCCAUG GCCGAAAGGCGAGUCAAGGUCU CCAGACCA 

CGCAAGUG GCCGAAAGGCGAGUCAAGGUCU UCCAUGCC 

UGCCCUCA GCCGAAAGGCGAGUCAAGGUCU CUCUCGCA 

GGUAACUG GCCGAAAGGCGAGUCAAGGUCU CCUCACCU 



UGAGGGCA G UUACCAGU 



ACUGGUAA GCCGAAAGGCGAGUCAAGGUCU UGCCCUCA 



AGUUACCA G UGCCAAUA 



UAUUGGCA GCCGAAAGGCGAGUCAAGGUCU UGGUAACU 



AUCCAGGA G UUUGCUGG 



CCAGCAAA GCCGAAAGGCGAGUCAAGGUCU UCCUGGAU 



1271 
1292 
1297 



GUUUGCUG G CUGCAAGA 



UCUUGCAG GCCGAAAGGCGAGUCAAGGUCU CAGCAAAC 



CUUUGGGA G CCUGGCAU 



AUGCCAGG GCCGAAAGGCGAGUCAAGGUCU UCCCAAAG 



GGAGCCUG G CAUUUCUG 



CAGAAAUG GCCGAAAGGCGAGUCAAGGUCU CAGGCUCC 



GCCGGAGA G CUUUGAUG 



CAUCAAAG GCCGAAAGGCGAGUCAAGGUCU UCUCCGGC 



1330 
1353 



GGGACCCA G CCUCCAAC 



GUUGGAGG GCCGAAAGGCGAGUCAAGGUCU UGGGUCCC 



CCGCUCCA G CCAGAGCA 



UGCUCUGG GCCGAAAGGCGAGUCAAGGUCU UGGAGCGG 



CAGCCAGA G CAGCUCCA 



UGGAGCUG GCCGAAAGGCGAGUCAAGGUCU UCUGGCUG 



G CUCCAAGU 



ACUUGGAG GCCGAAAGGCGAGUCAAGGUCU UGCUCUGG 



AGCUCCAA G UGUUUGAG 



CUCAAACA GCCGAAAGGCGAGUCAAGGUCU UUGGAGCU 



1397 
1414 
1419 



GAUCACAG G UUACCUAU 



AUAGGUAA GCCGAAAGGCGAGUCAAGGUCU CUGUGAUC 



ACAUCUCA G CAUGGCCG 



GCCGAAAGGCGAGUCAAGGUCU UGAGAUGU 



UCAGCAUG G CCGGACAG 



CUGUCCGG GCCGAAAGGCGAGUCAAGGUCU CAUGCUGA 



GCCGGACA G CCUGCCUG 



CAGGCAGG GCCGAAAGGCGAGUCAAGGUCU UGUCCGGC 



UGACCUCA G CGUCUUCC 



GGAAGACG GCCGAAAGGCGAGUCAAGGUCU UGAGGUCA 



ACCUCAGC G UCUUCCAG 



CUGGAAGA GCCGAAAGGCGAGUCAAGGUCU GCUGAGGU 



ACCUGCAA G UAAUCCGG 



CCGGAUUA GCCGAAAGGCGAGUCAAGGUCU UUGCAGGU 



GCACAAUG G CGCCUACU 



AGUAGGCG GCCGAAAGGCGAGUCAAGGUCU CAUUGUGC 



CUGCAAGG G CUGGGCAU 



AUGCCCAG GCCGAAAGGCGAGUCAAGGUCU CCUUGCAG 



AGGGCUGG G CAUCAGCU 



AGCUGAUG GCCGAAAGGCGAGUCAAGGUCU CCAGCCCU 



GGGCAUCA G CUGGCUGG 



CCAGCCAG GCCGAAAGGCGAGUCAAGGUCU UGAUGCCC 



AUCAGCUG G CUGGGGCU 



GCCGAAAGGCGAGUCAAGGUCU CAGCUGAU 



UGGCUGGG G CUGCGCUC 



GAGCGCAG GCCGAAAGGCGAGUCAAGGUCU CCCAGCCA 



1603 
1612 



GGAACUGG G CAGUGGAC 
ACUGGGCA G UGGACUGG 
GUGGACUG G CCCUCAUC 
UCUGCUUC G UGCACACG 
UGCACACG G UGCCCUGG 
UGGGACCA G CUCUUUCG 
CGCACCAA G CUCUGCUC 



GCCGAAAGGCGAGUCAAGGUCU CCAGUUCC 
CCAGUCCA GCCGAAAGGCGAGUCAAGGUCU UGCCCAGU 
GAUGAGGG GCCGAAAGGCGAGUCAAGGUCU CAGUCCAC 
CGUGUGCA GCCGAAAGGCGAGUCAAGGUCU GAAGCAGA 
CCAGGGCA GCCGAAAGGCGAGUCAAGGUCU CGUGUGCA 
CGAAAGAG GCCGAAAGGCGAGUCAAGGUCU UGGUCCCA 
GAGCAGAG GCCGAAAGGCGAGUCAAGGUCU UUGGUGCG 
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GAGGACGA G UGUGUGGG 
GUGUGUGG G CGAGGGCC 



GGGCGAGG G CCUGGCCU 



UCCUCUGG GCCGAAAGGCGAGUCAAGGUCU CGGUUOGC 
CCCACACA GCCGAAAGGCGAGUCAAGGUCU UCGUCCUC 
GGCCCUCG GCCGAAAGGCGAGUCAAGGUCU CCACACAC 
AGGCCAGG GCCGAAAGGCGAGUCAAGGUCU CCUCGCCC 
GCCGAAAGGCGAGUCAAGGUCU CAGGCCCU 



GCGCACAG GCCGAAAGGCGAGUCAAGGUCU UGGUGGCA 



1728 
1739 
1746 



GCCCGAGG G CACUGCUG 



5 GCCGAAAGGCGAGUCAAGGUCU CCUCGGGC 



CUGCUGGG G UCCAGGGC 



GCCCUGGA 



JCGAGUCAAGGUCU CCUGGACC 



CCCACCCA G UGUGUCAA 



UUGACACA GCCGAAAGGCGAGUCAAGGUCU UGGGUGGG 



CAACUGCA G CCAC 



GGAACUGG GCCGAAAGGCGAGUCAAGGUCU UGCAGUUG 



UGCAGCCA G UUCCUUCG 



CGAAGGAA GCCGAAAGGCGAGUCAAGGUCU 



CCUUCGGG G CCAGGAGU 



ACUCCUGG GCCGAAAGGCGAGUCAAGGUCU CCCGAAGG 



GGCCAGGA G UGCGUGGA 



GCCGAAAGGCGAGUCAAGGUCU UCCUGGCC 



1795 
1810 
1821 
1833 



AGGAGUGC G UGGAGGAA 



UUCCUCCA GCCGAAAGGCGAGUCAAGGUCU GCACUCCU 



AAUGCCGA G UACUGCAG 



GCCGAAAGGCGAGUCAAGGUCU UCGGCAUU 



CUGCAGGG G CUCCCCAG 



CUGGGGAG GCCGAAAGGCGAGUCAAGGUCU CCCUGCAG 



CCCAGGGA G UAUGUGAA 



UUCACAUA GCCGAAAGGCGAGUCAAGGUCU UCCCUGGG 



AAUGCCAG G CACUGUUU 



AAACAGUG GCCGAAAGGCGAGUCAAGGUCU CUGGCAUU 



UGUUUGCC G UGCCACCC 



GCCGAAAGGCGAGUCAAGGUCU GGCAAACA 



CACCCUGA G UGUCAGCC 



GGCUGACA GCCGAAAGGCGAGUCAAGGUCU UCAGGGUG 



GAGUGUCA G CCCCAGAA 



UUCUGGGG GCCGAAAGGCGAGUCAAGGUCU UGACACUC 



1889 
1894 
1915 
1923 



CCAGAAUG G CUCAGUGA 



UCACUGAG GCCGAAAGGCGAGUCAAGGUCU CAUUCUGG 



AUGGCUCA G UGACCUGU 



ACAGGUCA GCCGAAAGGCGAGUCAAGGUCU UGAGCCAU 



GACCGGAG G CUGACCAG 



CUGGUCAG GCCGAAAGGCGAGUCAAGGUCU CUCCGGUC 



GCUGACCA G UGUGUGGC 



GCCACACA GCCGAAAGGCGAGUCAAGGUCU UGGUCAGC 



AGUGUGUG G CCUGUGCC 



GGCACAGG GCCGAAAGGCGAGUCAAGGUCU CACACACU 



1963 
1966 



CCUUCUGC G UGGCCCGC 



GCGGGCCA GCCGAAAGGCGAGUCAAGGUCU GCAGAAGG 



UCUGCGUG G CCCGCUGC 



GCAGCGGG GCCGAAAGGCGAGUCAAGGUCU CACGCAGA 



CUGCCCCA G CGGUGUGA 



UCACACCG GCCGAAAGGCGAGUCAAGGUCU UGGGGCAG 



CCCCAGCG G UGUGAAAC 



GUUUCACA GCCGAAAGGCGAGUCAAGGUCU CGCUGGGG 



AUCUGGAA G UUUCCAGA 



UCUGGAAA GCCGAAAGGCGAGUCAAGGUCU UUCCAGAU 



2036 
2046 
2096 



UGAGGAGG G CGCAUGCC 



GGCAUGCG GCCGAAAGGCGAGUCAAGGUCU CCUCCUCA 



GCAUGCCA G CCUUGCCC 



GGGCAAGG GCCGAAAGGCGAGUCAAGGUCU UGGCAUGC 



UGACAAGG G CUGCCCCG 
CCCGCCGA G CAGAGAGC 
AGCAGAGA G CCAGCCCU 
GAGAGCCA G CCCUCUGA 
CCUCUGAC G UCCAUCAU 
UCUCUGCG G UGGUUGGC 
CUGCGGUG G UUGGCAUU 
GGUGGUUG G CAUUCUGC 
UUCUGCUG G UCGUGGUC 
UGCUGGUC G UGGUCUUG 
UGGUCGUG G UCUUGGGG 
UCUUGGGG G UGGUCUUU 
UGGGGGUG G UCUUUGGG 



CGGGGCAG 

GCUCUCUG GCCGAAAGGCGAGUCAAGGUCU UCGGCGGG 
AGGGCUGG GCCGAAAGGCGAGUCAAGGUCU UCUCUGCU 
UCAGAGGG GCCGAAAGGCGAGUCAAGGUCU UGGCUCUC 
AUGAUGGA GCCGAAAGGCGAGUCAAGGUCU GUCAGAGG 
GCCAACCA GCCGAAAGGCGAGUCAAGGUCU CGCAGAGA 
AAUGCCAA GCCGAAAGGCGAGUCAAGGUCU CACCGCAG 
GCAGAAUG GCCGAAAGGCGAGUCAAGGUCU CAACCACC 
GACCACGA GCCGAAAGGCGAGUCAAGGUCU CAGCAGAA 
CAAGACCA GCCGAAAGGCGAGUCAAGGUCU GACCAGCA 
CCCCAAGA GCCGAAAGGCGAGUCAAGGUCU CACGACCA 
AAAGACCA GCCGAAAGGCGAGUCAAGGUCU CCCCAAGA 
CCCAAAGA GCCGAAAGGCGAGUCAAGGUCU CACCCCCA 
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2226 
2259 
2263 
2268 
2282 
2287 
2302 
2331 
2341 
2347 
2360 
2369 
2374 



CUCAUCAA G CGACGGCA 
AAGCGACG G CAGCAGAA 
CGACGGCA G CAGAAGAU 
AUCCGGAA G UACACGAU 
GAAACGGA G CUGGUGGA 
CGGAGCUG G UGGAGCCG 
CUGGUGGA G CCGCUGAC 
GACACCUA G CGGAGCGA 
CUAGCGGA G CGAUGCCC 
CCAACCAG G CGCAGAUG 
GAGACGGA G CUGAGGAA 
UGAGGAAG G UGAAGGUG 
AGGUGAAG G UGCUUGGA 
UGGAUCUG G CGCUUUUG 
CGCUUUUG G CACAGUCU 
G UCUACAAG 



UGCCGUCG GCCGAAAGGCGAGUCAAGGUCU UUGAUGAG 
UUCUGCUG GCCGAAAGGCGAGUCAAGGUCU CGUCGCUU 
AUCUUCUG GCCGAAAGGCGAGUCAAGGUCU UGCCGUCG 
AUCGUGUA GCCGAAAGGCGAGUCAAGGUCU UUCCGGAU 
UCCACCAG GCCGAAAGGCGAGUCAAGGUCU UCCGUUUC 
CGGCUCCA GCCGAAAGGCGAGUCAAGGUCU CAGCUCCG 
GUCAGCGG GCCGAAAGGCGAGUCAAGGUCU UCCACCAG 
UCGCUCCG GCCGAAAGGCGAGUCAAGGUCU UAGGUGUC 
GGGCAUCG GCCGAAAGGCGAGUCAAGGUCU UCCGCUAG 
CAUCUGCG GCCGAAAGGCGAGUCAAGGUCU CUGGUUGG 
UUCCUCAG GCCGAAAGGCGAGUCAAGGUCU UCCGUCUC 
CACCUUCA GCCGAAAGGCGAGUCAAGGUCU CUUCCUCA 
UCCAAGCA GCCGAAAGGCGAGUCAAGGUCU CUUCACCU 
CAAAAGCG GCCGAAAGGCGAGUCAAGGUCU CAGAUCCA 
AGACUGUG GCCGAAAGGCGAGUCAAGGUCU CAAAAGCG 
CUUGUAGA GCCGAAAGGCGAGUCAAGGUCU UGUGCCAA 



CUACAAGG G CAUCUGGA 



UCCAGAUG GCCGAAAGGCGAGUCAAGGUCU CCUUGUAG 



2422 
2425 
2434 



AAAUUCCA G UGGCCAUC 



GAUGGCCA GCCGAAAGGCGAGUCAAGGUCU UGGAAUUU 



UUCCAGUG G CCAUCAAA 



UUUGAUGG GCCGAAAGGCGAGUCAAGGUCU CACUGGAA 



CCAUCAAA G UGUUGAGG 



CCUCAACA GCCGAAAGGCGAGUCAAGGUCU UUUGAUGG 



CCCCCAAA G CCAACAAA 



UUUGUUGG GCCGAAAGGCGAGUCAAGGUCU UUUGGGGG 



UAGACGAA G CAUACGUG 



CACGUAUG GCCGAAAGGCGAGUCAAGGUCU UUCGUCUA 



AAGCAUAC G UGAUGGCU 



AGCCAUCA GCCGAAAGGCGAGUCAAGGUCU GUAUGCUU 



G CUGGUGUG 



CACACCAG G 



GAUGGCUG G UGUGGGCU 



AGCCCACA GCCGAAAGGCGAGUCAAGGUCU CAGCCAUC 



UGGUGUGG G CUCCCCAU 



GCCGAAAGGCGAGUCAAGGUCU CCACACCA 



CCUUCUGG G CAUCUGCC 



GGCAGAUG GCCGAAAGGCGAGUCAAGGUCU CCAGAAGG 



CAUCCACG G UGCAGCUG 



CAGCUGCA GCCGAAAGGCGAGUCAAGGUCU CGUGGAUG 



2559 
2563 



ACGGUGCA G CUGGUGAC 



GUCACCAG GCCGAAAGGCGAGUCAAGGUCU UGCACCGU 



UGCAGCUG G UGACACAG 



CUGUGUCA GCCGAAAGGCGAGUCAAGGUCU CAGCUGCA 



GUGACACA G 



GGCAUAAG GCCGAAAGGCGAGUCAAGGUCU UGUGUCAC 



2585 
2627 
2649 
2675 
2694 
2698 



GCCCUAUG G CUGCCUCU 



AGAGGCAG GCCGAAAGGCGAGUCAAGGUCU CAUAGGGC 



ACGCCUGG G CUCCCAGG 



CCUGGGAG GCCGAAAGGCGAGUCAAGGUCU CCAGGCGU 



CUGAACUG G UGUAUGCA 



UGCAUACA GCCGAAAGGCGAGUCAAGGUCU CAGUUCAG 



G CUACCUGG 



CCAGGUAG GCCGAAAGGCGAGUCAAGGUCU UCAUCCCC 



GAUGUGCG G CUCGUACA 



UGUACGAG GCCGAAAGGCGAGUCAAGGUCU CGCACAUC 



UGCGGCUC G UACACAGG 



CCUGUGUA GCCGAAAGGCGAGUCAAGGUCU GAGCCGCA 



GGGACUUG G CCGCUCGG 



CCGAGCGG 



CUCGGAAC G UGCUGGUC 



GACCAGCA GCCGAAAGGCGAGUCAAGGUCU GUUCCGAG 



ACGUGCUG G UCAAGAGU 



ACUCUUGA GCCGAAAGGCGAGUCAAGGUCU CAGCACGU 



2738 
2769 
2773 
2778 
2802 
2819 
2324 



GGUCAAGA G UCCCAACC 
GACUUCGG G CUGGCUCG 
UCGGGCUG G CUCGGCUG 
CUGGCUCG G CUGCUGGA 
GAGACAGA G UACCAUGC 
AGAUGGGG G CAAGGUGC 
GGGGCAAG G UGCCCAUC 



GGUUGGGA GCCGAAAGGCGAGUCAAGGUCU 
CGAGCCAG GCCGAAAGGCGAGUCAAGGUCU CCGAAGUC 
CAGCCGAG GCCGAAAGGCGAGUCAAGGUCU CAGCCCGA 
UCCAGCAG GCCGAAAGGCGAGUCAAGGUCU CGAGCCAG 
GCAUGGUA GCCGAAAGGCGAGUCAAGGUCU UCUGUCUC 
GCACCUUG GCCGAAAGGCGAGUCAAGGUCU CCCCAUCU 
GAUGGGCA GCCGAAAGGCGAGUCAAGGUCU CUUGCCCC 
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2835 
2842 



28S5 
28S8 
2882 



CCCAUCAA G UGGAUGGC 
AGUGGAUG G CGCUGGAG 
GCGCUGGA G UCCAUUCU 
CUCCGCCG G CGGUUCAC 



GCCAUCCA GCCGAAAGGCGAGUCAAGGUCU UUGAUGGG 
CUCCAGCG GCCGAAAGGCGAGUCAAGGUCU CAUCCACU 
AGAAUGGA GCCGAAAGGCGAGUCAAGGUCU UCCAGCGC 
GCCGAAAGGCGAGUCAAGGUCU CGGCGGAG 



UGGGUGAA GCCGAAAGGC 



CCACCAGA G UGAUGUGU 



ACACAUCA GCCGAAAGGCGAGUCAAGGUCU UCUGGUGG 



UGUGUGGA G UUAUGGUG 



CACCAUAA GCCGAAAGGCGAGUCAAGGUCU UCCACACA 



GAGUUAUG G UGUGACUG 



GCCGAAAGGCGAGUCAAGGUCU CAUAACUC 



2916 
2932 



GUGUGGGA G CUGAUGAC 



GUCAUCAG GCCGAAAGGCGAGUCAAGGUCU UCCCACAC 



CUUUUGGG G CCAAACCU 



3GUUUGG GCCGAAAGGCGAGUCAAGGUCU CCCAAAAG 



GGAUCCCA G CCCGGGAG 



IGGUCU UGGGAUCC 



2991 
2994 
3003 



AAGGGGGA G CGGCUGCC 



AGGCGAGUCAAGGUCU UCCCCCUU 



GGGGAGCG G CUGCCCCA 



5 GCCGAAAGGCGAGUCAAGGUCU C 



CUGCCCCA G CCCCCCAU 



3 GCCGAAAGGCGAGUCAAGGUCU UGGGGCAG 



UGAUCAUG G 



ACAUUUGA GCCGAAAGGCGAGUCAAGGUCU CAUGAUCA 



3072 
3087 
3091 
3112 



GAAUGUCG G CCAAGAUU 



3 GCCGAAAGGCGAC 



UUCCGGGA G UUGGUGUC 



GACACCAA GCCGAAAGGCGAGUCAAGGUCU UCCCGGAA 



GGGAGUUG G UGUCUGAA 



UUCAGACA GCCGAAAGGCGAGUCAAGGUCU CAACUCCC 



CCCGCAUG G CCAGGGAC 



GUCCCUGG GCCGAAAGGCGAGUCAAGGUCU CAUGCGGG 



GACCCCCA G CGCUUUGU 



ACAAAGCG GCCGAAAGGCGAGUCAAGGUCU UGGGGGUC 



5CUUUGUG G UCAUCCAG 



CUGGAUGA GCCGAAAGGCGAGUCAAGGUCU CACAAAGC 



! CCCAGCCA 



UGGCUGGG GCCGAAAGGCGAGUCAAGGUCU CCAAGUCC 



322S 
3240 



3335 
3338 
3341 
3360 
3366 
3382 
3390 
3400 
3415 
3419 
3437 
3454 



CCCAGCCA G UCCCUUGG 
CUUGGACA G CACCUUCU 
GGGACCUG G UGGAUGCU 
GCUGAGGA G UAUCUGGU 
AGUAUCUG G UACCCCAG 
GUACCCCA G CAGGGCUU 
CCAGCAGG G CUUCUUCU 
UGCCCCGG G CGCUGGGG 
CGCUGGGG G CAUGGUCC 
GGGGCAUG G UCCACCAC 
CACCACAG G CACCGCAG 
GCACCGCA G CUCAUCUA 
UACCAGGA G UGGCGGUG 
CAGGAGUG G CGGUGGGG 
GAGUGGCG G UGGGGACC 
ACACUAGG G CUGGAGCC 
GGGCUGGA G CCCUCUGA 
AAGAGGAG G CCCCCAGG 
GCCCCCAG G UCUCCACU 
CUCCACUG G CACCCUCC 
CCGAAGGG G CUGGCUCC 
AGGGGCUG G CUCCGAUG 
AUUUGAUG G UGACCUGG 
GAAUGGGG G CAGCCAAG 



GGGACUGG GCCGAAAGGCGAGUCAAGGUCU UGGGCCCA 
CCAAGGGA GCCGAAAGGCGAGUCAAGGUCU UGGCUGGG 
AGAAGGUG GCCGAAAGGCGAGUCAAGGUCU UGUCCAAG 



ACCAGAUA GCCGAAAGGCGAGUCAAGGUCU UCCUCAGC 
CUGGGGUA GCCGAAAGGCGAGUCAAGGUCU CAGAUACU 
AAGCCCUG GCCGAAAGGCGAGUCAAGGUCU UGGGGUAC 
AGAAGAAG GCCGAAAGGCGAGUCAAGGUCU CCUGCUGG 



GGACCAUG GCCGAAAGGCGAGUCAAGGUCU CCCCAGCG 
GUGGUGGA GCCGAAAGGCGAGUCAAGGUCU CAUGCCCC 
CUGCGGUG GCCGAAAGGCGAGUCAAGGUCU CUGUGGUG 
UAGAUGAG GCCGAAAGGCGAGUCAAGGUCU UGCGGUGC 
CACCGCCA GCCGAAAGGCGAGUCAAGGUCU UCCUGGUA 
CCCCACCG GCCGAAAGGCGAGUCAAGGUCU CACUCCUG 
GGUCCCCA GCCGAAAGGCGAGUCAAGGUCU CGCCACUC 
GGOJCCAG GCCGAAAGGCGAGUCAAGGUCU CCUAGUGU 
UCAGAGGG GCCGAAAGGCGAGUCAAGGUCU UCCAGCCC 
CCUGGGGG GCCGAAAGGCGAGUCAAGGUCU CUCCUCUU 
AGUGGAGA GCCGAAAGGCGAGUCAAGGUCU CUGGGGGC 
GGAGGGUG GCCGAAAGGCGAGUCAAGGUCU CAGUGGAG 
GGAGCCAG GCCGAAAGGCGAGUCAAGGUCU CCCUUCGG 
CAUCGGAG GCCGAAAGGCGAGUCAAGGUCU CAGCCCCU 
CCAGGUCA GCCGAAAGGCGAGUCAAGGUCU CAUCAAAU 
CUUGGCUG GCCGAAAGGCGAGUCAAGGUCU CCCCAUUC 
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3465 
3473 
3494 



UGGGGGCA G CCAAGGGG 
GCCAAGGG G CUGCAAAG 
GCUGCAAA G CCUCCCCA 
UGACCCCA G CCCUCUAC 
CCUCUACA G CGGUACAG 
G UACAGUGA 



CCCCUUGG GCCGAAAGGCGAGUCAAGGUCU UGCCCCCA 
CUUUGCAG GCCGAAAGGCGAGUCAAGGUCU CCCUUGGC 
UGGGGAGG GCCGAAAGGCGAGUCAAGGUCU UUUGCAGC 
GUAGAGGG GCCGAAAGGCGAGUCAAGGUCU UGGGGUCA 



GCCGAAAGGCGAGUCAAGGUCU CGCUGUAG 



GCGGUACA G UGAGGACC 



GGUCCUCA GCCGAAAGGCGAGUCAAGGUCU I 



ACCCCACA G UACCCCUG 



CAGGGGUA GCCGAAAGGCGAGUCAAGGUCU UGUGGGGU 



GACUGAUG G CUACGUUG 



CAACGUAG GCCGAAAGGCGAGUCAAGGUCU CAUCAGUC 



355S 
3575 



AUGGCUAC G UUGCCCCC 



AGCCCCCA G CCUGAAUA 



UAUUCAGG GCC 



GUGAACCA G CCAGAUGU 



IGCGAGUCAAGGUCU UGGUUCAC 



3612 
361S 



GAUGUUCG G CCCCAGCC 



GGCUGGGG GCCGAAAGGCGAGUCAAGGUCU CGAACAUC 



CGGCCCCA G CCCCCUUC 



GAAGGGGG GCCGAAAGGCGAGUCAAGGUCU UGGGGCCG 



CCGAGAGG G CCCUCUGC 



CGAAAGGCGAGUC 



ACCUGCUG G UGCCACl 



GAGUGGCA GCCGAAAGGCGAGUCAAGGUCU CAGCAGGU 



CUGGAAAG G 



GCCGAAAGGCGAGUCAAGGUCU CUUUCC? 



AGAAUGGG G UCGUCAAA 



UUUGACGA GCCGAAAGGCGAGUCAAGGUCU CCCAUUCU 



3 UCAAAGAC 



GUCUUUGA GCCGAAAGGCGAGUCAAGGUCU GACCCCAU 



UCAAAGAC G UUUUUGCC 



GGCAAAAA GCCGAAAGGCGAGUCAAGGUCU GUCUUUGA 



3740 
3745 
3759 



CUUUGGGG G UGCCGUGG 



CCACGGCA GCCGAAAGGCGAGUCAAGGUCU CCCCAAAG 



GGGGUGCC G 
AACCCCGA G 



GUUCUCCA GCCGAAAGGCGAGUCAAGGUCU GGCACCCC 



UACUUGAC 



GUCAAGUA GCCGAAAGGCGAGUCAAGGUCU UCGGGGUU 



AGGGAGGA G CUGCCCCU 



AGGGGCAG GCCGAAAGGCGAGUCAAGGUCU UCCUCCCU 



3792 
3815 
3820 
3861 
3868 



GCCCCUCA G CCCCACCC 



GGGUGGGG GCCGAAAGGCGAGUCAAGGUCU UGAGGGGC 



1 CCCAGCCU 



AGGCUGGG GCCGAAAGGCGAGUCAAGGUCU UGAAGGCA 



UCAGCCCA G CCUUCGAC 



GUCGAAGG GCCGAAAGGCGAGUCAAGGUCU UGGGCUGA 



CCACCAGA G CGGGGGGC 



GCCCCCCG GCCGAAAGGCGAGUCAAGGUCU UCUGGUGG 



AGCGGGGG G CUCCACCC 



GGGUGGAG GCCGAAAGGCGAGUCAAGGUCU CCCCCGCU 



UCCACCCA G CACCUUCA 



UGAAGGUG GCCGAAAGGCGAGUCAAGGUCU UGGGUGGA 



CACCUACG G CAGAGAAC 



GUUCUCUG GCCGAAAGGCGAGUCAAGGUCU CGUAGGUG 



AACCCAGA G UACCUGGG 



CCCAGGUA GCCGAAAGGCGAGUCAAGGUCU UCUGGGUU 



GUACCUGG G UCUGGACG 



CGUCCAGA GCCGAAAGGCGAGUCAAGGUCU CCAGGUAC 



GUCUGGAC G UGCCAGUG 



CACUGGCA GCCGAAAGGCGAGUCAAGGUCU GUCCAGAC 



ACGUGCCA G UGUGAACC 



GGUUCACA GCCGAAAGGCGAGUCAAGGUCU UGGCACGU 



ACCAGAAG G CCAAGUCC 



GGACUUGG GCCGAAAGGCGAGUCAAGGUCU CUUCUGGU 



AAGGCCAA G UCCGCAGA 



UCUGCGGA GCCGAAAGGCGAGUCAAGGUCU UUGGCCUU 



CCGCAGAA G CCCUGAUG 



CAUCAGGG GCCGAAAGGCGAGUCAAGGUCU UUCUGCGG 



CUCAGGGA G CAGGGAAG 



CUUCCCUG GCCGAAAGGCGAGUCAAGGUCU UCCCUGAG 



CAGGGAAG G CCUGACUU 



AAGUCAGG GCCGAAAGGCGAGUCAAGGUCU CUUCCCUG 



UUCUGCUG G CAUCAAGA 



UCUUGAUG GCCGAAAGGCGAGUCAAGGUCU CAGCAGAA 



AUCAAGAG G UGGGAGGG 
GUGGGAGG G CCCUCCGA 
CUGCUUGA G UUCCCAGA 
CCCAGAUG G CUGGAAGG 
UGGAAGGG G UCCAGCCU 



GCCGAAAGGCGAGUCAAGGUCU CUCUUGAU 
UCGGAGGG GCCGAAAGGCGAGUCAAGGUCU CCUCCCAC 
UCUGGGAA GCCGAAAGGCGAGUCAAGGUCU UCAAGCAG 
CCUUCCAG GCCGAAAGGCGAGUCAAGGUCU CAUCUGGG 
AGGCUGGA GCCGAAAGGCGAGUCAAGGUCU CCCUUCCA 
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GC-GGUCCA G CCUC3UUG 
CCAGCCUC G UUGGAAGA 
GAGGAACA G CACUGGGG 
ACUGGGGA G 



CAACGAGG GCCGAAAGGCGAGUCAAGGUCU UGGACCCC 
UCUUCCAA GCCGAAAGGCGAGUCAAGGUCU GAGGCUGG 
CCCCAGUG GCCGAAAGGCGAGUCAAGGUCU UGUUCCUC 
CACAAAGA GCCGAAAGGCGAGUCAAGGUCU UCCCCAGU 



AUUCUGAG G CCCUGCCC 



ACUCUAGG G UCCAGUGG 



CCACUGGA GCCGAAAGGCGAGUCAAGGUCU CCUAGAGU 



AGGGUCCA G UGGAUGCC 



GGCAUCCA GCCGAAAGGCGAGUCAAGGUCU UGGACCCU 



AUGCCACA G CCCAGCUU 



G CCGAAAGGCGAGUCAAGGUCU 



ACAGCCCA G CUUGGCCC 



GGGCCAAG GCCGAAAGGCGAGUCAAGGUCU UGGGCUGU 



CCAGCUUG G 



GGAAAGGG GCCGAAAGGCGAGUCAAGGUCU CAAGCUGG 



UUUCAGUA GCCGAAAGGCGAGUCAAGGUCU CCAGGAUC 



UACUGAAA G CCUl 



CCGAAAGGCGAGUCAAGGUCU 



UUAGGGAA G CUGGCCUG 



CAGGCCAG GCC 



■GCGAGUCAAGGUCU UUCCCUAA 



GGAAGCUG G CCUGAGAG 



!UCUCAGG GCCGAAAGGCGAGUCAAGGUCU CAGCUUCC 



GAGGGGAA G CGGCCCUA 



UAGGGCCG GCCGAAAGGCGAGUCAAGGUCU UUCCCCUC 



GGGAAGCG G CCCUAAGG 



, A AGGCGAGUC A AGGUCU 



CUAAGGGA G 



GAAACCUA G UACUSCCC 
AAGGAACA G CAAUGGUG 
CAGCAAUG G UGUCAGUA 
UGGUGUCA G UAUCCAGG 
GUAUCCAG G CUUUGUAC 
UGUACAGA G UGCUUUUC 
UCUGUUUA G UUUUUACU 



CUUAGACA GCCGAAAGGCGAGUCAAGGUCU UCCCUUAG 
AUGGGUCG GCCGAAAGGCGAGUCAAGGUCU UUUUGUUC 
GGGCAGUA GCCGAAAGGCGAGUCAAGGUCU UAGGUUUC 
CACCAUUG GCCGAAAGGCGAGUCAAGGUCU UGUUCCUU 
UACUGACA GCCGAAAGGCGAGUCAAGGUCU CAUUGCUG 
CCUGGAUA GCCGAAAGGCGAGUCAAGGUCU UGACACCA 
GUACAAAG GCCGAAAGGCGAGUCAAGGUCU CUGGAUAC 
GAAAAGCA GCCGAAAGGCGAGUCAAGGUCU UCUGUACA 
AGUAAAAA GCCGAAAGGCGAGUCAAGGUCU UAAACAGA 



Input Sequence = HSERB2R. Cut Site = G/Y 

Stem Length = 8 . Core Sequence = GCcgaaagGCGaGuCaaGGuCu 
HSERB2R (Human c-erb-B-2 mRNA; 4473 bp) 
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Table 60 



Table 60: Substrate Specificity for Class I Ribozymes 



Substrate sequence 


l-9t mutation 


krel 


5'-GCCGU G GGUUGCAC ACCUUUCC-3' 


w.t. 


1.00 


5'-GCCGU G GGUUGCAC ACCUUUCC-3' 


A57G 


2.5 


5'-GCCGA G GGUUGCAC ACCUUUCC-3' 


A57U 


0.24 


5'-GCCGC G GGUUGCAC ACCUUUCC-3' 


A57G • 


0.66 


5'-GCCGG G GGUUGCAC ACCUUUCC-3' 


A57C 


0.57 


5'-GCCGU U GGUUGCAC ACCUUUCC-3' 


w.t 


0.17 


5'-GCCGU A GGUUGCAC ACCUUUCC-3' 


w.t. 


n.d. 


5'-GCCGU C GGUUGCAC ACCUUUCC-3' 


w.t. 


n.d. 


5'-GCCGU G GGUUGCAC ACCUUUCC-3' 


C16U 


0.98 


5'-GCCGU G UGUUGCAC ACCUUUCC-3' 


C16G 


n.d. 


5'-GCCGU G UGUUGCAC ACCUUUCC-3' 


C16A 


0.65 


5'-GCCGU G AGUUGCAC ACCUUUCC-3' 


C16U 


0.45 


5'-GCCGU G CGUUGCAC ACCUUUCC-3' 


C16G 


0.73 


5'-GCCGU G GGUUGCAC ACCUUU-3' 


w.t. 


0.89 


5'-GCCGU G GGUUGCAC ACCU-3' 


w.t. 


1.0 


5'-GCCGU G GGUUGCAC AC-3' 


w.t. 


0.67 
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Table 63 

Table 63: Human PKCa NCH Ribozyme and Substrate Sequence 



Pos 


Substrate 


Seq ID 


Ribozyme 


Rz Seq 
ID 


27 


GGGGGGAC C AUGGCUGA 




UCAGCCAU CUGAUGAG X CGAA IUCCCCCC 




28 


GGGGGACC A UGGCUGAC 




GUCAGCCA CUGAUGAG X CGAA IGUCCCCC 




33 


ACCAUGGC U GACGUUUU 




AAAACGUC CUGAUGAG X CGAA ICCAUGGU 




43 


ACGUUUUC C CGGGCAAC 




GUUGCCCG CUGAUGAG X CGAA IAAAACGU 




44 


CGUUUUCC C GGGCAACG 




CGUUGCCC CUGAUGAG X CGAA IGAAAACG 




49 


UCCCGGGC A ACGACUCC 




GGAGUCGU CUGAUGAG X CGAA ICCCGGGA 




55 


GCAACGAC U CCACGGCG 




CGCCGUGG CUGAUGAG X CGAA IUCGUUGC 




57 


AACGACUC C ACGGCGUC 




GACGCCGU CUGAUGAG X CGAA IAGUCGUU 




58 


ACGACUCC A CGGCGUCU 




AGACGCCG CUGAUGAG X CGAA IGAGUCGU 




66 


ACGGCGUC U CAGGACGU 




ACGUCCUG CUGAUGAG X CGAA IACGCCGU 




68 


GGCGUCUC A GGACGUGG 




CCACGUCC CUGAUGAG X CGAA IAGACGCC 




78 


GACGUGGC C AACCGCUU 




AAGCGGUU CUGAUGAG X CGAA ICCACGUC 




79 


ACGUGGCC A ACCGCUUC 




GAAGCGGU CUGAUGAG X CGAA IGCCACGU 




82 


UGGCCAAC C GCUUCGCC 




GGCGAAGC CUGAUGAG X CGAA IUUGGCCA 




85 


CCAACCGC U UCGCCCGC 




GCGGGCGA CUGAUGAG X CGAA ICGGUUGG 




90 


CGCUUCGC C CGCAAAGG 




CCUUUGCG CUGAUGAG X CGAA ICGAAGCG 




91 


GCUUCGCC C GCAAAGGG 




CCCUUUGC CUGAUGAG X CGAA I GCGAAGC 




94 


UCGCCCGC A AAGGGGCG 




CGCCCCUU CUGAUGAG X CGAA ICGGGCGA 




104 


AGGGGCGC U GAGGCAGA 




UCUGCCUC CUGAUGAG X CGAA ICGCCCCU 




110 


GCUGAGGC A GAAGAACG 




CGUUCUUC CUGAUGAG X CGAA ICCUCAGC 




122 


GAACGUGC A CGAGGUGA 




UCACCUCG CUGAUGAG X CGAA ICACGUUC 




136 


UGAAGGAC C ACAAAUUC 




GAAUUUGU CUGAUGAG X CGAA IUCCUUCA 




137 


GAAGGACC A CAAAUUCA 




UGAAUUUG CUGAUGAG X CGAA IGUCCUUC 




139 


AGGACCAC A AAUUCAUC 




GAUGAAUU CUGAUGAG X CGAA IUGGUCCU 




145 


ACAAAUUC A UCGCGCGC 




GCGCGCGA CUGAUGAG X CGAA IAAUUUGU 




154 


UCGCGCGC U UCUUCAAG 




CUUGAAGA CUGAUGAG X CGAA ICGCGCGA 




157 


CGCGCUUC U UCAAGCAG 




CUGCUUGA CUGAUGAG X CGAA IAAGCGCG 




160 


GCUUCUUC A AGCAGCCC 




GGGCUGCU CUGAUGAG X CGAA IAAGAAGC 




164 


CUUCAAGC A GCCCACCU 




AGGUGGGC CUGAUGAG X CGAA ICUUGAAG 




167 


CAAGCAGC C CACCUUCU 




AGAAGGUG CUGAUGAG X CGAA ICUGCUUG 




168 


AAGCAGCC C ACCUUCUG 




CAGAAGGU CUGAUGAG X CGAA IGCUGCUU 




169 


AGCAGCCC A CCUUCUGC 




GCAGAAGG CUGAUGAG X CGAA IGGCUGCU 




171 


CAGCCCAC C UUCUGCAG 




CUGCAGAA CUGAUGAG X CGAA IUGGGCUG 




172 


AGCCCACC U UCUGCAGC 




GCUGCAGA CUGAUGAG X CGAA IGUGGGCU 




175 


CCACCUUC U GCAGCCAC 




GUGGCUGC CUGAUGAG X CGAA IAAGGUGG 




178 


CCUUCUGC A GCCACUGC 




GCAGUGGC CUGAUGAG X CGAA ICAGAAGG 




181 


UCUGCAGC C ACUGCACC 




GGUGCAGU CUGAUGAG X CGAA ICUGCAGA 






CUGCAGCC A CUGCACCG 




CGGUGCAG CUGAUGAG X CGAA IGCUGCAG 




184 


GCAGCCAC U GCACCGAC 




GUCGGUGC CUGAUGAG X CGAA IUGGCUGC 




187 


GCCACUGC A CCGACUUC 




GAAGUCGG CUGAUGAG X CGAA ICAGUGGC 




189 


CACUGCAC C GACUUCAU 




AUGAAGUC CUGAUGAG X CGAA IUGCAGUG 




193 


GCACCGAC U UCAUCUGG 




CC AG AUG A CUGAUGAG X CGAA IUCGGUGC 




196 


CCGACUUC A UCUGGGGG 




CCCCCAGA CUGAUGAG X CGAA IAAGUCGG 




199 


ACUUCAUC U GGGGGUUU 




AAACCCCC CUGAUGAG X CGAA IAUGAAGU 




215 


UGGGAAAC A AGGCUUCC 




GGAAGCCU CUGAUGAG X CGAA IUUUCCCA 




220 


AACAAGGC U UCCAGUGC 




GCACUGGA CUGAUGAG X CGAA ICCUUGUU 




223 


AAGGCUUC C AGUGCCAA 




UUGGCACU CUGAUGAG X CGAA IAAGCCUU 
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AGGCUUCC A GUGCCAAG 
UCCAGUGC C AAGUUUGC 
CCAGUGCC A AGUUUGCU 
AAGUUUGC U GUUUUGUG 
UUGUGGUC C ACAAGAGG 
UGUGGUCC A CAAGAGGU 
UGGUCCAC A AGAGGUGC 
AGAGGUGC C AUGAAUUU 
GAGGUGCC A UGAAUUUG 
UUUGUUAC U UUUUCUUG 
ACUUUUUC U UGUCCGGG 
UUCUUGUC C GGGUGCGG 
UAAGGGAC C CGACACUG 
AAGGGACC C GACACUGA 
GACCCGAC A CUGAUGAC 
CCCGACAC U GAUGACCC 
CUGAUGAC C CCAGGAGC 
UGAUGACC C CAGGAGCA 
GAUGACCC C AGGAGCAA 
AUGACCCC A GGAGCAAG 
CCAGGAGC A AGCACAAG 
GAGCAAGC A CAAGUUCA 
GCAAGCAC A AGUUCAAA 
ACAAGUUC A AAAUCCAC 
UCAAAAUC C ACACUUAC 
CAAAAUCC A CACUUACG 
AAAUCCAC A CUUACGGA 
AUCCACAC U UACGGAAG 
ACGGAAGC C CCACCUUC 
CGGAAGCC C CACCUUCU 
GGAAGCCC C ACCUUCUG 
GAAGCCCC A CCUUCUGC 
AGCCCCAC C UUCUGCGA 
GCCCCACC U UCUGCGAU 
CCACCUUC U GCGAUCAC 
CUGCGAUC A CUGUGGGU 
GCGAUCAC U GUGGGUCA 
UGUGGGUC A CUGCUCUA 
UGGGUCAC U GCUCUAUG 
GUCACUGC U CUAUGGAC 
CACUGCUC U AUGGACUU 
CUAUGGAC U UAUCCAUC 
GACUUAUC C AUCAAGGG 
ACUUAUCC A UCAAGGGA 
UAUCCAUC A AGGGAUGA 
AAUGUGAC A CCUGCGAU 
UGUGACAC C UGCGAUAU 
GUGACACC U GCGAUAUG 
GAACGUUC A CAAGCAAU 
ACGUUCAC A AGCAAUGC 
UCACAAGC A AUGCGUCA 



CUUGGCAC CUGAUGAG X CGAA IGAAGCCU 
GCAAACUU CUGAUGAG X CGAA ICACUGGA 
AGCAAACU CUGAUGAG X CGAA IGCACUGG 
CACAAAAC CUGAUGAG X CGAA ICAAACUU 
CCUCUUGU CUGAUGAG X CGAA IACCACAA 
ACCUCUUG CUGAUGAG X CGAA IGACCACA 
GCACCUCU CUGAUGAG X CGAA IUGGACCA 
AAAUUCAU CUGAUGAG X CGAA ICACCUCU 
CAAAUUCA CUGAUGAG X CGAA IGCACCUC 
CAAGAAAA CUGAUGAG X CGAA IUAACAAA 
CCCGGACA CUGAUGAG X CGAA IAAAAAGU 
CCGCACCC CUGAUGAG X CGAA IACAAGAA 
CAGUGUCG CUGAUGAG X CGAA IUCCCUUA 
UCAGUGUC CUGAUGAG X CGAA IGUCCCUU 
GUCAUCAG CUGAUGAG X CGAA IUCGGGUC 
GGGUCAUC CUGAUGAG X CGAA IUGUCGGG 
GCUCCUGG CUGAUGAG X CGAA IUCAUCAG 
UGCUCCUG CUGAUGAG X CGAA IGUCAUCA 
UUGCUCCU CUGAUGAG X CGAA IGGUCAUC 
CUUGCUCC CUGAUGAG X CGAA IGGGUCAU 
CUUGUGCU CUGAUGAG X CGAA ICUCCUGG 



UGAACUUG CUGAUGAG X CGAA 
UUUGAACU CUGAUGAG X CGAA IUGCUUGC 
GUGGAUUU CUGAUGAG X CGAA IAACUUGU 
GUAAGUGU CUGAUGAG X CGAA IAUUUUGA 
CGUAAGUG CUGAUGAG X CGAA IGAUUUUG 
UCCGUAAG CUGAUGAG X CGAA IUGGAUUU 
CUUCCGUA CUGAUGAG X CGAA IUGUGGAU 
GAAGGUGG CUGAUGAG X CGAA ICUUCCGU 
AGAAGGUG CUGAUGAG X CGAA IGCUUCCG 
CAGAAGGU CUGAUGAG X CGAA IGGCUUCC 
GCAGAAGG CUGAUGAG X CGAA IGGGCUUC 
UCGCAGAA CUGAUGAG X CGAA IUGGGGCU 
AUCGCAGA CUGAUGAG X CGAA IGUGGGGC 
GUGAUCGC CUGAUGAG X CGAA IAAGGUGG 
ACCCACAG CUGAUGAG X CGAA IAUCGCAG 
UGACCCAC CUGAUGAG X CGAA IUGAUCGC 
UAGAGCAG CUGAUGAG X CGAA IACCCACA 
CAUAGAGC CUGAUGAG X CGAA IUGACCCA 
GUCCAUAG CUGAUGAG X CGAA ICAGUGAC 
AAGUCCAU CUGAUGAG X CGAA IAGCAGUG 
GAUGGAUA CUGAUGAG X CGAA IUCCAUAG 
CCCUUGAU CUGAUGAG X CGAA IAUAAGUC 
UCCCUUGA CUGAUGAG X CGAA IGAUAAGU 
UCAUCCCU CUGAUGAG X CGAA IAUGGAUA 
AUCGCAGG CUGAUGAG X CGAA IUCACAUU 
AUAUCGCA CUGAUGAG X CGAA IUGUCACA 
CAUAUCGC CUGAUGAG X CGAA IGUGUCAC 
AUUGCUUG CUGAUGAG X CGAA IAACGUUC 
GCAUUGCU CUGAUGAG X CGAA IUGAACGU 
UGACGCAU CUGAUGAG X CGAA ICUUGUGA 
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AAUGCGUC A UCAAUGUC 



GACAUUGA CUGAUGAG X CGAA 



GCGUCAUC A AUGUCCCC 



GGGGACAU CUGAUGAG X CGAA IAUGACGC 



UCAAUGUC C CCAGCCUC 



GAGGCUGG CUGAUGAG X CGAA IACAUUGA 



CAAUGUCC C CAGCCUCU 



AGAGGCUG CUGAUGAG X CGAA IGACAUUG 



AAUGUCCC C AGCCUCUG 



CAGAGGCU CUGAUGAG X CGAA IGGACAUU 



AUGUCCCC A GCCUCUGC 



GCAGAGGC CUGAUGAG X CGAA IGGGACAU 



UCCCCAGC C UCUGCGGA 



UCCGCAGA CUGAUGAG X CGAA ICUGGGGA 



CCCCAGCC U CUGCGGAA 



UUCCGCAG CUGAUGAG X CGAA IGCUGGGG 



CCAGCCUC U GCGGAAUG 



CAUUCCGC CUGAUGAG X CGAA IAGGCUGG 



AAUGGAUC A CACUGAGA 



UCUCAGUG CUGAUGAG X CGAA IAUCCAUU 



UGGAUCAC A CUGAGAAG 



CUUCUCAG CUGAUGAG X CGAA IUGAUCCA 



GAUCACAC U GAGAAGAG 



CUCUUCUC CUGAUGAG X CGAA IUGUGAUC 



GGAUUUAC C UAAAGGCU 



AGCCUUUA CUGAUGAG X CGAA IUAAAUCC 



518 
525 



GAUUUACC U AAAGGCUG 



CAGCCUUU CUGAUGAG X CGAA IGUAAAUC 



CUAAAGGC U GAGGUUGC 



GCAACCUC CUGAUGAG X CGAA ICCUUUAG 



GAGGUUGC U GAUGAAAA 



UUUUCAUC CUGAUGAG X CGAA ICAACCUC 



545 
547 



UGAAAAGC U CCAUGUCA 



UGACAUGG CUGAUGAG X CGAA ICUUUUCA 



AAAAGCUC C AUGUCACA 



UGUGACAU CUGAUGAG X CGAA IAGCUUUU 



AAAGCUCC A UGUCACAG 



CUGUGACA CUGAUGAG X CGAA IGAGCUUU 



UCCAUGUC A CAGUACGA 



UCGUACUG CUGAUGAG X CGAA IACAUGGA 



CAUGUCAC A GUACGAGA 



UCUCGUAC CUGAUGAG X CGAA IUGACAUG 



CGAGAUGC A AAAAAUCU 



AGAUUUUU CUGAUGAG X CGAA ICAUCUCG 



AAAAAAUC U AAUCCCUA 



UAGGGAUU CUGAUGAG X CGAA IAUUUUUU 



AUCUAAUC C CUAUGGAU 



AUCCAUAG CUGAUGAG X CGAA IAUUAGAU 



UCUAAUCC C UAUGGAUC 



GAUCCAUA CUGAUGAG X CGAA IGAUUAGA 



CUAAUCCC U AUGGAUCC 



GGAUCCAU CUGAUGAG X CGAA IGGAUUAG 



UAUGGAUC C AAACGGGC 



GCCCGUUU CUGAUGAG X CGAA IAUCCAUA 



AUGGAUCC A 
AAACGGGC U UUCAGAUC 
GGGCUUUC A GAUCCUUA 
UUCAGAUC C UUAUGUGA 
UCAGAUCC U UAUGUGAA 
UGUGAAGC U GAAACUUA 
GCUGAAAC U UAUUCCUG 
ACUUAUUC C UGAUCCCA 
CUUAUUCC U GAUCCCAA 
UCCUGAUC C CAAGAAUG 
CCUGAUCC C AAGAAUGA 
CUGAUCCC A AGAAUGAA 
AUGAAAGC A AGCAAAAA 
AAGCAAGC A AAAAACCA 
CAAAAAAC C AAAACCAU 
AAAAAACC A AAACCAUC 
ACCAAAAC C AUCCGCUC 
CCAAAACC A UCCGCUCC 
AAACCAUC C GCUCCACA 
CCAUCCGC U CCACACUA 
AUCCGCUC C ACACUAAA 
UCCGCUCC A CACUAAAU 
CGCUCCAC A CUAAAUCC 
CUCCACAC U AAAUCCGC 



AGCCCGUU CUGAUGAG X CGAA IGAUCCAU 
GAUCUGAA CUGAUGAG X CGAA ICCCGUUU 



UAAGGAUC CUGAUGAG X C 



i IAAAGCCC 



UCACAUAA CUGAUGAG X CGAA IAUCUGAA 

UUCACAUA CUGAUGAG X CGAA IGAUCUGA 

UAAGUUUC CUGAUGAG X CGAA ICUUCACA 

CAGGAAUA CUGAUGAG X CGAA IUUUCAGC 

UGGGAUCA CUGAUGAG X CGAA IAAUAAGU 

UUGGGAUC CUGAUGAG X CGAA IGAAUAAG 

CAUUCUUG CUGAUGAG X CGAA IAUCAGGA 

UCAUUCUU CUGAUGAG X CGAA IGAUCAGG 

UUCAUUCU CUGAUGAG X CGAA IGGAUCAG 

UUUUUGCU CUGAUGAG X CGAA ICUUUCAU 

UGGUUUUU CUGAUGAG X CGAA ICUUGCUU 

AUGGUUUU CUGAUGAG X CGAA IUUUUUUG 

GAUGGUUU CUGAUGAG X CGAA IGUUUUUU 

GAGCGGAU CUGAUGAG X CGAA IUUUUGGU 

GGAGCGGA CUGAUGAG X CGAA IGUUUUGG 

UGUGGAGC CUGAUGAG X CGAA IAUGGUUU 

UAGUGUGG CUGAUGAG X CGAA I CGGAUGG 

UUUAGUGU CUGAUGAG X CGAA IAGCGGAU 

AUUUAGUG CUGAUGAG X CGAA IGAGCGGA 

GGAUUUAG CUGAUGAG X CGAA IUGGAGCG 

GCGGAUUU CUGAUGAG X CGAA IUGUGGAG 
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ACUAAAUC C GCAGUGGA 
AAAUCCGC A GUGGAAUG 
AAUGAGUC C UUUACAUU 
AUGAGUCC U UUACAUUC 
UCCUUUAC A UUCAAAUU 
UUACAUUC A AAUUGAAA 
AUUGAAAC C UUCAGACA 
UUGAAACC U UCAGACAA 
AAACCUUC A GACAAAGA 
CUUCAGAC A AAGACCGA 
ACAAAGAC C GACGACUG 
CCGACGAC U 



UCCACUGC CUGAUGAG X CGAA IAUUUAGU 

CAUUCCAC CUGAUGAG X CGAA ICGGAUUU 

AAUGUAAA CUGAUGAG X CGAA IACUCAUU 

GAAUGUAA CUGAUGAG X CGAA IGACUCAU 

AAUUUGAA CUGAUGAG X CGAA IUAAAGGA 

UUUCAAUU CUGAUGAG X CGAA IAAUGUAA 

UGUCUGAA CUGAUGAG X CGAA IUUUCAAU 

UUGUCUGA CUGAUGAG X CGAA IGUUUCAA 

UCUUUGUC CUGAUGAG X CGAA IAAGGUUU 

UCGGUCUU CUGAUGAG X CGAA IUCUGAAG 

CAGUCGUC CUGAUGAG X CGAA IUCUUUGU 

CUACAGAC CUGAUGAG X CGAA IUCGUCGG 



AUUUCUAC CUGAUGAG X CGAA IACAGUCG 



UAGAAAUC U GGGACUGG 



CCAGUCCC CUGAUGAG X CGAA IAUUUCUA 



UCUGGGAC U GGGAUCGA 



UCGAUCCC CUGAUGAG X CGAA IUCCCAGA 



GAUCGAAC A ACAAGGAA 



UUCCUUGU CUGAUGAG X CGAA IUUCGAUC 



CGAACAAC A AGGAAUGA 



UCAUUCCU CUGAUGAG X CGAA IUUGUUCG 



GGAAUGAC U UCAUGGGA 



UCCCAUGA CUGAUGAG X CGAA IUCAUUCC 



AUGACUUC A UGGGAUCC 



GGAUCCCA CUGAUGAG X CGAA IAAGUCAU 



AUGGGAUC C CUUUCCUU 



AAGGAAAG CUGAUGAG X CGAA IAUCCCAU 



UGGGAUCC C UUUCCUUU 



AAAGGAAA CUGAUGAG X CGAA IGAUCCCA 



UCCCUUUC C UUUGGAGU 
CCCUUUCC U UUGGAGUU 
UUCGGAGC U GAUGAAGA 
GAAGAUGC C GGCCAGUG 
AUGCCGGC C AGUGGAUG 
UGCCGGCC A GUGGAUGG 
GAUGGUAC A AGUUGCUU 
CAAGUUGC U UAACCAAG 
UGCUUAAC C AAGAAGAA 
GCUUAACC A AGAAGAAG 
GUGAGUAC U ACAACGUA 
AGUACUAC A ACGUACCC 
CAACGUAC C CAUUCCGG 
AACGUACC C AUUCCGGA 
ACGUACCC A UUCCGGAA 
ACCCAUUC C GGAAGGGG 
AAGGAAAC A UGGAACUC 
CAUGGAAC U CAGGCAGA 
UGGAACUC A GGCAGAAA 
ACUCAGGC A GAAAUUCG 
GAGAAAGC C AAACUUGG 
AGAAAGCC A AACUUGGC 
AGCCAAAC U UGGCCCUG 
AACUUGGC C CUGCUGGC 
ACUUGGCC C UGCUGGCA 
CUUGGCCC U GCUGGCAA 
GGCCCUGC U GGCAACAA 
CUGCUGGC A ACAAAGUC 
CUGGCAAC A AAGUCAUC 



CAAAGGAA CUGAUGAG X CGAA IGGAUCCC 
ACUCCAAA CUGAUGAG X CGAA IAAAGGGA 
AACUCCAA CUGAUGAG X CGAA IGAAAGGG 
UCUUCAUC CUGAUGAG X CGAA ICUCCGAA 
CACUGGCC CUGAUGAG X CGAA ICAUCUUC 
CAUCCACU CUGAUGAG X CGAA ICCGGCAU 
CCAUCCAC CUGAUGAG X CGAA IGCCGGCA 
AAGCAACU CUGAUGAG X CGAA IUACCAUC 
CUUGGUUA CUGAUGAG X CGAA ICAACUUG 
UUCUUCUU CUGAUGAG X CGAA IUUAAGCA 
CUUCUUCU CUGAUGAG X CGAA IGUUAAGC 
UACGUUGU CUGAUGAG X CGAA IUACUCAC 
GGGUACGU CUGAUGAG X CGAA IUAGUACU 
CCGGAAUG CUGAUGAG X CGAA IUACGUUG 
UCCGGAAU CUGAUGAG X CGAA IGUACGUU 
UUCCGGAA CUGAUGAG X CGAA IGGUACGU 
CCCCUUCC CUGAUGAG X CGAA IAAUGGGU 
GAGUUCCA CUGAUGAG X CGAA IUUUCCUU 
UCUGCCUG CUGAUGAG X CGAA IUUCCAUG 
UUUCUGCC CUGAUGAG X CGAA IAGUUCCA 
CGAAUUUC CUGAUGAG X CGAA ICCUGAGU 
CCAAGUUU CUGAUGAG X CGAA ICUUUCUC 
GCCAAGUU CUGAUGAG X CGAA IGCUUUCU 
CAGGGCCA CUGAUGAG X CGAA IUUUGGCU 
GCCAGCAG CUGAUGAG X CGAA ICCAAGUU 
UGCCAGCA CUGAUGAG X CGAA IGCCAAGU 
UUGCCAGC CUGAUGAG X CGAA IGGCCAAG 
UUGUUGCC CUGAUGAG X CGAA ICAGGGCC 
GACUUUGU CUGAUGAG X CGAA ICCAGCAG 
GAUGACUU CUGAUGAG X CGAA IUUGCCAG 
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1011 

1012 
1015 



1034 
1036 
1042 
1045 
1051 
1052 
1054 



1203 
1204 
1205 
1208 
1213 
1217 



1220 
1225 
1232 
1235 
123B 



ACAAAGUC A UCAGUCCC 
AAGUCAUC A GUCCCUCU 
CAUCAGUC C CUCUGAAG 
AUCAGUCC C UCUGAAGA 
UCAGUCCC U CUGAAGAC 
AGUCCCUC U GAAGACAG 
CUGAAGAC A GGAAACAA 
CAGGAAAC A ACCUUCCA 
GAAACAAC C UUCCAACA 
AAACAACC U UCCAACAA 
CAACCUUC C AACAACCU 
AACCUUCC A ACAACCUU 
CUUCCAAC A ACCUUGAC 
CCAACAAC C UUGACCGA 
CAACAACC U UGACCGAG 
ACCUUGAC C GAGUGAAA 
AGUGAAAC U CACGGACU 
UGAAACUC A CGGACUUC 
UCACGGAC U UCAAUUUC 
CGGACUUC A AUUUCCUC 
UCAAUUUC C UCAUGGUG 
CAAUUUCC U CAUGGUGU 
AUUUCCUC A UGGUGUUG 
GGUGAUGC U UGCCGACA 
AUGCUUGC C GACAGGAA 
UUGCCGAC A GGAAGGGC 
GGAAGGGC A CAGAAGAA 
AAGGGCAC A GAAGAACU 
AGAAGAAC U GUAUGCAA 
CUGUAUGC A AUCAAAAU 
AUGCAAUC A AAAUCCUG 
UCAAAAUC C UGAAGAAG 
CAAAAUCC U GAAGAAGG 
GGUGAUUC A GGAUGAUG 
UGGAGUGC A CCAUGGUA 
GAGUGCAC C AUGGUAGA 
AGUGCACC A UGGUAGAA 
AGCGAGUC U UGGCCCUG 
GUCUUGGC C CUGCUUGA 
UCUUGGCC C UGCUUGAC 
CUUGGCCC U GCUUGACA 
GGCCCUGC U UGACAAAC 
UGCUUGAC A AACCCCCG 
UGACAAAC C CCCGUUCU 
GACAAACC C CCGUUCUU 
ACAAACCC C CGUUCUUG 
CAAACCCC C GUUCUUGA 
CCCCGUUC U UGACGCAG 
CUUGACGC A GCUGCACU 
GACGCAGC U GCACUCCU 
GCAGCUGC A CUCCUGCU 



GGGACUGA CUGAUGAG X CGAA IACUUUGU 

AGAGGGAC CUGAUGAG X CGAA IAUGACUU 

CUUCAGAG CUGAUGAG X CGAA IACUGAUG 

UCUUCAGA CUGAUGAG X CGAA IGACUGAU 

GUCUUCAG CUGAUGAG X CGAA IGGACUGA 

CUGUCUUC CUGAUGAG X CGAA IAGGGACU 

UUGUUUCC CUGAUGAG X CGAA IUCUUCAG 

UGGAAGGU CUGAUGAG X CGAA IUUUCCUG 

UGUUGGAA CUGAUGAG X CGAA IUUGUUUC 

UUGUUGGA CUGAUGAG X CGAA IGUUGUUU 

AGGUUGUU CUGAUGAG X CGAA IAAGGUUG 

AAGGUUGU CUGAUGAG X CGAA IGAAGGUU 

GUCAAGGU CUGAUGAG X CGAA IUUGGAAG 

UCGGUCAA CUGAUGAG X CGAA IUUGUUGG 

CUCGGUCA CUGAUGAG X CGAA IGUUGUUG 

UUUCACUC CUGAUGAG X CGAA IUCAAGGU 

AGUCCGUG CUGAUGAG X CGAA IUUUCACU 

GAAGUCCG CUGAUGAG X CGAA IAGUUUCA 

GAAAUUGA CUGAUGAG X CGAA IUCCGUGA 

GAGGAAAU CUGAUGAG X CGAA IAAGUCCG 

CACCAUGA CUGAUGAG X CGAA IAAAUUGA 

ACACCAUG CUGAUGAG X CGAA IGAAAUUG 

CAACACCA CUGAUGAG X CGAA IAGGAAAU 

UGUCGGCA CUGAUGAG X CGAA ICAUCACC 

UUCCUGUC CUGAUGAG X CGAA ICAAGCAU 
_________________ 

UUCUUCUG CUGAUGAG X CGAA ICCCUUCC 

AGUUCUUC CUGAUGAG X CGAA IUGCCCUU 

UUGCAUAC CUGAUGAG X CGAA IUUCUUCU 

AUUUUGAU CUGAUGAG X CGAA ICAUACAG 

CAGGAUUU CUGAUGAG X CGAA IAUUGCAU 

CUUCUUCA CUGAUGAG X CGAA IAUUUUGA 

CCUUCUUC CUGAUGAG X CGAA IGAUUUUG 

CAUCAUCC CUGAUGAG X CGAA IAAUCACC 

UACCAUGG CUGAUGAG X CGAA ICACUCCA 

UCUACCAU CUGAUGAG X CGAA IUGCACUC 

UUCUACCA CUGAUGAG X CGAA IGUGCACU 

CAGGGCCA CUGAUGAG X CGAA IACUCGCU 

UCAAGCAG CUGAUGAG X CGAA ICCAAGAC 

GUCAAGCA CUGAUGAG X CGAA IGCCAAGA 

UGUCAAGC CUGAUGAG X CGAA IGGCCAAG 

GUUUGUCA CUGAUGAG X CGAA ICAGGGCC 

CGGGGGUU CUGAUGAG X CGAA IUCAAGCA 

AGAACGGG CUGAUGAG X CGAA IUUUGUCA 

AAGAACGG CUGAUGAG X CGAA IGUUUGUC 

CAAGAACG CUGAUGAG X CGAA IGGUUUGU 

UCAAGAAC CUGAUGAG X CGAA IGGGUUUG 

CUGCGUCA CUGAUGAG X CGAA IAACGGGG 

AGUGCAGC CUGAUGAG X CGAA ICGUCAAG 

AGGAGUGC CUGAUGAG X CGAA ICUGCGUC 

AGCAGGAG CUGAUGAG X CGAA ICAGCUGC 
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1242 
1243 
1246 
1249 
1250 
1254 
1265 
1270 
1276 



1301 
1303 
1309 
1310 
1312 
1316 
1319 
1340 
1341 
1343 
1347 
1354 
1362 
1371 
1372 
1384 
1388 
1391 
1405 



1459 
1467 



1532 
1533 
1545 



AGCUGCAC U CCUGCUUC 
CUGCACUC C UGCUUCCA 
UGCACUCC U GCUUCCAG 
ACUCCUGC U UCCAGACA 
CCUGCUUC C AGACAGUG 
CUGCUUCC A GACAGUGG 
UUCCAGAC A GUGGAUCG 
GGAUCGGC U GUACUUCG 
GGCUGUAC U UCGUCAUG 
ACUUCGUC A UGGAAUAU 
AAUAUGUC A ACGGUGGG 
GUGGGGAC C UCAUGUAC 
UGGGGACC U CAUGUACC 
GGGACCUC A UGUACCAC 
UCAUGUAC C ACAUUCAG 
CAUGUACC A CAUUCAGC 
UGUACCAC A UUCAGCAA 
CCACAUUC A GCAAGUAG 
CAUUCAGC A AGUAGGAA 
UAAGGAAC C ACAAGCAG 
AAGGAACC A CAAGCAGU 
GGAACCAC A AGCAGUAU 
CCACAAGC A GUAUUCUA 
CAGUAUUC U AUGCGGCA 
UAUGCGGC A GAGAUUUC 
GAGAUUUC C AUCGGAUU 
AGAUUUCC A UCGGAUUG 
GAUUGUUC U UUCUUCAU 
GUUCUUUC U UCAUAAAA 
CUUUCUUC A UAAAAGAG 
GAGGAAUC A UUUAUAGG 
UAGGGAUC U GAAGUUAG 
AUAACGUC A UGUUGGAU 
UUGGAUUC A GAAGGACA 
AGAAGGAC A UAUCAAAA 
GACAUAUC A AAAUUGCU 
AAAAUUGC U GACUUUGG 
UUGCUGAC U UUGGGAUG 
GGAUGUGC A AGGAACAC 
CAAGGAAC A CAUGAUGG 
AGGAACAC A UGAUGGAU 
AUGGAGUC A CGACCAGG 
GUCACGAC C AGGACCUU 
UCACGACC A GGACCUUC 
ACCAGGAC C UUCUGUGG 
CCAGGACC U UCUGUGGG 
GGACCUUC U GUGGGACU 
UGUGGGAC U CCAGAUUA 
UGGGACUC C AGAUUAUA 
GGGACUCC A GAUUAUAU 
UAUAUCGC C CCAGAGAU 



CUGAUGAG X CGAA IUGCAGCU 
UGGAAGCA CUGAUGAG X CGAA IAGUGCAG 
CUGGAAGC CUGAUGAG X CGAA IGAGUGCA 
UGUCUGGA CUGAUGAG X CGAA I CAGGAGU 
CACUGUCU CUGAUGAG X CGAA IAAGCAGG 
CCACUGUC CUGAUGAG X CGAA IGAAGCAG 
CGAUCCAC CUGAUGAG X CGAA IUCUGGAA 
CGAAGUAC CUGAUGAG X CGAA ICCGAUCC 
CAUGACGA CUGAUGAG X CGAA IUACAGCC 
AUAUUCCA CUGAUGAG X CGAA IACGAAGU 
CCCACCGU CUGAUGAG X CGAA IACAUAUU 
GUACAUGA CUGAUGAG X CGAA IUCCCCAC 
GGUACAUG CUGAUGAG X CGAA IGUCCCCA 
GUGGUACA CUGAUGAG X CGAA IAGGUCCC 
CUGAAUGU CUGAUGAG X CGAA IUACAUGA 
GCUGAAUG CUGAUGAG X CGAA IGUACAUG 
UUGCUGAA CUGAUGAG X CGAA IUGGUACA 
CUACUUGC CUGAUGAG X CGAA IAAUGUGG 
UUCCUACU CUGAUGAG X CGAA ICUGAAUG 
CUGCUUGU CUGAUGAG X CGAA IUUCCUUA 
ACUGCUUG CUGAUGAG X CGAA IGUUCCUU 
AUACUGCU CUGAUGAG X CGAA IUGGUUCC 
UAGAAUAC CUGAUGAG X CGAA ICUUGUGG 
UGCCGCAU CUGAUGAG X CGAA IAAUACUG 
GAAAUCUC CUGAUGAG X CGAA ICCGCAUA 
AAUCCGAU CUGAUGAG X CGAA IAAAUCUC 
CAAUCCGA CUGAUGAG X CGAA IGAAAUCU 
AUGAAGAA CUGAUGAG X CGAA IAACAAUC 
UUUUAUGA CUGAUGAG X CGAA IAAAGAAC 
CUCUUUUA CUGAUGAG X CGAA IAAGAAAG 
CCUAUAAA CUGAUGAG X CGAA IAUUCCUC 
CUAACUUC CUGAUGAG X CGAA IAUCCCUA 
AUCCAACA CUGAUGAG X CGAA IACGUUAU 
UGUCCUUC CUGAUGAG X CGAA IAAUCCAA 
UUUUGAUA CUGAUGAG X CGAA IUCCUUCU 
AGCAAUUU CUGAUGAG X CGAA IAUAUGUC 
CCAAAGUC CUGAUGAG X CGAA ICAAUUUU 
CAUCCCAA CUGAUGAG X CGAA IUCAGCAA 
GUGUUCCU CUGAUGAG X CGAA ICACAUCC 
CCAUCAUG CUGAUGAG X CGAA IUUCCUUG 
AUCCAUCA CUGAUGAG X CGAA IUGUUCCU 
CCUGGUCG CUGAUGAG X CGAA IACUCCAU 
AAGGUCCU CUGAUGAG X CGAA IUCGUGAC 
GAAGGUCC CUGAUGAG X CGAA IGUCGUGA 
CCACAGAA CUGAUGAG X CGAA IUCCUGGU 
CCCACAGA CUGAUGAG X CGAA IGUCCUGG 
AGUCCCAC CUGAUGAG X CGAA IAAGGUCC 
UAAUCUGG CUGAUGAG X' CGAA IUCCCACA 
UAUAAUCU CUGAUGAG X CGAA IAGUCCCA 
AUAUAAUC CUGAUGAG X CGAA IGAGUCCC 
AUCUCUGG CUGAUGAG X CGAA ICGAUAUA 
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1548 
1560 



1581 
1588 



AUAUCGCC C CAGAGAUA 
UAUCGCCC C AGAGAUAA 
AUCGCCCC A GAGAUAAU 
AUAAUCGC U UAUCAGCC 
CGCUUAUC A GCCGUAUG 
UUAUCAGC C GUAUGGAA 
GGAAAAUC U GUGGACUG" 
CUGUGGAC U GGUGGGCC 
UGGUGGGC C UAUGGCGU 
GGUGGGCC U AUGGCGUC 
AUGGCGUC C UGUUGUAU 
UGGCGUCC U GUUGUAUG 
UGAAAUGC U UGCCGGGC 
AUGCUUGC C GGGCAGCC 
UGCCGGGC A 



UAUCUCUG CUGAUGAG X CGAA IGCGAUAU 

UUAUCUCU CUGAUGAG X CGAA IGGCGAUA 

AUUAUCUC CUGAUGAG X CGAA IGGGCGAU 

GGCUGAUA CUGAUGAG X CGAA ICGAUUAU 

CAUACGGC CUGAUGAG X CGAA IAUAAGCG 

UUCCAUAC CUGAUGAG X CGAA ICUGAUAA 

CAGUCCAC CUGAUGAG X CGAA IAUUUUCC 

GGCCCACC CUGAUGAG X CGAA IUCCACAG 

ACGCCAUA CUGAUGAG X CGAA ICCCACCA 

GACGCCAU CUGAUGAG X CGAA IGCCCACC 

AUACAACA CUGAUGAG X CGAA IACGCCAU 

CAUACAAC CUGAUGAG X CGAA IGACGCCA 

GCCCGGCA CUGAUGAG X CGAA ICAUUUCA 

GGCUGCCC CUGAUGAG X CGAA ICAAGCAU 



AUGGAGGC CUGAUGAG X ( 



i ICCCGGCA 



CGGGCAGC C UCCAl 



CAAAUGGA CUGAUGAG X CGAA ICUGCCCG 



GGGCAGCC U CCAUUUGA 



UCAAAUGG CUGAUGAG X CGAA IGCUGCCC 



GCAGCCUC C AUUUGAUG 



CAUCAAAU CUGAUGAG X CGAA IAGGCUGC 



CAGCCUCC A UUUGAUGG 



CCAUCAAA CUGAUGAG X CGAA IGAGGCUG 



AGACGAGC U AUUUCAGU 



ACUGAAAU CUGAUGAG X CGAA ICUCGUCU 



1670 
1674 
1678 
1685 



GCUAUUUC A GUCUAUCA 



UGAUAGAC CUGAUGAG X CGAA IAAAUAGC 



UUUCAGUC U AUCAUGGA 



UCCAUGAU CUGAUGAG X CGAA IACUGAAA 



AGUCUAUC A UGGAGCAC 



GUGCUCCA CUGAUGAG X CGAA IAUAGACU 



CAUGGAGC A CAACGUUU 



AAACGUUG CUGAUGAG X CGAA ICUCCAUG 



UGGAGCAC A ACGUUUCC 



GGAAACGU CUGAUGAG X CGAA IUGCUCCA 



1695 
1696 



AACGUUUC C UAUCCAAA 



UUUGGAUA CUGAUGAG X CGAA IAAACGUU 



ACGUUUCC U AUCCAAAA 



UUUUGGAU CUGAUGAG X CGAA IGAAACGU 



UUCCUAUC C 



AGGAUUUU CUGAUGAG X CGAA IAUAGGAA 



UCCUAUCC A AAAUCCUU 



AAGGAUUU CUGAUGAG X CGAA IGAUAGGA 



CCAAAAUC C UUGUCCAA 



UUGGACAA CUGAUGAG X CGAA IAUUUUGG 



CAAAAUCC U UGUCCAAG 



CUUGGACA CUGAUGAG X CGAA IGAUUUUG 



UCCUUGUC C AAGGAGGC 



GCCUCCUU CUGAUGAG X CGAA IACAAGGA 



CCUUGUCC A AGGAGGCU 



AGCCUCCU CUGAUGAG X CGAA IGACAAGG 



AAGGAGGC U GUUUCUAU 



AUAGAAAC CUGAUGAG X CGAA ICCUCCUU 



1728 
1732 
1735 



GCUGUUUC U AUCUGCAA 



UUGCAGAU CUGAUGAG X CGAA IAAACAGC 



UUUCUAUC U GCAAAGGA 



UCCUUUGC CUGAUGAG X CGAA IAUAGAAA 



CUAUCUGC A AAGGACUG 



CAGUCCUU CUGAUGAG X CGAA ICAGAUAG 



CAAAGGAC U GAUGACCA 



UGGUCAUC CUGAUGAG X CGAA IUCCUUUG 



1749 
1750 



CUGAUGAC C AAACACCC 



GGGUGUUU CUGAUGAG X CGAA IUCAUCAG 



UGAUGACC A AACACCCA 



UGGGUGUU CUGAUGAG X CGAA IGUCAUCA 



GACCAAAC A CCCAGCCA 



UGGCUGGG CUGAUGAG X CGAA IUUUGGUC 



CCAAACAC C CAGCCAAG 



CUUGGCUG CUGAUGAG X CGAA IUGUUUGG 



CAAACACC C AGCCAAGC 



GCUUGGCU CUGAUGAG X CGAA IGUGUUUG 



1758 
1761 
1762 



AAACACCC A GCCAAGCG 



CGCUUGGC CUGAUGAG X CGAA IGGUGUUU 



CACCCAGC C AAGCGGCU 



AGCCGCUU CUGAUGAG X CGAA ICUGGGUG 



ACCCAGCC A AGCGGCUG 



CAGCCGCU CUGAUGAG X CGAA IGCUGGGU 



1781 
1782 
1808 



CAAGCGGC U GGGCUGUG 
GGCUGGGC U GUGGGCCU 
CUGUGGGC C UGAGGGGG" 
UGUGGGCC U GAGGGGGa" 
GAGAGAGC A UGCCUUCU 



CACAGCCC CUGAUGAG X CGAA 

AGGCCCAC CUGAUGAG X CGAA ICCCAGCC 

CCCCCUCA CUGAUGAG X CGAA ICCCACAG 

UCCCCCUC CUGAUGAG X CGAA IGCCCACA 

AGAAGGCA CUGAUGAG X CGAA ICUCUCUC 
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1813 
1816 
1819 



1858 
1859 
1862 
1863 
1865 
1866 
1870 



1903 
1909 
1915 
1918 
1920 
1928 
1931 
1932 
1936 
1941 
1943 
1944 
1945 
1947 
1952 
1955 
1965 
1969 
1975 
1976 



GAGCAUGC C UUCUUCCG 
AGCAUGCC U UCUUCCGG 
AUGCCUUC U UCCGGAGG 
CCUUCUUC C GGAGGAUC 
GGAUCGAC U GGGAAAAA 
GGAAAAAC U GGAGAACA 
UGGAGAAC A GGGAGAUC 
GGGAGAUC C AGCCACCA 
GGAGAUCC A GCCACCAU 
GAUCCAGC C ACCAUUCA 
AUCCAGCC A CCAUUCAA 
CCAGCCAC C AUUCAAGC 
CAGCCACC A UUCAAGCC 
CACCAUUC A AGCCCAAA 
AUUCAAGC C CAAAGUGU 
UUCAAGCC C AAAGUGUG 
UCAAGCCC A AAGUGUGU 
UGUGUGGC A AAGGAGCA 
AAAGGAGC A GAGAACUU 
CAGAGAAC U UUGACAAG 
ACUUUGAC A AGUUCUUC 
ACAAGUUC U UCACACGA 
AGUUCUUC A CACGAGGA 
UUCUUCAC A CGAGGACA 



2009 
2010 



2023 
2024 
2025 



ACGAGGAC A 
AGGACAGC C CGUCUUAA 
GGACAGCC C GUCUUAAC 
AGCCCGUC U UAACACCA 
GUCUUAAC A CCACCUGA 
CUUAACAC C ACCUGAUC 
UUAACACC A CCUGAUCA 
AACACCAC C UGAUCAGC 
ACACCACC U GAUCAGCU 
ACCUGAUC A GCUGGUUA 
UGAUCAGC U GGUUAUUG 
GUUAUUGC U AACAUAGA 
UUGCUAAC A UAGACCAG 
ACAUAGAC C AGUCUGAU 
CAUAGACC A GUCUGAUU 
GACCAGUC U GAUUUUGA 
AAGGGUUC U CGUAUGUC 
CGUAUGUC A ACCCCCAG 
AUGUCAAC C CCCAGUUU 
UGUCAACC C CCAGUUUG 
GUCAACCC C CAGUUUGU 
UCAACCCC C AGUUUGUG 
CAACCCCC A GUUUGUGC 
GUUUGUGC A CCCCAUCU 
UUGUGCAC C CCAUCUUA 
UGUGCACC C CAUCUUAC 
GUGCACCC C AUCUUACA 



CGGAAGAA CUGAUGAG X CGAA ICAUGCUC 

CCGGAAGA CUGAUGAG X CGAA IGCAUGCU 

CCUCCGGA CUGAUGAG X CGAA IAAGGCAU 

GAUCCUCC CUGAUGAG X CGAA IAAGAAGG 

UUUUUCCC CUGAUGAG X CGAA IUCGAUCC 

UGUUCUCC CUGAUGAG X CGAA IUUUUUCC 

GAUCUCCC CUGAUGAG X CGAA IUUCUCCA 

UGGUGGCU CUGAUGAG X CGAA IAUCUCCC 

AUGGUGGC CUGAUGAG X CGAA IGAUCUCC 

UGAAUGGU CUGAUGAG X CGAA ICUGGAUC 

UUGAAUGG CUGAUGAG X CGAA IGCUGGAU 

GCUUGAAU CUGAUGAG X CGAA IUGGCUGG 

GGCUUGAA CUGAUGAG X CGAA IGUGGCUG 

UUUGGGCU CUGAUGAG X CGAA IAAUGGUG 

ACACUUUG CUGAUGAG X CGAA ICUUGAAU 

CACACUUU CUGAUGAG X CGAA IGCUUGAA 

ACACACUU CUGAUGAG X CGAA IGGCUUGA 

UGCUCCUU CUGAUGAG X CGAA ICCACACA 

AAGUUCUC CUGAUGAG X CGAA ICUCCUUU 

CUUGUCAA CUGAUGAG X CGAA IUUCUCUG 

GAAGAACU CUGAUGAG X CGAA IUCAAAGU 

UCGUGUGA CUGAUGAG X CGAA IAACUUGU 

UCCUCGUG CUGAUGAG X CGAA IAAGAACU 

UGUCCUCG CUGAUGAG X CGAA IUGAAGAA 

AGACGGGC CUGAUGAG X CGAA IUCCUCGU 

UUAAGACG CUGAUGAG X CGAA ICUGUCCU 

GUUAAGAC CUGAUGAG X CGAA IGCUGUCC 



UGGUGUUA CUGAUGAG X C 



. IACGGGCU 



UCAGGUGG CUGAUGAG X CGAA IUUAAGAC 

GAUCAGGU CUGAUGAG X CGAA IUGUUAAG 

UGAUCAGG CUGAUGAG X CGAA IGUGUUAA 

GCUGAUCA CUGAUGAG X CGAA IUGGUGUU 

AGCUGAUC CUGAUGAG X CGAA IGUGGUGU 

UAACCAGC CUGAUGAG X CGAA IAUCAGGU 

CAAUAACC CUGAUGAG X CGAA ICUGAUCA 

UCUAUGUU CUGAUGAG X CGAA ICAAUAAC 

CUGGUCUA CUGAUGAG X CGAA IUUAGCAA 

AUCAGACU CUGAUGAG X CGAA IUCUAUGU 

AAUCAGAC CUGAUGAG X CGAA IGUCUAUG 

UCAAAAUC CUGAUGAG X CGAA IACUGGUC 

GACAUACG CUGAUGAG X CGAA IAACCCUU 

CUGGGGGU CUGAUGAG X CGAA IACAUACG 

AAACUGGG CUGAUGAG X CGAA IUUGACAU 

CAAACUGG CUGAUGAG X CGAA IGUUGACA 

ACAAACUG CUGAUGAG X CGAA IGGUUGAC 

CACAAACU CUGAUGAG X CGAA IGGGUUGA 

GCACAAAC CUGAUGAG X CGAA I GGGGUUG 

AGAUGGGG CUGAUGAG X CGAA ICACAAAC 

UAAGAUGG CUGAUGAG X CGAA IUGCACAA 

GUAAGAUG CUGAUGAG X CGAA IGUGCACA 

UGUAAGAU CUGAUGAG X CGAA IGGUGCAC 
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UGCACCCC A UCUUACAG 



CUGUAAGA CUGAUGAG X CGAA IGGGUGCA 



ACCCCAUC U UACAGAGU 



ACUCUGUA CUGAUGAG X CGAA IAUGGGGU 



CAUCUUAC A GAGUGCAG 



CUGCACUC CUGAUGAG X CGAA IUAAGAUG 



CAGAGUGC A GUAUGAAA 



UUUCAUAC CUGAUGAG X CGAA ICACUCUG 



UAUGAAAC U CACCAGCG 



CGCUGGUG CUGAUGAG X CGAA IUUUCAUA 



UGAAACUC A CCAGCGAG 



CUCGCUGG CUGAUGAG X CGAA IAGUUUCA 



AAACUCAC C AGCGAGAA 



UUCUCGCU CUGAUGAG X CGAA IUGAGUUU 



AACUCACC A GCGAGAAC 



GUUCUCGC CUGAUGAG X CGAA IGUGAGUU 



GCGAGAAC A AACACCUC 



GAGGUGUU CUGAUGAG X CGAA IUUCUCGC 



GAACAAAC A CCUCCCCA 



UGGGGAGG CUGAUGAG X CGAA IUUUGUUC 



ACAAACAC C UCCCCAGC 



GCUGGGGA CUGAUGAG X CGAA IUGUUUGU 



CAAACACC U CCCCAGCC 



GGCUGGGG CUGAUGAG X CGAA IGUGUUUG 



AACACCUC C CCAGCCCC 



GGGGCUGG CUGAUGAG X CGAA IAGGUGUU 



ACACCUCC C CAGCCCCC 



GGGGGCUG CUGAUGAG X CGAA IGAGGUGU 



CACCUCCC C AGCCCCCA 



UGGGGGCU CUGAUGAG X CGAA IGGAGGUG 



ACCUCCCC A GCCCCCAG 



CUGGGGGC CUGAUGAG X CGAA IGGGAGGU 



UCCCCAGC C CCCAGCCC 



GGGCUGGG CUGAUGAG X CGAA I CUGGGGA 



CCCCAGCC C CCAGCCCU 



AGGGCUGG CUGAUGAG X CGAA IGCUGGGG 



CCCAGCCC C CAGCCCUC 



GAGGGCUG CUGAUGAG X CGAA IGGCUGGG 



CCAGCCCC C AGCCCUCC 



GGAGGGCU CUGAUGAG X CGAA IGGGCUGG 



CAGCCCCC A GCCCUCCC 



GGGAGGGC CUGAUGAG X CGAA IGGGGCUG 



CCCCCAGC C CUCCCCGC 



GCGGGGAG CUGAUGAG X CGAA I CUGGGGG 



CCCCAGCC C UCCCCGCA 



UGCGGGGA CUGAUGAG X CGAA IGCUGGGG 



CCCAGCCC U CCCCGCAG 



CUGCGGGG CUGAUGAG X CGAA IGGCUGGG 



CAGCCCUC C CCGCAGUG 



CACUGCGG CUGAUGAG X CGAA IAGGGCUG 



AGCCCUCC C CGCAGUGG 



CCACUGCG CUGAUGAG X CGAA IGAGGGCU 



GCCCUCCC C GCAGUGGA 



UCCACUGC CUGAUGAG X CGAA I GGAGGGC 



CUCCCCGC A GUGGAAGU 



ACUUCCAC CUGAUGAG X CGAA I CGGGGAG 



AGUGAAUC C UUAACCCU 



AGGGUUAA CUGAUGAG X CGAA IAUUCACU 



GUGAAUCC U UAACCCUA 



UAGGGUUA CUGAUGAG X CGAA IGAUUCAC 



UCCUUAAC C CUAAAAUU 



AAUUUUAG CUGAUGAG X CGAA IUUAAGGA 



CCUUAACC C UAAAAUUU 



AAAUUUUA CUGAUGAG X CGAA I GUUAAGG 



CUUAACCC U AAAAUUUU 



AAAAUUUU CUGAUGAG X CGAA I GGUUAAG 



UUUAAGGC C ACGGCUUG 



CAAGCCGU CUGAUGAG X CGAA ICCUUAAA 



UUAAGGCC A CGGCUUGU 



ACAAGCCG CUGAUGAG X CGAA IGCCUUAA 



GCCACGGC U UGUGUCUG 



CAGACACA CUGAUGAG X CGAA ICCGUGGC 



CUUGUGUC U GAUUCCAU 



AUGGAAUC CUGAUGAG X CGAA IACACAAG 



UCUGAUUC C AUAUGGAG 



CUCCAUAU CUGAUGAG X CGAA IAAUCAGA 



CUGAUUCC A UAUGGAGG 



CCUCCAUA CUGAUGAG X CGAA I GAAUCAG 



AUGGAGGC C UGAAAAUU 



AAUUUUCA CUGAUGAG X CGAA ICCUCCAU 



UGGAGGCC U GAAAAUUG 



CAAUUUUC CUGAUGAG X CGAA IGCCUCCA 



UAUUAGUC C AAAUGUGA 



UCACAUUU CUGAUGAG X CGAA IACUAAUA 



AUUAGUCC A AAUGUGAU 



AUCACAUU CUGAUGAG X CGAA IGACUAAU 



AUGUGAUC A ACUGUUCA 



UGAACAGU CUGAUGAG X CGAA IAUCACAU 



UGAUCAAC U GUUCAGGG 



CCCUGAAC CUGAUGAG X CGAA IUUGAUCA 



AACUGUUC A GGGUCUCU 



AGAGACCC CUGAUGAG X CGAA IAACAGUU 



UCAGGGUC U CUCUCUUA 



UAAGAGAG CUGAUGAG X CGAA IACCCUGA 



AGGGUCUC U CUCUUACA 



UGUAAGAG CUGAUGAG X CGAA IAGACCCU 



GGUCUCUC U CUUACAAC 



GUUGUAAG CUGAUGAG X CGAA IAGAGACC 



UCUCUCUC U UACAACCA 



UGGUUGUA CUGAUGAG X CGAA IAGAGAGA 



UCUCUUAC A ACCAAGAA 



UUCUUGGU CUGAUGAG X CGAA IUAAGAGA 
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2223 | CUUACAAC C AAGAACAU | | AUG UUCUU CUGAUGAG X CGAA IUUGUAAG 

2224 UUACAACC A AGAACAUU AAUGUUCU CUGAUGAG X CGAA IGUUGUAA 
2230 CCAAGAAC A UUAUCUUA UAAGAUAA CUGAUGAG X CGAA IUUCUUGG 

"2236 ACAUUAUC U UAGUGGAA UUCCACUA CUGAUGAG X CGAA IAUAAUGU 



Input Sequence = PRKCA. Cut Site = CH/ . 

Stem Length = 8 . Core Sequence = CUGAUGAG X CGAA (X = GCCGUUAGGC or other stem II) 
PRKCA (Homo sapiens protein kinase C, alpha (PRKCA) mRNA. ; 2245 bp) 
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Table 64 



Table 64: Activity of ribozyme core substituted analogues 



A 



RYH/ 


All ribo 1-15.1 
Kobs (min -1 ) 


G-5, A-6, G-8, G-12, 1-15.1 ribo 
2'-0-allyl environment (min"l) 


U-4=ribo U 


U-4=2' -amino U 


U-4=2'-0-alkylU 


GCA 


0.39 


0.10 


0.08 


0.02 


GCC 


0.19 


0.03 


0.01 


0.003 


GCU 


0.028 


0.025 


0.013 


0.002 


B 


RYH/ 


All ribo A-15.1 
KobsOnin-l) 


G-5, A-6, G-8, G-12, A-15.1 ribo 
2'-0-allyl environment Kqos (min'l) 


U-4=ribo U 


U-4=2' -amino U 


U-4=2'-0-alkylU 


GUA 


0.12 


0.06 


0.04 


0.01 


GUC 


0.15 


0.015 


0.014 


0.001 


GUU 


0.04 


0.031 


0.012 


0.008 



Comparison of single turnover cleavage rates for GCH and GUH substrates with T-15.1 
and A-l 5.1 ribozymes and ribozyme analogs. Conditions: Single turnover (250 nM 
substrate, 2.5 uM ribozyme) pH 6.0, 37 C, 1 0 mM Mg ++ 
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Claims: 

We claim: 

5 1 . An enzymatic nucleic acid molecule having formula 4 namely: 



(X)i 




wherein each X, Y, and Z represents independently a nucleotide which may be the 
same or different; 1 is an integer greater than or equal to 3; m is an integer greater than 

10 1 ; n is an integer greater than 1; 0 is an integer greater than or equal to 3; Z' is a 
nucleotide complementary to Z; Y' is a nucleotide complementary to Y; each X(l) and 
X(o) are oligonucleotides which are of sufficient length to stably interact 
independently with a target nucleic acid sequence; W is a linker of > 2 nucleotides; A, 
U, G, and C represent nucleotides; C is 2'-amino; and represents a chemical 

15 linkage. 
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2. An enzymatic nucleic acid molecule having formula 5 namely: 



s- (X) 0 




5 wherein each X, Y, and Z represents independently a nucleotide which may be the 
same or different; 1 is an integer greater than or equal to 3; n is an integer greater than 
1; 0 is an integer greater than or equal to 3; Z' is a nucleotide complementary to Z; 
each X(i) and X( 0 ) are oligonucleotides which are of sufficient length to stably interact 
independently with a target nucleic acid sequence; W is a linker of > 2 nucleotides in 
10 length or may be a non-nucleotide linker; A, U, G, and C represent nucleotides; C is 
2 '-amino; and represents a chemical linkage. 

3. The enzymatic nucleic acid molecule of claims 1 or 2, wherein 1 is selected from 
the group consisting of 4, 5, 6, 7, 8, 9, 10, 11, 12, and 15. 

4. The enzymatic nucleic acid molecule of claim 1, wherein m is selected from the 
15 group consisting of 2, 3, 4, 5, 6, and 7. 

5. The enzymatic nucleic acid molecule of claims 1 or 2, wherein n is selected from 
the group consisting of 2, 3, 4, 5, 6, and 7. 



6. The enzymatic nucleic acid molecule of claims 1 or 2, wherein o is selected from 
the group consisting of 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, and 15. 
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7. The enzymatic nucleic acid molecule of claims 1 or 2, wherein 1 and o are of the 
same length. 

8. The enzymatic nucleic acid molecule of claims 1 or 2, wherein 1 and o are of 
different length. 

5 9. The enzymatic nucleic acid molecule of claims 1 or 2, wherein the target nucleic 
acid sequence is selected from the group consisting of an RNA, DNA and 
RNA/DNA mixed polymer. 

10. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said chemical 
linkage is selected from the group consisting of phosphate ester linkage, amide 

10 linkage, phosphorothioate, and phosphorodithioate. 

1 1 . The enzymatic nucleic acid molecule of claims 1 or 2, wherein said CCDs selected 
from the group consisting of 2'-deoxy-2'-NH 2 and 2'-deoxy-2'-0-NH 2 . 

12. A method for inhibiting expression of a gene in a cell, comprising the step of 
administering to said cell the enzymatic nucleic acid molecule of claims 1 or 2 

15 under conditions suitable for said inhibition. 

13. A method of cleaving a separate RNA molecule comprising, contacting the 
enzymatic nucleic acid molecule of claims 1 or 2 with said separate RNA molecule 
under conditions suitable for the cleavage of said separate RNA molecule. 

14. The method of claim 13, wherein said cleavage is carried out in the presence of a 
20 divalent cation. 

15. The method of claim 14, wherein said divalent cation is Mg 2+ . 

16. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said enzymatic 
nucleic acid molecule is chemically synthesized. 

17. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said enzymatic 
25 nucleic acid molecule comprises at least one ribonucleotide. 

18. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said enzymatic 
nucleic acid molecule comprises no ribonucleotide residues. 

19. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said enzymatic 
nucleic acid molecule comprises at least one 2-amino modification. 
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20. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said enzymatic 
nucleic acid molecule comprises at least three phosphorothioate modifications. 

21. The enzymatic nucleic acid molecule of claim 20, wherein said phosphorothioate 
modification is at the 5 '-end of said enzymatic nucleic acid molecule. 

5 22. The enzymatic nucleic acid molecule of claims 1 or 2, wherein said enzymatic 
nucleic acid molecule comprises a 5'-cap or a 3'-cap or both a 5'-cap and a 3'-cap. 

23. The enzymatic nucleic acid molecule of claim 22, wherein said 5-cap is 
phosphorothioate modification. 

24. The enzymatic nucleic acid molecule of claim 22, wherein said 3'-cap is an 
10 inverted abasic moiety. 

25. A compound having the formula 3 : 




wherein R is independently any nucleoside selected from the group consisting of T-O- 
methyl-2,6-diaminopurine riboside; 2'-deoxy-2'amino-2,6-diaminopurine riboside; 2'- 

15 (N-alanyl) amino-2'-deoxy-uridine; 2'-(A^phenylalanyl)amino-2'-deoxy-uridine; 2'- 
deoxy -2'-(jV-p-alanyl) amino ; 2'-deoxy-2'-(lysiyl) amino uridine; 2'-C-allyl uridine; 
2'-0-amino-uridine; 2'-0-methylthiomethyl adenosine; 2'-0-methylthiomethyl 
cytidine ; 2'-0-methylthiornethyl guanosine; 2'-0-methylthiomethyl-uridine; 2'- 
deoxy-2'-(iV-histidyl) amino uridine; 2'-deoxy-2'-amino-5-methyl cytidine; 2'-(N-$- 

20 carboxamidine-p-alanyl)amino-2'-deoxy-uridine; 2'-deoxy-2'-(N-P-alanyl)-guanosine; 
2'-0-amino-adenosine; 2'-(AT-lysyl)amino-2 , -deoxy-cytidine; 2'-Deoxy -2'-(L- 
histidine) amino Cytidine; 5-Imidazoleacetic acid 2-deoxy uridine, 5-[3-(N-4- 
imidazoleacetyl)aminopropynyl]-2'-0-methyl uridine, 5-(3-aminopropynyl)-2'-0- 
methyl uridine, 5-(3-aminopropyl)-2'-0-methyl uridine, 5-[3-(N-4- 

25 imidazoleacetyl)aminopropyl]-2'-0-methyl uridine, 5-(3-aminopropyl)-2'-deoxy-2- 
fluoro uridine, 2'-Deoxy-2'-(p-alanyl-L-histidyl)amino uridine, 2'-deoxy-2'-P- 
alaninamido-uridine, 3-(2'-deoxy-2'-fluoro-P-D-ribofuranosyl)piperazino[2,3- 
D]pyrimidine-2-one, 5-[3-(N-4-imidazoleacetyl)aminopropyl]-2'-deoxy-2'-fluoro 
uridine, 5-[3-(N-4-imidazoleacetyl)aminopropynyl]-2'-deoxy-2'-fluoro uridine, 5-E- 
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(2-carboxyvinyl-2'-deoxy-2'-fluoro uridine, 5-[3-(N-4-aspartyl)aminopropynyl-2'- 
fluoro uridine, 5-(3-aminopropyl)-2'-deoxy-2-fiuoro cytidine, and 5-[3-(N-4- 
succynyl)aminopropyl-2 ' -deoxy-2-fluoro cytidine. 

26. A process for incorporation of the compounds of claim 25 into an oligonucleotide 
5 comprising the step of contacting said compound with a mixture comprising a 

nucleic acid template, an RNA polymerase enzyme, and an enhancer of modified 
nucleotide triphosphate incorporation, under conditions suitable for the 
incorporation of said compound into said oligonucleotide. 

27. The process of claim 26, wherein said RNA polymerase is a T7 RNA polymerase. 

10 28. The process of claim 26, wherein said RNA polymerase is a mutant T7 RNA 
polymerase. 

29. The process of claim 26, wherein said RNA polymerase is a SP6 RNA polymerase. 

30. The process of claim 26, wherein said RNA polymerase is a mutant SP6 RNA 
polymerase. 

15 31. The process of claim 26, wherein said RNA polymerase is a T3 RNA polymerase. 

32. The process of claim 26, wherein said RNA polymerase is a mutant T3 RNA 
polymerase. 

33. The process of claim 26, wherein said enhancer of modified nucleotide 
triphosphate incorporation is selected from the group consisting of LiCl, methanol, 

20 polyethylene glycol, diethyl ether, propanol, methylamine, and ethanol. 

34. A process for the synthesis of a pyrimidine nucleotide triphosphate comprising the 
steps of: 

a. monophosphorylation, wherein a pyrimidine nucleoside is contacted with a 
mixture comprising a phosphorylating reagent, a trialkyl phosphate and 

25 dimethylaminopyridine, under conditions suitable for the formation of a pyrimidine 
nucleotide monophosphate; and 

b. pyrophosphorylation, wherein said pyrimidine monophosphate from step (a) is 
contacted with a pyrophosphorylating reagent under conditions suitable for the 
formation of said pyrimidine nucleotide triphosphate. 
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35. The process of claim 34, wherein said pyrimidine nucleoside triphosphate is 
uridine triphosphate. 

36. The process of claim 34, wherein said uridine triphosphate has a 2'-sugar 
modification. 

5 37. The process of claim 36, wherein said uridine triphosphate is T-O- 
methylthiomethyl uridine triphosphate. 

38. The process of claim 34, wherein said phosphorylating agent is selected from the 
group consisting of phosphorus oxychloride, phospho-tris-triazolides and phospho- 
tris-triimidazolides. 

10 39. The process of claim 34, wherein said trialkylphosphate is triethyl phosphate. 

40. The process of claim 34, wherein said pyrophosphorylating reagent is tributyl 
ammonium pyrophosphate. 

41. The process of claim 26, wherein said oligonucleotide is RNA. 

42. The process of claim 26, wherein said oligonucleotide is an enzymatic nucleic acid 
15 molecule. 

43. The process of claim 26, wherein said oligonucleotide is an aptamer. 

44. A kit for synthesis of an oligonucleotide comprising an RNA polymerase, an 
enhancer of modified nucleotide triphosphate incorporation and at least one 
compound of claim 25. 

20 45. A kit for synthesis of an oligonucleotide comprising a DNA polymerase, an 
enhancer of modified nucleotide triphosphate incorporation and at least one 
compound of claim 25. 

46. The kit of claim 44, wherein said RNA polymerase is a bacteriophage T7 RNA 
polymerase. 

25 47. The kit of claim 44, wherein said RNA polymerase is a bacteriophage SP6 RNA 
polymerase. 

48. The kit of claim 44, wherein said RNA polymerase is a bacteriophage T3 RNA 
polymerase. 
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49. The kit of claim 44, wherein said RNA polymerase is a mutant T7 RNA 
polymerase. 

50. The kit of claim 44 or 45, wherein said kit comprises at least two different 
compounds of claim 25. 

5 5 1 . A nucleic acid catalyst comprising a histidyl modification, wherein said nucleic 
acid catalyst is able to catalyze an endonuclease reaction in the absence of a metal 
ion co-factor. 

52. The nucleic acid catalyst of claim 51, wherein said catalyst is able to cleave a 
separate nucleic acid molecule. 

10 53. The nucleic acid catalyst of claim 52, wherein said separate nucleic acid molecule 
is an RNA molecule. 

54. The nucleic acid catalyst of claim 52, wherein said separate nucleic acid molecule 
is a DNA molecule. 

55. The nucleic acid catalyst of claim 51, wherein said nucleic acid catalyst comprises 
15 at least one ribonucleotide. 

56. The enzymatic nucleic acid molecule of claim 2, wherein said nucleic acid 
molecule has an endonuclease activity to cleave RNA of HER2 gene. 

57. The enzymatic nucleic acid molecule of claim 56, wherein said nucleic acid 
molecule comprises sequences complementary to any of substrate sequences 

20 defined as Target sequence in Tables 58, 59 and 62. 

58. The enzymatic nucleic acid molecule of claim 56, wherein said nucleic acid 
molecule comprises any of ribozyme sequences defined as Ribozyme sequence in 
Tables 58, 59 and 62, 

59. A method for treating cancer using the enzymatic nucleic acid molecule of claim 
25 56. 

60. The method of claim 59, wherein said cancer is breast cancer. 

61. A method for treating conditions associated with the level of HER2 gene using the 
enzymatic nucleic acid molecule of claim 56. 



WO 01/16312 



658 



PCT/US00/23998 



62. The enzymatic nucleic acid molecule of claim 56, wherein said enzymatic nucleic 
acid molecule comprises a substrate binding region which has between 5 and 30 
nucleotides complementary to the RNA. 

63. The enzymatic nucleic acid molecule of claim 56, wherein said enzymatic nucleic 
5 acid molecule comprises a substrate binding region which has between 7 and 12 

nucleotides complementary to the RNA. 

64. A mammalian cell including the enzymatic nucleic acid molecule of claim 56. 

65. The mammalian cell of claim 64, wherein said mammalian cell is a human cell. 

66. A mammalian cell including the enzymatic nucleic acid molecule of claims 1 or 2. 
10 67. The mammalian cell of claim 66, wherein said mammalian cell is a human cell. 

68. A method for inhibiting expression of HER2 gene in a cell, comprising the step of 
administering to said cell the enzymatic nucleic acid molecule of claim 56 under 
conditions suitable for said inhibition. 

69. A method of cleaving RNA derived from HER2 gene comprising, contacting the 
15 enzymatic nucleic acid molecule of claim 56 with said RNA molecule under 

conditions suitable for the cleavage of said RNA molecule. 

70. A pharmaceutical composition comprising the enzymatic nucleic acid molecule of 
any of claims 1 or 2. 

71. A pharmaceutical composition comprising the enzymatic nucleic acid molecule of 
20 claim 56. 

72. A method of treatment of a patient having a condition associated with the level of 
HER2, wherein said patient is administered the enzymatic nucleic acid molecule of 
claim 56 under conditions suitable for said treatment. 

73. The method of claim 72, wherein said method is performed in conjunction with 
25 one or more other therapies. 

74. The method of claim 59, wherein said enzymatic nucleic acid molecule is used in 
conjunction with one or more other therapies. 

75. The enzymatic nucleic acid molecule of claim 56, wherein said enzymatic nucleic 
acid molecule comprises at least one sugar modification. 
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76. The enzymatic nucleic acid molecule of claim 56, wherein said enzymatic nucleic 
acid molecule comprises at least one nucleic acid hase modification. 

77. The enzymatic nucleic acid molecule of claim 56, wherein said enzymatic nucleic 
acid molecule comprises at least one phosphate backbone modification. 

5 78. The enzymatic nucleic acid molecule of claim 56, wherein said phosphate 
backbone modification is selected from the group consisting of phosphorothioate, 
phosphorodithioate and amide. 

79. An enzymatic nucleic acid molecule which down regulates expression of genes 
selected from the group consisting of beta site APP-cleaving enzyme (BACE) and 

10 telomerase reverse transciptase (TERT) genes. 

80. The enzymatic nucleic acid molecule of claim 79, wherein said gene is the beta site 
APP-cleaving enzyme (BACE). 

81. The enzymatic nucleic acid molecule of claim 79, wherein said gene is the 
telomerase reverse transcriptase (TERT). 

15 82. A nucleic acid molecule which down regulates expression of genes selected from 
the group consisting of protein-tyrosine phosphatase- IB (PTP-1B), methionine 
aminopeptidase (MetAP-2), hepatitis B virus (HBV), phospholamban (PLN), and 
presenilin (ps-2) genes. 

83. The nucleic acid molecule of claim 82, wherein said nucleic acid molecule is an 
20 enzymatic nucleic acid molecule. 

84. The nucleic acid molecule of claim 82, wherein said nucleic acid molecule is an 
antisense nucleic acid molecule. 

85. The nucleic acid molecule of any of claims 82-84, wherein said gene is the protein- 
tyrosine phosphatase- IB (PTP-1B). 

25 86. The nucleic acid molecule of any of claims 82-84, wherein said gene is the 
methionine aminopeptidase (MetAP-2). 

87. The nucleic acid molecule of any of claims 82-84, wherein said gene is the 
hepatitis B virus (HBV). 
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88. The nucleic acid molecule of any of claims 82-84, wherein said gene is the 
phospholamban (PLN). 

89. The nucleic acid molecule of any of claims 82-84, wherein said gene is the 
presenilin (ps-2). 

5 90. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is adapted for use to treat diseases and conditions 
related to the expression of genes selected from the group consisting of beta site 
APP-cleaving enzyme (BACE), telomerase reverse transciptase (TERT), protein- 
tyrosine phosphatase- IB (PTP-1B), methionine aminopeptidase (MetAP-2), 
10 hepatitis B virus (HBV), phospholamban (PLN), and presenilin (ps-2) genes. 

91. The nucleic acid molecule of claim 82, wherein said nucleic acid molecule is 
adapted for use to treat diseases and conditions related to the expression of genes 
selected from the group consisting of protein-tyrosine phosphatase- IB (PTP-1B), 
methionine aminopeptidase (MetAP-2), hepatitis B virus (HBV), phospholamban 

15 (PLN), and presenilin (ps-2) genes. 

92. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
encoded by said beta site APP-cleaving enzyme (BACE), telomerase reverse 
transciptase (TERT), protein-tyrosine phosphatase- IB (PTP-1B), methionine 

20 aminopeptidase (MetAP-2), hepatitis B virus (HBV), phospholamban (PLN), and 

presenilin (ps-2) genes. 

93. The enzymatic nucleic acid of any of claims 79 or 83, wherein a binding arm of 
said enzymatic nucleic acid molecule comprise sequences complementary to any 
of the sequences defined as Target or Substrate sequence in Tables 3-30, and 36- 

25 43. 

94. The enzymatic nucleic acid molecule of any of claims 79 or 83 wherein said 
enzymatic nucleic acid molecule comprises any of the sequences defined as 
Ribozyme or DNAzyme sequence in Tables 3-29, and 37-43. 

95. The nucleic acid molecule of claim 84, wherein said antisense nucleic acid 
30 molecule comprises sequence complementary to any of the sequences defined as 

Target or Substrate sequence in Tables 3-12, 24-30, and 36-43. 
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96. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is in a hammerhead (HH) motif. 

97. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is in a zinzyme (Class II) motif. 

5 98. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is in a amberzyme (Class 1) motif. 

99. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is in a hairpin, hepatitis Delta virus, group I 
intron, VS nucleic acid, or RNAse P nucleic acid motif. 

10 100. The enzymatic nucleic acid molecule of claim 97, wherein said zinzyme motif 
comprises sequences complementary to any of the substrate sequences shown in 
Tables 21, 27 and 40. 

101. The enzymatic nucleic acid molecule of claim 98, wherein said amberzyme 
motif comprises sequences complementary to any of the substrate sequences 

15 shown in Tables 23, 29, and 42. 

102. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is in a NCH motif. 

103. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is in a G-cleaver motif. 

20 104. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule is a DNAzyme. 

105. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule comprises between 12 and 100 bases 
complementary to the RNA of genes selected from the group consisting of beta site 

25 " APP-cleaving enzyme (BACE), telomerase reverse transciptase (TERT), protein- 
tyrosine phosphatase- IB (PTP-1B), methionine aminopeptidase (MetAP-2), 
hepatitis B virus (HBV), phospholamban (PLN), and presenilin (ps-2) genes. 

106. The enzymatic nucleic acid of any of claims 79 or 83, wherein said enzymatic 
nucleic acid molecule comprises between 14 and 24 bases complementary to the 

30 RNA of genes selected from the group consisting of beta site APP-cleaving 

enzyme (BACE), telomerase reverse transciptase (TERT), protein-tyrosine 
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phosphatase- IB (PTP-1B), methionine aminopeptidase (MetAP-2), hepatitis B 
virus (HBV), phospholamban (PLN), and presenilin (ps-2) genes. 

107. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid is chemically synthesized. 

5 108. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid comprises at least one 2'-sugar modification. 

109. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid comprises at least one nucleic acid base modification. 

110. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
10 enzymatic nucleic acid comprises at least one phosphate backbone modification. 

111. A mammalian cell including the enzymatic nucleic acid molecule of any of 
claims 79 or 83, wherein said mammalian cell is not a living human. 

112. The mammalian cell of claim 111, wherein said mammalian cell is a human 
cell. 

15 113. The antisense nucleic acid molecule of claim 84, wherein said antisense nucleic 
acid is chemically synthesized. 

114. The antisense nucleic acid molecule of claim 84, wherein said antisense nucleic 
acid comprises at least one 2' -sugar modification. 

115. The antisense nucleic acid molecule of claim 84, wherein said antisense nucleic 
20 acid comprises at least one nucleic acid base modification. 

116. The antisense nucleic acid molecule of claim 84, wherein said antisense nucleic 
acid comprises at least one phosphate backbone modification. 

117. A mammalian cell including the antisense nucleic acid molecule of claim 84, 
wherein said mammalian cell is not a living human. 

25 118. The mammalian cell of claim 117, wherein said mammalian cell is a human 
cell. 

119. A method of reducing BACE activity in a cell, comprising the step of 
contacting said cell with the enzymatic nucleic acid molecule of claim 80, under 
conditions suitable for said inhibition. 
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120. A method of reducing TERT activity in a cell, comprising the step of 
contacting said cell with the enzymatic nucleic acid molecule of claim 81, under 
conditions suitable for said inhibition. 

121. A method of reducing PTP-1B activity in a cell, comprising the step of 
5 contacting said cell with the nucleic acid molecule of claim 85, under conditions 

suitable for said inhibition. 

122. A method of reducing MetAP-2 activity in a cell, comprising the step of 
contacting said cell with the nucleic acid molecule of claim 86, under conditions 
suitable for said inhibition. 

10 123. A method of reducing HBV activity in a cell, comprising the step of contacting 
said cell with the nucleic acid molecule of claim 87, under conditions suitable for 
said inhibition. 

124. A method of reducing phospholamban (PLN) activity in a cell, comprising the 
step of contacting said cell with the nucleic acid molecule of claim 88, under 

15 conditions suitable for said inhibition. 

125. A method of reducing presenilin-2 (ps-2) activity in a cell, comprising the step 
of contacting said cell with the nucleic acid molecule of claim 89, under conditions 
suitable for said inhibition. 

126. A method of treatment of a patient having a condition associated with the level 
20 of BACE, comprising contacting cells of said patient with the enzymatic nucleic 

acid molecule of claim 80, under conditions suitable for said treatment. 

127. A method of treatment of a patient having a condition associated with the level 
of TERT, comprising contacting cells of said patient with the enzymatic nucleic 
acid molecule of claim 81, under conditions suitable for said treatment. 

25 128. A method of treatment of a patient having a condition associated with the level 
of PTP-1B, comprising contacting cells of said patient with the nucleic acid 
molecule of claim 85, under conditions suitable for said treatment. 

129. A method of treatment of a patient having a condition associated with the level 
of MetAP-2, comprising contacting cells of said patient with the nucleic acid 
30 molecule of claim 86, under conditions suitable for said treatment. 
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130. A method of treatment of a patient having a condition associated with the level 
of HBV, comprising contacting cells of said patient with the nucleic acid molecule 
of claim 87, under conditions suitable for said treatment. 

131. A method of treatment of a patient having a condition associated with the level 
5 of phospholamban (PLN), comprising contacting cells of said patient with the 

nucleic acid molecule of claim 88, under conditions suitable for said treatment. 

132. A method of treatment of a patient having a condition associated with the level 
of presenilin-2 (ps-2), comprising contacting cells of said patient with the nucleic 
acid molecule of claim 89, under conditions suitable for said treatment. 

10 133. The method of any of claims 126-132 further comprising the use of one or 
more drug therapies under conditions suitable for said treatment. 

134. A method of cleaving RNA of BACE gene, comprising, contacting the 
enzymatic nucleic acid molecule of claim 80, with said RNA under conditions 
suitable for the cleavage of said RNA. 

15 135. A method of cleaving RNA of TERT gene, comprising, contacting the 
enzymatic nucleic acid molecule of claim 81, with said RNA under conditions 
suitable for the cleavage of said RNA. 

136. A method of cleaving RNA of PTP-1B gene, comprising, contacting the 
enzymatic nucleic acid molecule of claim 85, with said RNA under conditions 

20 suitable for the cleavage of said RNA. 

137. A method of cleaving RNA of MetAP-2 gene, comprising, contacting the 
enzymatic nucleic acid molecule of claim 86, with said RNA under conditions 
suitable for the cleavage of said RNA. 

138. A method of cleaving RNA of HBV gene, comprising, contacting the 
25 enzymatic nucleic acid molecule of claim 87, with said RNA under conditions 

suitable for the cleavage of said RNA. 

139. A method of cleaving RNA of phospholamban (PLN) gene, comprising, 
contacting the enzymatic nucleic acid molecule of claim 88, with said RNA under 
conditions suitable for the cleavage of said RNA. 
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140. A method of cleaving RNA of presenilin-2 (ps-2) gene, comprising, contacting 
the enzymatic nucleic acid molecule of claim 89, with said RNA under conditions 
suitable for the cleavage of said RNA. 

141. The method of any of claims 134-140, wherein said cleavage is carried out in 
5 the presence of a divalent cation. 

142. The method of claim 141, wherein said divalent cation is Mg2+. 

143. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid comprises a cap structure, wherein the cap structure is at 
the 5'-end or 3'-end or both the 5'-end and the 3'-end. 

10 144. The antisense nucleic acid molecule of claim 84, wherein said antisense nucleic 
acid comprises a cap structure, wherein the cap structure is at the 5'-end or 3'-end 
or both the 5 '-end and the 3 '-end. 

145. The enzymatic nucleic acid molecule of claim 96, wherein said hammerhead 
motif comprises sequences complementary to any of sequences defined as Target 

15 or Substrate sequences in Tables 3, 9, 13, 18, 24, and 37. 

146. The enzymatic nucleic acid molecule of claim 102, wherein said NCH motif 
comprises sequences complementary to any of sequences defined as Target or 
Substrate sequences in Tables 4, 10, 14, 19, 25, and 38. 

147. The enzymatic nucleic acid molecule of claim 103, wherein said G-cleaver 
20 motif comprises sequences complementary to any of sequences defined as Target 

or Substrate sequences in Tables 5, 11, 15, 20, 26, and 39. 

148. The enzymatic nucleic acid molecule of claim 104, wherein said DNAzyme 
comprises sequences complementary to any of sequences defined as Target or 
Substrate sequences in Tables 6, 16, 22, 28, and 41. 

25 1 49. The method of any of claims 119-125 or 133, wherein said enzymatic nucleic 
acid molecule is in a hammerhead motif. 

150. The method of any of claims 119-125 or 133, wherein said nucleic acid 
molecule is a DNAzyme. 
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151. An expression vector comprising nucleic acid sequence encoding at least one 
enzymatic nucleic acid molecule of any of claims 79 or 83, in a manner which 
allows expression of that enzymatic nucleic acid molecule. 

152. An expression vector comprising nucleic acid sequence encoding at least one 
5 antisense nucleic acid molecule of claim 84, in a manner which allows expression 

of that antisense nucleic acid molecule. 

153. A mammalian cell including an expression vector of any of claims 151 or 152, 
wherein said mammalian cell is not a living human. 

154. The mammalian cell of claim 153, wherein said mammalian cell is a human 
10 cell. 

155. The expression vector of claim 151, wherein said enzymatic nucleic acid 
molecule is in a hammerhead motif. 

156. The expression vector of claim 151, wherein said expression vector further 
comprises a sequence for an antisense nucleic acid molecule complementary to the 

15 RNA of genes selected from the group consisting of beta site APP-cleaving 

enzyme (BACE), telomerase reverse transciptase (TERT), protein-tyrosine 
phosphatase- IB (PTP-1B), methionine aminopeptidase (MetAP-2), hepatitis B 
virus (HBV), phospholamban (PLN), and presenilin (ps-2) genes. 

157. The expression vector of claim 151, wherein said expression vector comprises 
20 sequence encoding at least two said enzymatic nucleic acid molecules, which may 

be same or different. 

158. The expression vector of claim 157, wherein one said expression vector further 
comprises sequence encoding antisense nucleic acid molecule complementary to 
the RNA of genes selected from the group consisting of beta site APP-cleaving 

25 enzyme (BACE), telomerase reverse transciptase (TERT), protein-tyrosine 

phosphatase- IB (PTP-1B), methionine aminopeptidase (MetAP-2), hepatitis B 
virus (HBV), phospholamban (PLN), and presenilin (ps-2) genes. 

159. A method for treatment of Alzheimer's disease comprising the step of 
administering to a patient the enzymatic nucleic acid molecule of claim 80 under 

30 conditions suitable for said treatment. 
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160. The method of claim 159, wherein said treatment of Alzheimer's disease is 
treatment of dementia. 

161. A method for treatment of Alzheimer's disease comprising the step of 
administering to a patient the antisense nucleic acid molecule of claim 89 under 

5 conditions suitable for said treatment. 

162. A method for treatment of diabetes comprising the step of administering to a 
patient the nucleic acid molecule of claim 85 under conditions suitable for said 
treatment. 

163. The method of claim 162, wherein said diabetes is type I diabetes. 
10 164. The method of claim 162, wherein said diabetes is type II diabetes. 

165. A method for treatment of diabetes comprising the step of administering to a 
patient the antisense nucleic acid molecule of claim 85 under conditions suitable 
for said treatment. 

166. A method for treatment of obesity comprising the step of administering to a 
15 patient the nucleic acid molecule of claim 85 under conditions suitable for said 

treatment. 

167. A method for treatment of obesity comprising the step of administering to a 
patient the antisense nucleic acid molecule of claim 85 under conditions suitable 
for said treatment. 

20 1 68. A method for treatment of heart disease comprising the step of administering to 
a patient the nucleic acid molecule of claim 88 under conditions suitable for said 
treatment. 

169. The method of claim 168, wherein said heart disease is heart failure. 

170. The method of claim 168, wherein said heart disease is congestive heart failure. 

25 171. A method for treatment of pressure overload hypertrophy, or dilated 
cardiomyopathy, or both, comprising the step of administering to a patient the 
nucleic acid molecule of claim 88 under conditions suitable for said treatment. 
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172. A method for treatment of cancer comprising the step of administering to a 
patient the nucleic acid molecule of claim 86 under conditions suitable for said 
treatment. 

173. A method for treatment of hepatitis comprising the step of administering to a 
5 patient the nucleic acid molecule of claim 87 under conditions suitable for said 

treatment. 

174. A method for treatment of hepatocellular carcinoma comprising the step of 
administering to a patient the nucleic acid molecule of claim 87 under conditions 
suitable for said treatment. 

10 175. The method of claim 159, wherein said enzymatic nucleic acid molecule is in a 
hammerhead motif. 

176. The method of claim 159, wherein said method further comprises 
administering to said patient the enzymatic nucleic acid molecule in conjunction 
with one or more of other therapies. 

15 177. The method of any of claims 162, 165-168, or 171-174, wherein said nucleic 
acid molecule is an enzymatic nucleic acid molecule. 

178. The method of any of claims 162, 166-168, or 171-174, wherein said nucleic 
acid molecule is an antisense nucleic acid molecule. 

179. The method of any of claims 162, 165-168, or 171-174, wherein said method 
20 further comprises administering to said patient the nucleic acid molecule in 

conjunction with one or more of other therapies. 

180. The enzymatic nucleic acid molecule of any of claims 79 or 83, wherein said 
enzymatic nucleic acid molecule comprises at least five ribose residues; at least ten 
2'-0-methyl modifications, and a 3'- end modification. 

25 181. The enzymatic nucleic acid molecule of claim 180, wherein said enzymatic 
nucleic acid molecule further comprises phosphorothioate linkages on at least three 
of the 5' terminal nucleotides. 

182. The enzymatic nucleic acid molecule of claim 180, wherein said 3'- end 
modification is 3 '-3' inverted abasic moiety. 
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183. The enzymatic nucleic acid molecule of claim 104, wherein said DNAzyme 
comprises at least ten 2'-0-methyl modifications and a 3 '-end modification. 

184. The enzymatic nucleic acid molecule of claim 183, wherein said DNAzyme 
further comprises phosphorothioate linkages on at least three of the 5' terminal 
nucleotides. 

185. The enzymatic nucleic acid molecule of claim 183, wherein said 3'- end 
modification is 3 '-3' inverted abasic moiety. 

186. An enzymatic nucleic acid molecule having formula 1 : 



^ -G-A-A-I D -3> 

L 

^(N) n~ (N)p— A— G— N — A — G U C E 5 ' 

wherein N represents independently a nucleotide or a non-nucleotide linker, which 
may be same or different; D and E are independently oligonucleotides of length 
sufficient to stably interact with a target RNA molecule; o and n are integers 
independently greater than or equal to 1, wherein if (N)o and (N)n are nucleotides, 
(N)o and (N)n are optionally able to interact by hydrogen bond interaction; • indicates 
base-paired interaction; L is a linker which may be present or absent, but when 
present, is a nucleotide linker, a non-nucleotide linker, or a combination of nucleotide 
and a non-nucleotide linker; p is an integer 0 or 1 ; represents a chemical 

linkage; and A, U, I, C and G represent adenosine, uridine, inosine, cytidine and 
guanosine nucleotides, respectively. 

187. An enzymatic nucleic acid molecule having formula 2 : 

^ C — G-A-A-I D- 3' 

L 

^ (N) n G-(N) P -A- G- N- A- G~ U- C~ E - 5' 

wherein N represents independently a nucleotide or a non-nucleotide linker, which 
may be same or different; D and E are independently oligonucleotides of length 
sufficient to stably interact with a target RNA molecule; o and n are integers 
independently greater than or equal to 0, wherein if (N)o and (N)n are nucleotides, 
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(N)o and (N)n are optionally able to interact by hydrogen bond interaction; • indicates 
base-paired interaction; L is a linker which may be present or absent, but when 
present, is a nucleotide linker, a non-nucleotide linker, or a combination of nucleotide 
and a non-nucleotide linker; p is an integer 0 or 1; represents a chemical 

5 linkage; and A, U, I, C and G represent adenosine, uridine, inosine, cytidine and 
guanosine nucleotides, respectively. 

188. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said D and 
E are independently of length selected from the group consisting of 3, 4, 5, 6, 7, 8, 

9, 10, 11, 12, 13, 14, 15, 17, and 20 nucleotides. 

10 189. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said D and 
E are of the same length. 

190. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said D and 
E are of different length. 

191. The enzymatic nucleic acid molecule of claim 186, wherein said o and n are 
15 independently integers selected from the group consisting of 2, 3, 4, 5, 6, 7, 8, 9, 

10, 15,20, 25, 30, 35, and 50. 

192. The enzymatic nucleic acid molecule of claim 187, wherein said o and n are 
independently integers selected from the group consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 
10, 15, 20, 25, 30, 35, and 50. 

20 193. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said (N)o 
and (N)n comprise nucleotides that are complementary to each other. 

194. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said (N)o 
and (N)n are of the same length. 

195. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said (N)o 
25 and (N)n are of different length. 

196. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said L is a 
nucleotide linker. 

197. The enzymatic nucleic acid molecule of claim 196, wherein said nucleotide 
linker is of length between 3-50 nucleotides. 
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198. The enzymatic nucleic acid molecule of claim 196, wherein said nucleotide 
linker is an aptamer. 

199. The enzymatic nucleic acid molecule of claim 196 wherein said nucleotide 
linker is selected from the group consisting of 5'-GAAA-3' and 5'-GUUA-3'. 

5 200. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said L is a 
non-nucleotide linker. 

201. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
chemical linkage is independently or in combination selected from the group 
consisting of phosphate ester linkage, amide linkage, phosphorothioate, arabino, 

10 arabinofluoro, and phosphorodithioate. 

202. The enzymatic nucleic acid molecule of claims 1 86 or 187, wherein said p is 1 . 

203. The enzymatic nucleic acid molecule of claim 202, wherein said N of (N)p is 
independently selected from the group consisting of adenosine, uridine, and 
cytidine. 

15 204. The enzymatic nucleic acid molecule of claims 186 or 187 wherein said 
enzymatic nucleic acid molecule is chemically synthesized. 

205. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
enzymatic nucleic acid molecule comprises at least three ribonucleotide residues. 

206. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
20 enzymatic nucleic acid molecule comprises at least four ribonucleotide residues. 

207. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
enzymatic nucleic acid molecule comprises at least five ribonucleotide residues. 

208. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said I is 
selected from the group consisting of ribo-inosine and xylo-inosine. 

25 209. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
enzymatic nucleic acid molecule comprises at least one sugar modification. 

210. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
enzymatic nucleic acid molecule comprises at least nucleic acid base modification. 



WO 01/16312 



672 



PCT/US00/23998 



211. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
enzymatic nucleic acid molecule comprises at least one phosphate backbone 
modification. 

212. The enzymatic nucleic acid molecule of claim 209, wherein said sugar 
5 modification is selected from the group consisting of 2'-H, 2'-0-methyl, 2'-0- 

allyl, and 2'-deoxy-2'-amino. 

213. The enzymatic nucleic acid molecule of claim 211, wherein said phosphate 
backbone modification is selected from the group consisting of phosphorothioate, 
phosphorodithioate and amide. 

10 214. The enzymatic nucleic acid molecule of claims 186 or 187 wherein said 
enzymatic nucleic acid molecule comprises a 5'-cap or a 3'-cap or both a 5'-cap 
and a 3 '-cap. 

215. The enzymatic nucleic acid molecule of claim 214, wherein said 5 '-cap is a 
phosphorothioate modification of at least one 5'-terminal nucleotide in said 

15 enzymatic nucleic acid molecule. 

216. The enzymatic nucleic acid molecule of claim 214, wherein said 5'-cap is a 
phosphorothioate modification of at least two 5 '-terminal nucleotide in said 
enzymatic nucleic acid molecule. 

217. The enzymatic nucleic acid molecule of claim 214, wherein said 5'-cap is a 
20 phosphorothioate modification of at least three 5 '-terminal nucleotide in said 

enzymatic nucleic acid molecule. 

218. The enzymatic nucleic acid molecule of claim 214, wherein said 3'-cap is a 3'- 
3' inverted abasic moiety. 

219. The enzymatic nucleic acid molecule of claim 214, wherein said 3'-cap is a 3'- 
25 3 ' inverted nucleotide moiety. 

220. A method for inhibiting expression of a gene in a cell, comprising the step of 
administering to said cell the enzymatic nucleic acid molecule of claims 186 or 187 
under conditions suitable for said inhibition. 

221. A method of cleaving a separate RNA molecule comprising, contacting the 
30 enzymatic nucleic acid molecule of claims 186 or 187 with said separate RNA 
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molecule under conditions suitable for the cleavage of said separate RNA 
molecule. 

222. The method of claim 221, wherein said cleavage is carried out in the presence 
of a divalent cation. 

5 223. The method of claim 222, wherein said divalent cation is Mg2+. 

224. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
derived from HER2 gene. 

225. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
10 nucleic acid molecule comprises sequences complementary to any of NCH 

substrate sequence of Table 34. 

226. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule comprises any of the NCH ribozyme sequences shown in 
Table 34. 

15 227. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule is used to treat cancer. 

228. The enzymatic nucleic acid molecule of claim 224, wherein said cancer is 
breast cancer. 

229. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
20 nucleic acid molecule is used to treat conditions associated with the level of HER2 

gene. 

230. An enzymatic nucleic acid molecule, wherein said enzymatic nucleic acid 
molecule comprises any of sequence shown as NCH ribozyme sequence in Table 
31. 

25 231. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule comprises a substrate binding region which has between 5 
and 30 nucleotides complementary to the RNA. 

232. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule comprises a substrate binding region which has between 7 
30 and 12 nucleotides complementary to the RNA. 
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233. A mammalian cell including the enzymatic nucleic acid molecule of claim 224, 
wherein said mammalian cell is not a living human. 

234. The mammalian cell of claim 233, wherein said mammalian cell is a human 
cell. 

235. A mammalian cell including the enzymatic nucleic acid molecule of claims 1 86 
or 187, wherein said mammalian cell is not a living human. 

236. The mammalian cell of claim 235, wherein said mammalian cell is a human 
cell. 

237. A method for inhibiting expression of HER2 gene in a cell, comprising the step 
of administering to said cell the enzymatic nucleic acid molecule of claim 224 
under conditions suitable for said inhibition. 

238. A method of cleaving RNA derived from HER2 gene comprising, contacting 
the enzymatic nucleic acid molecule of claim 224 with said RNA molecule under 
conditions suitable for the cleavage of said RNA molecule. 

239. A pharmaceutical composition comprising the enzymatic nucleic acid molecule 
of any of claims 186 or 187. 

240. A pharmaceutical composition comprising the enzymatic nucleic acid molecule 
of claim 224. 

241 . A method of treatment of a patient having a condition associated with the level 
of HER2, wherein said patient is administered the enzymatic nucleic acid molecule 
of claim 224 under conditions suitable for said treatment. 

242. The method of claim 241, wherein said method is performed in conjunction 
with one or more other therapies. 

243. The enzymatic nucleic acid molecule of claim 227, wherein said enzymatic 
nucleic acid molecule is used in conjunction with one or more other therapies. 

244. The enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
nucleic acid molecule comprises at least five ribose residues; a 2'-C-allyl 
modification at position No. 4 of said enzymatic nucleic acid; at least ten 2'-0- 
alkyl modifications, and a 3'- cap structure. 
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245. The enzymatic nucleic acid molecule of claim 244, wherein said 2'-0-alkyl 
modifications is selected from the group consisting of 2'-0-methyl and 2'-0-allyl. 

246. The enzymatic nucleic acid molecule of claim 244, wherein said 3'-cap is 3'-3' 
inverted abasic moiety. 

5 247. The enzymatic nucleic acid molecule of claim 244, wherein said 3'-cap is 3'-3' 
inverted nucleotide. 

248. The enzymatic nucleic acid molecule of claim 244, wherein said enzymatic 
nucleic acid comprises phosphorothioate linkages in at least three of the 5' terminal 
nucleotides. 

10 249. The . enzymatic nucleic acid molecule of claims 186 or 187, wherein said 
nucleic acid molecule comprises at least five ribose residues; a 2'-deoxy-2' -amino 
modification at position Nos. 4 and 7 of said enzymatic nucleic acid; at least ten 2'- 
O-alkyl modifications, and a 3'- cap structure. 

250. The enzymatic nucleic acid molecule of claim 249, wherein said 2'-0-alkyl 
15 modifications is selected from the group consisting of 2'-0-methyl and 2'-0-allyl. 

251. The enzymatic nucleic acid molecule of claim 249, wherein said 3 '-cap is 3 '-3' 
inverted abasic moiety. 

252. The enzymatic nucleic acid molecule of claim 249, wherein said 3 '-cap is 3 '-3' 
inverted nucleotide. 

20 253. The enzymatic nucleic acid molecule of claim 249, wherein said enzymatic 
nucleic acid comprises phosphorothioate linkages in at least three of the 5' terminal 
nucleotides. 

254. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule comprises at least one sugar modification. 

25 255. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule comprises at least one nucleic acid base modification. 

256. The enzymatic nucleic acid molecule of claim 224, wherein said enzymatic 
nucleic acid molecule comprises at least one phosphate backbone modification. 
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257. The enzymatic nucleic acid molecule of claim 224, wherein said phosphate 
backbone modification is selected from the group consisting of phosphorothioate, 
phosphorodithioate and amide. 

258. The enzymatic nucleic acid molecule of claim 224, wherein said nucleic acid 
5 molecule comprises at least five ribose residues; a 2'-C-allyl modification at 

position No. 4 of said enzymatic nucleic acid; at least ten 2'-0-alkyl modifications, 
and a 3'- cap structure. 

259. The enzymatic nucleic acid molecule of claim 258, wherein said 2'-0-alkyl 
modifications is selected from the group consisting of 2'-0-methyl and 2'-0-allyl. 

10 260. The enzymatic nucleic acid molecule of claim 258, wherein said 3'-cap is 3'-3' 
inverted abasic moiety. 

261. The enzymatic nucleic acid molecule of claim 258, wherein said 3 '-cap is 3 '-3' 
inverted nucleotide. 

262. The enzymatic nucleic acid molecule of claim 258, wherein said enzymatic 
15 nucleic acid comprises phosphorothioate linkages in at least three of the 5' terminal 

nucleotides. 

263. The enzymatic nucleic acid molecule of claim 224, wherein said nucleic acid 
molecule comprises at least five ribose residues; a 2'-deoxy-2'-amino modification 
at position Nos. 4 and 7 of said enzymatic nucleic acid; at least ten 2'-0-alkyl 

20 modifications, and a 3'- cap structure. 

264. The enzymatic nucleic acid molecule of claim 263, wherein said 2'-0-alkyl 
modifications is selected from the group consisting of 2'-0-methyl and 2'-0-allyl. 

265. The enzymatic nucleic acid molecule of claim 263, wherein said 3 '-cap is 3 '-3 ' 
inverted abasic moiety. 

25 266. The enzymatic nucleic acid molecule of claim 263, wherein said 3'-cap is 3'-3' 
inverted nucleotide. 

267. The enzymatic nucleic acid molecule of claim 263, wherein said enzymatic 
nucleic acid comprises phosphorothioate linkages in at least three of the 5' terminal 
nucleotides. 
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268. The enzymatic nucleic acid molecule of claim 186, wherein said enzymatic 
nucleic acid molecule is capable of down-regulating the expression of protein 
kinase C alpha (PKC alpha) gene. 

269. A method for inhibiting expression of a PKC alpha gene in a cell, comprising 
5 the step of administering to said cell the enzymatic nucleic acid molecule of claim 

268 under conditions suitable for said inhibition. 

270. A method of cleaving a PKC alpha RNA molecule comprising, contacting the 
enzymatic nucleic acid molecule of claim 268 with said separate PKC alpha RNA 
molecule under conditions suitable for the cleavage of said PKC alpha RNA 

10 molecule. 

271. The method of claim 270, wherein said cleavage is carried out in the presence 
of a divalent cation. 

272. The method of claim 271, wherein said divalent cation is Mg2+. 

273. The enzymatic nucleic acid molecule of claim 268, wherein said enzymatic 
15 nucleic acid molecule has an endonuclease activity to cleave RNA derived from 

PKC alpha gene. 

274. The enzymatic nucleic acid molecule of claim 273, wherein said enzymatic 
nucleic acid molecule comprises sequences complementary to any of NCH 
substrate sequence of Table 63. 

20 275. The enzymatic nucleic acid molecule of claim 273 wherein said enzymatic 
nucleic acid molecule comprises any of the NCH ribozyme sequences shown in 
Table 63. 

276. The enzymatic nucleic acid molecule of claim 268, wherein said enzymatic 
nucleic acid molecule is used to treat cancer. 

25 277. The enzymatic nucleic acid molecule of claim 276, wherein said cancer is 
selected from the group consisting of lung, breast, colon, prostate, bladder, ovary, 
melanoma, and glioblastoma cancer. 

278. The enzymatic nucleic acid molecule of claim 268, wherein said enzymatic 
nucleic acid molecule is used to treat conditions associated with the level of PKC 
30 alpha gene. 
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279. The enzymatic nucleic acid molecule of claim 268, wherein said D and E 
independently has between 5 and 30 nucleotides complementary to the RNA. 

280. The enzymatic nucleic acid molecule of claim 268, wherein said D and E 
independently has between 7 and 12 nucleotides complementary to the RNA. 

5 28 1 . A mammalian cell including the enzymatic nucleic acid molecule of claim 268, 
wherein said mammalian cell is not a living human. 

282. The mammalian cell of claim 281, wherein said mammalian cell is a human 
cell. 

283. A pharmaceutical composition comprising the enzymatic nucleic acid molecule 
10 of claim 238. 

284. A pharmaceutical composition comprising the enzymatic nucleic acid molecule 
of claim 273. 

285. A method of treatment of a patient having a condition associated with the level 
of PKC alpha, wherein said patient is administered the enzymatic nucleic acid 

15 molecule of claim 268 under conditions suitable for said treatment. 

286. The method of claim 285, wherein said method is performed in conjunction 
with one or more other therapies. 

287. The enzymatic nucleic acid molecule of claim 286, wherein said enzymatic 
nucleic acid molecule is used in conjunction with one or more other therapies. 

20 288. An antisense nucleic acid molecule comprising sequence complementary to 
any of substrate sequence in Tables 13-23. 

289. The antisense nucleic acid molecule of claim 288, wherein said enzymatic 
nucleic acid is chemically synthesized. 

290. The antisense nucleic acid molecule of claim 288, wherein said antisense 
25 nucleic acid comprises at least one 2'-sugar modification. 

291. The antisense nucleic acid molecule of claim 288, wherein said antisense 
nucleic acid comprises at least one nucleic acid base modification. 

292. The antisense nucleic acid molecule of claim 288, wherein said antisense 
nucleic acid comprises at least one phosphate backbone modification. 
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293. A mammalian cell including the antisense nucleic acid molecule of claim 288, 
wherein said mammalian cell is not a living human. 

294. The mammalian cell of claim 293, wherein said mammalian cell is a human 
cell. 
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